kaen2891

June-Woo Kim's repositories

profile

Language:HTML000

bts

(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"

500

stethoscope-guided_supervised_contrastive_learning

(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory Sound Classification"

Language:Python1000

military_audio_dataset

Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance"

Language:Python100

Llama-2

All the projects related to Llama

000

adversarial_fine-tuning_using_generated_respiratory_sound

(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance"

Language:Python1300

s3prl2

modifying s3prl

Language:PythonApache-2.0300

SupContrast

PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)

Language:PythonBSD-2-Clause000

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonMIT000

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language:PythonNOASSERTION000

SMART-G2P

Language:PythonGPL-3.0000

etri_multimodal

Language:Python000

a1003

Language:Jupyter Notebook000

spec_augment

🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Language:Jupyter NotebookMIT100

al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

MIT000

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

MIT000

misp2022_baseline

000

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

BSD-3-Clause000

korean_speech_data_preprocessing

preprocessing of AIHub Korean speech dataset

Language:Python000

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

BSD-3-Clause000

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

MIT000

GenSCL

Official Pytorch implementation of "Generalized Supervised Contrastive Learning Framework"

MIT000

FilterAugSED

MIT000

s3prl_modified

s3prl modeification

Language:PythonMIT000

improved_spoken_language_representation

Official code of Improved Spoken Language Representation for Intent Understanding in a Task-Oriented Dialogue System

Language:PythonMIT100

nn_basic_2021

codes for neural network basic course

Language:Jupyter Notebook000

epd_for_vad

find end point with statistical voice activity detection model

Language:Python100

self-supervised-speech-recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

000

kenlm

KenLM: Faster and Smaller Language Model Queries

NOASSERTION000

lva

LG AI Intermediate Courses (Computer Vision)

000