dori2063

Youngdo Ahn's repositories

SER_Augmentation_CycleGAN

speech emotion recognition, augmentation

Language:Python4 1 2

advanced-deep-learning-2019-fall

Language:Jupyter Notebook010

attention-cnn

Source code for "On the Relationship between Self-Attention and Convolutional Layers"

Language:Python000

attentive-modality-hopping-for-SER

TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition"

Language:PythonMIT000

CloserLookFewShot

source code to ICLR'19, 'A Closer Look at Few-shot Classification'

Language:PythonNOASSERTION000

CrossDomainFewShot

Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation (ICLR 2020 spotlight)

Language:Python000

DomainBed

DomainBed is a suite to test domain generalization algorithms

Language:PythonMIT000

emotion

Tools for testing emotion recognition methods.

Language:PythonMIT000

meta-weight-net

NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for noisy labels).

Language:PythonMIT000

vcc20_baseline_cyclevae

Voice Conversion Challenge 2020 CycleVAE baseline system

Language:PythonMIT000

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonMIT000

cleanlab

Finding label errors in datasets and learning with noisy labels.

NOASSERTION000

DB-AIAT

The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"

Language:Python000

DeepEmbeddingModel_ZSL

Tensorflow code for CVPR 2017 paper: Learning a Deep Embedding Model for Zero-Shot Learning

000

DeepEMD

Code for paper "DeepEMD: Few-Shot Image Classification with Differentiable Earth Mover's Distance and Structured Classifiers", CVPR2020

MIT000

espnet

End-to-End Speech Processing Toolkit

Language:ShellApache-2.0000

FLUDA

Separate block diagrams for training and test phases.

010

im2wav

Implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation

Language:PythonMIT000

jukemir

Perform transfer learning for MIR using Jukebox!

Language:Shell000

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

MIT000

selectivenet

code for the ICML paper "SelectiveNet - A Deep Neural Network with an Integrated Reject Option"

000

SkipVQVC

An implementation of SkipVQVC with various settings.

000

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language:Python000

TVLT

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022)

MIT000

Universal-Domain-Adaptation

Code release for Universal Domain Adaptation(CVPR 2019)

000

Baseline pipeline LiFE to reproduce the extracted linguistic features from the ComParE2020_USOMS-e challenge. We utilise and provide contextual word embeddings using a frozen (not fine-tuned) German Bidirectional Language Transformer (Bert).

Apache-2.0000

youtube-8m

Starter code for working with the YouTube-8M dataset.

Apache-2.0000

dori2063

Youngdo Ahn's repositories

SER_Augmentation_CycleGAN

advanced-deep-learning-2019-fall

attention-cnn

attentive-modality-hopping-for-SER

CloserLookFewShot

CrossDomainFewShot

DomainBed

emotion

meta-weight-net

vcc20_baseline_cyclevae

audiolm-pytorch

cleanlab

DB-AIAT

DeepEmbeddingModel_ZSL

DeepEMD

espnet

few-shot-gnn

FLUDA

im2wav

jukemir

MCD_DA

MQTTS

nara_wpe

selectivenet

SkipVQVC

Speech-Transformer

TVLT

Universal-Domain-Adaptation

USOMS-e_LiFE

youtube-8m