Youngdo Ahn (dori2063)

dori2063

Geek Repo

Company:GIST

Location:Gwangju, Republic of Korea

Github PK Tool:Github PK Tool

Youngdo Ahn's repositories

SER_Augmentation_CycleGAN

speech emotion recognition, augmentation

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

attention-cnn

Source code for "On the Relationship between Self-Attention and Convolutional Layers"

Language:PythonStargazers:0Issues:0Issues:0

attentive-modality-hopping-for-SER

TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CloserLookFewShot

source code to ICLR'19, 'A Closer Look at Few-shot Classification'

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

CrossDomainFewShot

Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation (ICLR 2020 spotlight)

Language:PythonStargazers:0Issues:0Issues:0

DomainBed

DomainBed is a suite to test domain generalization algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

emotion

Tools for testing emotion recognition methods.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

meta-weight-net

NeurIPS'19: Meta-Weight-Net: Learning an Explicit Mapping For Sample Weighting (Pytorch implementation for noisy labels).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

vcc20_baseline_cyclevae

Voice Conversion Challenge 2020 CycleVAE baseline system

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

cleanlab

Finding label errors in datasets and learning with noisy labels.

License:NOASSERTIONStargazers:0Issues:0Issues:0

DB-AIAT

The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"

Language:PythonStargazers:0Issues:0Issues:0

DeepEmbeddingModel_ZSL

Tensorflow code for CVPR 2017 paper: Learning a Deep Embedding Model for Zero-Shot Learning

Stargazers:0Issues:0Issues:0

DeepEMD

Code for paper "DeepEMD: Few-Shot Image Classification with Differentiable Earth Mover's Distance and Structured Classifiers", CVPR2020

License:MITStargazers:0Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:ShellLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

FLUDA

Separate block diagrams for training and test phases.

Stargazers:0Issues:1Issues:0

im2wav

Implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

jukemir

Perform transfer learning for MIR using Jukebox!

Language:ShellStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

nara_wpe

Different implementations of "Weighted Prediction Error" for speech dereverberation

License:MITStargazers:0Issues:0Issues:0

selectivenet

code for the ICML paper "SelectiveNet - A Deep Neural Network with an Integrated Reject Option"

Stargazers:0Issues:0Issues:0

SkipVQVC

An implementation of SkipVQVC with various settings.

Stargazers:0Issues:0Issues:0

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language:PythonStargazers:0Issues:0Issues:0

TVLT

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022)

License:MITStargazers:0Issues:0Issues:0

Universal-Domain-Adaptation

Code release for Universal Domain Adaptation(CVPR 2019)

Stargazers:0Issues:0Issues:0

USOMS-e_LiFE

Baseline pipeline LiFE to reproduce the extracted linguistic features from the ComParE2020_USOMS-e challenge. We utilise and provide contextual word embeddings using a frozen (not fine-tuned) German Bidirectional Language Transformer (Bert).

License:Apache-2.0Stargazers:0Issues:0Issues:0

youtube-8m

Starter code for working with the YouTube-8M dataset.

License:Apache-2.0Stargazers:0Issues:0Issues:0