hyejwon's repositories
DCASE2020-Task6-PKU
A PyTorch implementation of DCASE2020 Task6 by the PKU team: Automated Audio Captioning With Temporal Attention
MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
audio_captioning
2021_dcase_task6
annotated_deep_learning_paper_implementations
🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
audioset-vggish-tensorflow-to-pytorch
Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.
binary_mask_from_json
Making binary mask images from JSON annotation
dcase_2020_T6
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6
Deep-Learning-Papers-Reading-Roadmap
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!
ECGTransformer
Source code of BIBM 2019 Paper "Fusing Transformer Model with Temporal Features for ECG Heartbeat Classification"
HEAR2021_EfficientLatent
Submission to the HEAR2021 Challenge
mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
models
Models and examples built with TensorFlow
survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
triplet_loss_kws
Learning Efficient Representations for Keyword Spotting with Triplet Loss
tutorials
PyTorch tutorials.