김동화's repositories
audiotext-transformer
cross-modal model between audio(MFCC) and text(KoBERT)
emotion-datasets
Popular emotion-datasets
aws-ai-ml-workshop-kr
A collection of localized (Korean) AWS AI/ML workshop materials for hands-on labs.
axe-recruit
axe 채용 task
CGD
A PyTorch implementation of CGD based on the paper "Combination of Multiple Global Descriptors for Image Retrieval"
Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
detectron2-sagemaker
Port of Detectron2 to train/deploy model on Amazon Sagemaker
fbrs_interactive_segmentation
[CVPR2020] f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation https://arxiv.org/abs/2001.10331
grad-cam-pytorch
PyTorch implementation of Grad-CAM, vanilla/guided backpropagation, deconvnet, and occlusion sensitivity maps
maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
mfcc_pytorch
example
modality-adaptation
To alleviate only one modality
pytorch-nlp-template
BERT-based nlp template
PyTorch-Pretrained-ViT
Vision Transformer (ViT) in PyTorch
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
self-supervised-speech-recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
tensorflow
An Open Source Machine Learning Framework for Everyone
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.