Donghwa-KIM

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Language: Python | License: MIT

detectron2-sagemaker

Port of Detectron2 to train/deploy model on Amazon Sagemaker

Language: Jupyter Notebook

diffusion

Efficient Diffusion for Image Retrieval

Language: Python | License: MIT

Donghwa-KIM.github.io

Language: HTML | License: MIT

fbrs_interactive_segmentation

[CVPR2020] f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation https://arxiv.org/abs/2001.10331

Language: Jupyter Notebook | License: MPL-2.0

FMix

Official implementation of 'Understanding and Enhancing Mixed Sample Data Augmentation'

Language: Jupyter Notebook | License: MIT

grad-cam-pytorch

PyTorch implementation of Grad-CAM, vanilla/guided backpropagation, deconvnet, and occlusion sensitivity maps

Language: Python | License: MIT

K-wav2vec

Language: Python | License: Apache-2.0

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Language: Python | License: MIT

matrix-factorization-in-python

Language: Jupyter Notebook

mfcc_pytorch

example

Language: Jupyter Notebook

modality-adaptation

To alleviate only one modality

Language: Python

pytorch-cosine-annealing-with-warmup

Language: Python

pytorch-nlp-template

BERT-based NLP template

Language: Python | License: MIT

PyTorch-Pretrained-ViT

Vision Transformer (ViT) in PyTorch

Language: Python

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0

self-supervised-speech-recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

Language: Python

Speech-Transformer

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Language: Python

study

AI/ML Study

Language: CSS | License: MIT

tensorflow

An Open Source Machine Learning Framework for Everyone

Language: C++ | License: Apache-2.0

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Language: Python | License: MIT

X-Punctuator

A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.

Language: Python

XDC

Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020)

Language: Python