JungwonChang's starred repositories
transformers
๐ค Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
speechbrain
A PyTorch-based Speech Toolkit
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Deep3DFaceReconstruction
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
awesome-whisper
๐ Awesome list for Whisper โ an open-source AI-powered speech recognition system developed by OpenAI
whispering
Streaming transcriber with whisper
MonocularTotalCapture
Code for CVPR19 paper "Monocular Total Capture: Posing Face, Body and Hands in the Wild"
ICT-FaceKit
ICT's Vision and Graphics Lab's morphable face model and toolkit
FaceMeshFaceGeometry
FaceMeshFaceGeometry for FaceMesh
community-events
Place where folks can contribute to ๐ค community events
open-korean-instructions
์ธ์ด๋ชจ๋ธ์ ํ์ตํ๊ธฐ ์ํ ๊ณต๊ฐ ํ๊ตญ์ด instruction dataset๋ค์ ๋ชจ์๋์์ต๋๋ค.
ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
Knowledge-Distillation-Toolkit
:no_entry: [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.