JungwonChang (cjw414)

cjw414

Geek Repo

Company:Korea University

Location:Seoul, Korea

Github PK Tool:Github PK Tool

JungwonChang's starred repositories

transformers

๐Ÿค— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:133175Issues:1118Issues:15890

yt-dlp

A feature-rich command-line audio/video downloader

Language:PythonLicense:UnlicenseStargazers:84439Issues:502Issues:7850

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:69201Issues:575Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:47061Issues:305Issues:663

FFmpeg

Mirror of https://git.ffmpeg.org/ffmpeg.git

Language:CLicense:NOASSERTIONStargazers:45343Issues:1440Issues:0

datasets

๐Ÿค— The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:19127Issues:280Issues:2917

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonLicense:MITStargazers:17799Issues:156Issues:1274

peft

๐Ÿค— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:16059Issues:109Issues:1053

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8690Issues:133Issues:1089

Yi

A series of large language models trained from scratch by developers @01-ai

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7623Issues:106Issues:291

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:6084Issues:71Issues:990

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:4129Issues:50Issues:230

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:3556Issues:65Issues:103

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2229Issues:44Issues:397

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language:PythonLicense:MITStargazers:2187Issues:70Issues:208

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonLicense:MITStargazers:1829Issues:21Issues:181

KoAlpaca

KoAlpaca: ํ•œ๊ตญ์–ด ๋ช…๋ น์–ด๋ฅผ ์ดํ•ดํ•˜๋Š” ์˜คํ”ˆ์†Œ์Šค ์–ธ์–ด๋ชจ๋ธ

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1537Issues:29Issues:99

awesome-whisper

๐Ÿ”Š Awesome list for Whisper โ€” an open-source AI-powered speech recognition system developed by OpenAI

License:CC0-1.0Stargazers:1218Issues:22Issues:0

voca

This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.

sox

SoX, Swiss Army knife of sound processing

Language:CLicense:NOASSERTIONStargazers:698Issues:28Issues:0

whispering

Streaming transcriber with whisper

Language:PythonLicense:MITStargazers:683Issues:19Issues:41

MonocularTotalCapture

Code for CVPR19 paper "Monocular Total Capture: Posing Face, Body and Hands in the Wild"

ICT-FaceKit

ICT's Vision and Graphics Lab's morphable face model and toolkit

Language:PythonLicense:MITStargazers:644Issues:35Issues:14

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language:PythonLicense:NOASSERTIONStargazers:526Issues:32Issues:28

FaceMeshFaceGeometry

FaceMeshFaceGeometry for FaceMesh

Language:JavaScriptLicense:MITStargazers:401Issues:12Issues:9

community-events

Place where folks can contribute to ๐Ÿค— community events

Language:Jupyter NotebookStargazers:397Issues:52Issues:32

open-korean-instructions

์–ธ์–ด๋ชจ๋ธ์„ ํ•™์Šตํ•˜๊ธฐ ์œ„ํ•œ ๊ณต๊ฐœ ํ•œ๊ตญ์–ด instruction dataset๋“ค์„ ๋ชจ์•„๋‘์—ˆ์Šต๋‹ˆ๋‹ค.

Language:PythonStargazers:343Issues:5Issues:0

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonLicense:Apache-2.0Stargazers:319Issues:13Issues:29
Language:PythonLicense:Apache-2.0Stargazers:211Issues:10Issues:8

Knowledge-Distillation-Toolkit

:no_entry: [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.

Language:PythonLicense:Apache-2.0Stargazers:136Issues:14Issues:7