auspicious3000

GNS-PyTorch

A PyTorch implementation of the “Graph Network-based Simulators” (GNS) model from DeepMind for simulating particle-based dynamics with graph networks.

Language: Python | License: MIT | 1200

contentvec

speech self-supervised representations

Language: Python | License: MIT | 42500

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language: Python | License: MIT | 306700

PaddleSpeech

Easy-to-use speech toolkit including self-supervised learning models, SOTA/streaming ASR with punctuation, streaming TTS with a text frontend, a speaker verification system, end-to-end speech translation, and keyword spotting. Won the NAACL 2022 Best Demo Award.

Language: Python | License: Apache-2.0 | 1037000

UnsupTTS

Language: Shell | License: MIT | 3500

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech).

Language: Python | License: Apache-2.0 | 1041200

Kaizhi Qian's starred repositories

video-physics-sound-diffusion

stable-audio-tools

speechbrain

urhythmic

tortoise-tts

dotfiles

SimCLR

textlesslib

Diffusion-LM

ec-nl

VID-Sentence

WSSTG

guanyingc

zfchenUnique

alfred

DCL-Release

Cops-Ref

EvalAI

property_learner_predictor

compositional_physics_learner

executor_comphy

deepbeam

GNS-PyTorch

STAR_Benchmark

contentvec

WavPrompt

silero-vad

PaddleSpeech

UnsupTTS

NeMo