Kaizhi Qian's starred repositories

stable-audio-tools

Generative models for conditional audio generation

Language: Python, License: MIT, Stargazers: 1903, Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python, License: Apache-2.0, Stargazers: 8082, Issues: 0

urhythmic

Unsupervised Rhythm Modeling for Voice Conversion

Language: Python, License: MIT, Stargazers: 75, Issues: 0

tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 12170, Issues: 0

dotfiles

Personal dotfiles

Language: Emacs Lisp, Stargazers: 1, Issues: 0

SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.

Language: Python, License: MIT, Stargazers: 733, Issues: 0

textlesslib

Library for Textless Spoken Language Processing

Language: Python, License: MIT, Stargazers: 507, Issues: 0

Diffusion-LM

Diffusion-LM

Language: Python, License: Apache-2.0, Stargazers: 1000, Issues: 0

ec-nl

[ICLR 2022] Linking Emergent and Natural Languages via Corpus Transfer

Language: Python, License: MIT, Stargazers: 29, Issues: 0

VID-Sentence

This repository provides the dataset introduced by our WSSTG paper

Language: JavaScript, License: NOASSERTION, Stargazers: 12, Issues: 0

WSSTG

This repository contains the main baselines introduced in WSSTG (ACL 2019).

Language: Python, License: NOASSERTION, Stargazers: 55, Issues: 0

zfchenUnique

My personal repository

Stargazers: 2, Issues: 0

alfred

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

License: MIT, Stargazers: 2, Issues: 0

DCL-Release

This repo contains the PyTorch implementation of the Dynamic Concept Learner (accepted by ICLR 2021).

Language: Python, License: MIT, Stargazers: 36, Issues: 0

Cops-Ref

Accepted by CVPR 2020.

Stargazers: 25, Issues: 0

EvalAI

Evaluating state of the art in AI

License: NOASSERTION, Stargazers: 2, Issues: 0

deepbeam

Deep learning based Speech Beamforming

Language: Jupyter Notebook, Stargazers: 61, Issues: 0

GNS-PyTorch

A PyTorch implementation of the “Graph Network-based Simulators” (GNS) model from DeepMind for simulating particle-based dynamics using graph networks.

Language: Python, License: MIT, Stargazers: 12, Issues: 0

contentvec

speech self-supervised representations

Language: Python, License: MIT, Stargazers: 425, Issues: 0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language: Python, License: MIT, Stargazers: 3067, Issues: 0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language: Python, License: Apache-2.0, Stargazers: 10370, Issues: 0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language: Python, License: Apache-2.0, Stargazers: 10412, Issues: 0