Feiteng's repositories
naturalspeech3_facodec
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
TTS-TextAnalyzer
TTS Text Analyzer
Aligner-SUPERB
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
Optimizers
TensorFlow Optimizers
audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
SpeechTokenizer
This is the code for SpeechTokenizer, presented in the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are presented on
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
ctc-forced-aligner
Text to speech alignment using CTC forced alignment
fairseq2
FAIR Sequence Modeling Toolkit 2
google-research
Google AI Research
kaldialign
Python wrappers for Kaldi's Levenshtein distance and alignment code.
lifeiteng.github.com
demos page
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.