AltasK's repositories
100-nlp-papers
100 Must-Read NLP Papers
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
awesome-speech-recognition-speech-synthesis-papers
Speech synthesis, voice conversion, self-supervised learning, music generation,Automatic Speech Recognition, Speaker Verification, Speech Synthesis, Language Modeling
bark
🔊 Text-Prompted Generative Audio Model
CNTK
Computational Network Toolkit (CNTK)
GitStart
A start test for git.
k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
kaldi
This is now the official location of the Kaldi project.
merlin
This is now the official location of the Merlin project.
MTBook
《机器翻译:统计建模与深度学习方法》肖桐 朱靖波 著 - Machine Translation: Statistical Modeling and Deep Learning Methods
pansori
Tools for ASR Corpus Generation from Online Video
PlotNeuralNet
Latex code for making neural networks diagrams
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding
Qix
Machine Learning、Deep Learning、PostgreSQL、Distributed System、Node.Js、Golang
Spoon-Knife
This repo is for demonstration purposes only.
tacotron
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
tensorflow
Computation using data flow graphs for scalable machine learning
Test
To test git function in the project.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
XTTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production