AltasK


AltasK's repositories

100-nlp-papers

100 Must-Read NLP Papers


Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT

awesome-speech-recognition-speech-synthesis-papers

Speech synthesis, voice conversion, self-supervised learning, music generation, automatic speech recognition, speaker verification, language modeling

License: MIT

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT

CNTK

Computational Network Toolkit (CNTK)

Language: C++ | License: NOASSERTION

draw_convnet

Language: Python

GitStart

A starter test for Git.


k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Language: Cuda | License: NOASSERTION

kaldi

This is now the official location of the Kaldi project.

Language: Shell | License: NOASSERTION

merlin

This is now the official location of the Merlin project.

Language: Python | License: Apache-2.0

MTBook

Machine Translation: Statistical Modeling and Deep Learning Methods (《机器翻译：统计建模与深度学习方法》), by Xiao Tong and Zhu Jingbo

Language: TeX

pansori

Tools for ASR Corpus Generation from Online Video

License: MIT

PlotNeuralNet

LaTeX code for drawing neural network diagrams

Language: TeX | License: MIT

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

License: MIT

Qix

Machine Learning, Deep Learning, PostgreSQL, Distributed Systems, Node.js, Golang

License: NOASSERTION

Spoon-Knife

This repo is for demonstration purposes only.

Language: HTML

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Language: Python | License: MIT

Tacotron-2

DeepMind's Tacotron-2 TensorFlow implementation

Language: Python | License: MIT

tensorflow

Computation using data flow graphs for scalable machine learning

Language: C++ | License: Apache-2.0

Test

A test of Git functionality in the project.

License: Apache-2.0

wenet

Production-first and production-ready end-to-end speech recognition toolkit

License: Apache-2.0

XTTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License: MPL-2.0