
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).

Language: Jupyter Notebook · License: BSD-3-Clause · 1943 24 63

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Language: Python · License: Apache-2.0 · 1284 18 149

chinese_speech_pretrain

Chinese speech pretrained models

Language: Shell · 966 10 54

ru_transformers

Language: Python · License: Apache-2.0 · 774 31 44

PyTorch-Model-Compare

Compare neural networks by their feature similarity

Language: Python · License: MIT · 329 4 13

SpeechTransProgress

Tracking the progress in end-to-end speech translation

License: CC0-1.0 · 247 27 2

VBx

Variational Bayes HMM over x-vectors diarization

Language: Python · 245 21 62

libriheavy

Libriheavy: a 50,000-hour ASR corpus with punctuation, casing, and context

Language: Python · License: Apache-2.0 · 161 6 6

elpis

🙊 software for creating speech recognition models.

Language: Python · License: Apache-2.0 · 151 15 175

gtn

Automatic differentiation with weighted finite-state transducers.

Language: C++ · License: MIT · 115 9 12

sova-dataset

License: NOASSERTION · 114 14 6

DNC

Discriminative Neural Clustering for Speaker Diarisation

Language: Python · License: Apache-2.0 · 78 9 7

Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Language: Python · 63 8 21

TurkicASR

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

Language: Python · License: CC-BY-4.0 · 54 6 2

bk_clustering

Burj Khalifa Clustering method

Language: Jupyter Notebook · 52 6 6

speaker-anonymization

Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.

Language: Shell · License: GPL-3.0 · 49 5 4

VoicePAT

VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.

Language: Shell · License: Apache-2.0 · 46 5 5

speechcatcher

Language: Python · License: MIT · 36 4 6

llama

Inference code for LLaMA 2 models

Language: Jupyter Notebook · License: GPL-3.0 · 30 50

speechGLUE

SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.

Language: Python · License: NOASSERTION · 13 20

mt-bigscience

Evaluation results for Machine Translation within the BigScience project

Language: Jupyter Notebook · 11 20

zamia-speech

Open tools and data for cloudless automatic speech recognition

Language: Python · License: LGPL-3.0 · 10 70

akreal

Pavel Denisov's starred repositories

netron

tuning_playbook

sentence-transformers

ivy

espnet

silero-vad

mkchromecast

pytorch-meta

ecco