Pavel Denisov's starred repositories

netron

Visualizer for neural network, deep learning and machine learning models

Language: JavaScript | License: MIT | Stargazers: 26979 | Issues: 299 | Issues: 1102

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language: Python | License: Apache-2.0 | Stargazers: 14524 | Issues: 134 | Issues: 2070

ivy

The Unified AI Framework

Language: Python | License: NOASSERTION | Stargazers: 14026 | Issues: 70 | Issues: 16874

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8159 | Issues: 179 | Issues: 2335

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language: Python | License: MIT | Stargazers: 3474 | Issues: 44 | Issues: 213

mkchromecast

Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices

Language: Python | License: NOASSERTION | Stargazers: 2193 | Issues: 48 | Issues: 374

pytorch-meta

A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch

Language: Python | License: MIT | Stargazers: 1960 | Issues: 44 | Issues: 141

ecco

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 1943 | Issues: 24 | Issues: 63

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Language: Python | License: Apache-2.0 | Stargazers: 1284 | Issues: 18 | Issues: 149

chinese_speech_pretrain

Chinese speech pretrained models

Language: Python | License: Apache-2.0 | Stargazers: 774 | Issues: 31 | Issues: 44

PyTorch-Model-Compare

Compare neural networks by their feature similarity

Language: Python | License: MIT | Stargazers: 329 | Issues: 4 | Issues: 13

SpeechTransProgress

Tracking the progress in end-to-end speech translation

VBx

Variational Bayes HMM over x-vectors diarization

libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Language: Python | License: Apache-2.0 | Stargazers: 161 | Issues: 6 | Issues: 6

elpis

🙊 software for creating speech recognition models.

Language: Python | License: Apache-2.0 | Stargazers: 151 | Issues: 15 | Issues: 175

gtn

Automatic differentiation with weighted finite-state transducers.

Language: C++ | License: MIT | Stargazers: 115 | Issues: 9 | Issues: 12

DNC

Discriminative Neural Clustering for Speaker Diarisation

Language: Python | License: Apache-2.0 | Stargazers: 78 | Issues: 9 | Issues: 7

Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

TurkicASR

A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

Language: Python | License: CC-BY-4.0 | Stargazers: 54 | Issues: 6 | Issues: 2

bk_clustering

Burj Khalifa Clustering method

Language: Jupyter Notebook | Stargazers: 52 | Issues: 6 | Issues: 6

speaker-anonymization

Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.

Language: Shell | License: GPL-3.0 | Stargazers: 49 | Issues: 5 | Issues: 4

VoicePAT

VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.

Language: Shell | License: Apache-2.0 | Stargazers: 46 | Issues: 5 | Issues: 5

llama

Inference code for LLaMA 2 models

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 30 | Issues: 5 | Issues: 0

speechGLUE

SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.

Language: Python | License: NOASSERTION | Stargazers: 13 | Issues: 2 | Issues: 0

mt-bigscience

Evaluation results for Machine Translation within the BigScience project

Language: Jupyter Notebook | Stargazers: 11 | Issues: 2 | Issues: 0

zamia-speech

Open tools and data for cloudless automatic speech recognition

Language: Python | License: LGPL-3.0 | Stargazers: 10 | Issues: 7 | Issues: 0