Beast code in Giters

Abhigyan Raman's starred repositories

uWebSockets

Simple, secure & standards compliant web server for the most demanding of applications

Language: C++ | Apache-2.0 | 17171 406 504

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language: Python | CC-BY-SA-4.0 | 10624 81 36

manticoresearch

Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon

Language: C++ | GPL-3.0 | 8809 108 1746

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | MIT | 4616 80 187

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | MIT | 4408 58 150

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | MIT | 4304 39 152

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language: Jupyter Notebook | MIT | 2563 32 56

RAG-Survey

1667 31 15

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Language: Python | Apache-2.0 | 1348 20 156

ffmpeg-normalize

Audio Normalization for Python/ffmpeg

Language: Python | MIT | 1232 28 208

AVeryComfyNerd

ComfyUI related stuff and things

MIT | 1176 410

lhotse

Tools for handling speech data in machine learning projects.

Language: Python | Apache-2.0 | 921 44 407

string2string

String-to-String Algorithms for Natural Language Processing

Language: Jupyter Notebook | MIT | 522 9 4

CMGAN

Conformer-based Metric GAN for speech enhancement

Language: Python | MIT | 295 9 45

gecko

Gecko - A Tool for Effective Annotation of Human Conversations

Language: JavaScript | BSD-3-Clause | 272 16 30

nanodl

A Jax-based library for designing and training transformer models from scratch.

Language: Python | MIT | 267 9 9

pheme

Language: Python | CC-BY-4.0 | 240 11 18

speech_course

YSDA course in Speech Processing.

Language: Jupyter Notebook | MIT | 188 23 3

speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Language: Python | MIT | 182 14 10

speechlib

speechlib is a library that performs speaker diarization, transcription, and speaker recognition on an audio file to create transcripts with actual speaker names.

Language: Python | MIT | 123 3 13