Abhigyan Raman (RamanHacks)

Company: IIT Delhi

Abhigyan Raman's starred repositories

uWebSockets

Simple, secure & standards compliant web server for the most demanding of applications

Language: C++ | License: Apache-2.0 | Stargazers: 17171 | Issues: 406 | Issues: 504

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 10624 | Issues: 81 | Issues: 36

manticoresearch

Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon

Language: C++ | License: GPL-3.0 | Stargazers: 8809 | Issues: 108 | Issues: 1746

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | Stargazers: 4616 | Issues: 80 | Issues: 187

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4408 | Issues: 58 | Issues: 150

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Language: Python | License: MIT | Stargazers: 4304 | Issues: 39 | Issues: 152

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language: Jupyter Notebook | License: MIT | Stargazers: 2563 | Issues: 32 | Issues: 56

IMS-Toucan

Multilingual and Controllable Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart.

Language: Python | License: Apache-2.0 | Stargazers: 1348 | Issues: 20 | Issues: 156

ffmpeg-normalize

Audio Normalization for Python/ffmpeg

Language: Python | License: MIT | Stargazers: 1232 | Issues: 28 | Issues: 208

AVeryComfyNerd

ComfyUI related stuff and things

License: MIT | Stargazers: 1176 | Issues: 41 | Issues: 0

lhotse

Tools for handling speech data in machine learning projects.

Language: Python | License: Apache-2.0 | Stargazers: 921 | Issues: 44 | Issues: 407

string2string

String-to-String Algorithms for Natural Language Processing

Language: Jupyter Notebook | License: MIT | Stargazers: 522 | Issues: 9 | Issues: 4

CMGAN

Conformer-based Metric GAN for speech enhancement

Language: Python | License: MIT | Stargazers: 295 | Issues: 9 | Issues: 45

gecko

Gecko - A Tool for Effective Annotation of Human Conversations

Language: JavaScript | License: BSD-3-Clause | Stargazers: 272 | Issues: 16 | Issues: 30

nanodl

A Jax-based library for designing and training transformer models from scratch.

Language: Python | License: MIT | Stargazers: 267 | Issues: 9 | Issues: 9

Language: Python | License: CC-BY-4.0 | Stargazers: 240 | Issues: 11 | Issues: 18

speech_course

YSDA course in Speech Processing.

Language: Jupyter Notebook | License: MIT | Stargazers: 188 | Issues: 23 | Issues: 3

speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Language: Python | License: MIT | Stargazers: 182 | Issues: 14 | Issues: 10

speechlib

speechlib is a library that performs speaker diarization, transcription, and speaker recognition on an audio file to create transcripts with actual speaker names.

Language: Python | License: MIT | Stargazers: 123 | Issues: 3 | Issues: 13

Language: Python | License: Apache-2.0 | Stargazers: 73 | Issues: 5 | Issues: 3

lookwhostalking

Look Who’s Talking: Active Speaker Detection in the Wild

Language: Python | License: MIT | Stargazers: 71 | Issues: 10 | Issues: 7

Language: Python | License: GPL-3.0 | Stargazers: 60 | Issues: 3 | Issues: 2

jenny-tts-dataset

A high-quality, varied ~30hr voice dataset suitable for training a TTS model

audio-degradation-toolbox

Easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Language: Python | License: GPL-2.0 | Stargazers: 46 | Issues: 2 | Issues: 8

redis-feast-gcp

A demo of Redis Enterprise as the Online Feature Store deployed on GCP with Feast and NVIDIA Triton Inference Server.

Language: Jupyter Notebook | License: MIT | Stargazers: 15 | Issues: 5 | Issues: 13

Triton-ASR-Client

ASR client for Triton ASR Service

Language: Python | License: BSD-3-Clause | Stargazers: 15 | Issues: 2 | Issues: 4

Language: Jupyter Notebook | Stargazers: 8 | Issues: 2 | Issues: 0

uwebsockets

Highly optimized C++ WebSocket server

Language: JavaScript | Stargazers: 7 | Issues: 1 | Issues: 0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Jupyter Notebook | License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0