RamanHacks

followers

following

stars

IIT Delhi

Abhigyan Raman's starred repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Language:Markdown303123 5414 680

open-webui

User-friendly WebUI for AI (Formerly Ollama WebUI)

Language:SvelteMIT41303 203 2286

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.037749 396 67

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonMIT17478 143 745

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLMIT10839 268 47

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookNOASSERTION10807 140 350

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language:PythonCC-BY-SA-4.010664 82 36

insanely-fast-whisper

Language:Jupyter NotebookApache-2.07516 65 189

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.07294 63 150

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6589 65 80

silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language:Jupyter NotebookNOASSERTION4900 84 129

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language:Jupyter NotebookMIT2598 33 57

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

RealtimeTTS

Converts text to speech in realtime

Language:Python1793 22 98

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookMIT1544 45 255

AVeryComfyNerd

ComfyUI related stuff and things

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonMIT1171 56 52

speech-synthesis-paper

List of speech synthesis papers.

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonMIT771 33 46

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonMIT483 8 6

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language:Jupyter NotebookMIT473 12 15

wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Language:PythonApache-2.0368 13 54

deep-image-matching

Multiview matching with deep-learning and hand-crafted local features for COLMAP and other SfM software. Supports high-resolution formats and images with rotations. Both CLI and GUI are supported.

Language:PythonBSD-3-Clause338 12 39

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:Python301 15 16

xtts-streaming-server

Language:PythonMPL-2.0281 10 14

gecko

Gecko - A Tool for Effective Annotation of Human Conversations

Language:JavaScriptBSD-3-Clause274 16 30

guidelines

C++ Default Guidelines

MIT140 6 8

PromptingWhisper

Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Language:Python132 4 8

gcs-fuse-csi-driver

The Google Cloud Storage FUSE Container Storage Interface (CSI) Plugin.

Language:GoApache-2.0115 18 82

redis-feast-gcp

A demo of Redis Enterprise as the Online Feature Store deployed on GCP with Feast and NVIDIA Triton Inference Server.

Language:Jupyter NotebookMIT15 5 13