Abhigyan Raman (RamanHacks)

RamanHacks

Geek Repo

Company:IIT Delhi

Github PK Tool:Github PK Tool

Abhigyan Raman's starred repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

open-webui

User-friendly WebUI for AI (Formerly Ollama WebUI)

Language:SvelteLicense:MITStargazers:41303Issues:203Issues:2286

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:37749Issues:396Issues:67

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:17478Issues:143Issues:745

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLLicense:MITStargazers:10839Issues:268Issues:47

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10807Issues:140Issues:350

python-mastery

Advanced Python Mastery (course by @dabeaz)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10664Issues:82Issues:36
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7516Issues:65Issues:189

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:7294Issues:63Issues:150

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6589Issues:65Issues:80

silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:4900Issues:84Issues:129

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language:Jupyter NotebookLicense:MITStargazers:2598Issues:33Issues:57

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

RealtimeTTS

Converts text to speech in realtime

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookLicense:MITStargazers:1544Issues:45Issues:255

AVeryComfyNerd

ComfyUI related stuff and things

License:MITStargazers:1195Issues:41Issues:0

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:MITStargazers:1171Issues:56Issues:52

speech-synthesis-paper

List of speech synthesis papers.

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonLicense:MITStargazers:771Issues:33Issues:46

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:483Issues:8Issues:6

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language:Jupyter NotebookLicense:MITStargazers:473Issues:12Issues:15

wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:368Issues:13Issues:54

deep-image-matching

Multiview matching with deep-learning and hand-crafted local features for COLMAP and other SfM software. Supports high-resolution formats and images with rotations. Both CLI and GUI are supported.

Language:PythonLicense:BSD-3-ClauseStargazers:338Issues:12Issues:39

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

gecko

Gecko - A Tool for Effective Annotation of Human Conversations

Language:JavaScriptLicense:BSD-3-ClauseStargazers:274Issues:16Issues:30

guidelines

C++ Default Guidelines

PromptingWhisper

Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

gcs-fuse-csi-driver

The Google Cloud Storage FUSE Container Storage Interface (CSI) Plugin.

Language:GoLicense:Apache-2.0Stargazers:115Issues:18Issues:82

redis-feast-gcp

A demo of Redis Enterprise as the Online Feature Store deployed on GCP with Feast and NVIDIA Triton Inference Server.

Language:Jupyter NotebookLicense:MITStargazers:15Issues:5Issues:13