RohanTondulkar

Rohan Tondulkar's starred repositories

openrouter-runner

Inference engine powering open source models on OpenRouter

Language: Python | License: MIT | 54700

rank_llm

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Language: Python | License: Apache-2.0 | 32000

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language: Python | License: Apache-2.0 | 523700

HierSpeechpp

The official implementation of HierSpeech++

Language: Python | License: MIT | 117500

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | 485100

marker

Convert PDF to markdown quickly with high accuracy

Language: Python | License: GPL-3.0 | 1704000

insanely-fast-whisper

Language: Jupyter Notebook | License: Apache-2.0 | 758600

vectorflow

VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.

Language: Python | License: Apache-2.0 | 67000

awesome-foundation-and-multimodal-models

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Language: Python | 56800

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Language: Python | License: Apache-2.0 | 212900

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | 3466800

LibreTranslate

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to set up.

Language: Python | License: AGPL-3.0 | 917800

Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python | License: Apache-2.0 | 290900

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | 5242500

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language: Jupyter Notebook | License: NOASSERTION | 310800

CycleGAN-VC2

Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC2

Language: Python | License: MIT | 52700

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language: Python | License: Apache-2.0 | 1102100

gpt-researcher

LLM-based autonomous agent that conducts in-depth web research on any given topic

Language: Python | License: Apache-2.0 | 1449800

gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Language: Python | License: Apache-2.0 | 1135100

filco

[Preprint] Learning to Filter Context for Retrieval-Augmented Generation

Language: Python | License: CC-BY-SA-4.0 | 18300

insanely-fast-whisper

Incredibly fast Whisper-large-v3

Language: Jupyter Notebook | License: Apache-2.0 | 184200

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | 773200

Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Language: Jupyter Notebook | License: NOASSERTION | 295200

deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Language: Python | License: MPL-2.0 | 809400

ModuleFormer

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.

Language: Python | License: Apache-2.0 | 21700

optimum-nvidia

ml4a

MiniGPT-4

Wav2Lip

LLMLingua