Rohan Tondulkar (RohanTondulkar)

RohanTondulkar

Geek Repo

Company:@TypesetIO

Location:Bangalore

Github PK Tool:Github PK Tool

Rohan Tondulkar's starred repositories

openrouter-runner

Inference engine powering open source models on OpenRouter

Language:PythonLicense:MITStargazers:547Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:889Issues:0Issues:0

rank_llm

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Language:PythonLicense:Apache-2.0Stargazers:320Issues:0Issues:0

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language:PythonLicense:Apache-2.0Stargazers:5237Issues:0Issues:0

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:MITStargazers:1175Issues:0Issues:0

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:4851Issues:0Issues:0

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:17040Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7586Issues:0Issues:0

vectorflow

VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.

Language:PythonLicense:Apache-2.0Stargazers:670Issues:0Issues:0

awesome-foundation-and-multimodal-models

👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]

Language:PythonStargazers:568Issues:0Issues:0

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Language:PythonLicense:Apache-2.0Stargazers:2129Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:34668Issues:0Issues:0

LibreTranslate

Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.

Language:PythonLicense:AGPL-3.0Stargazers:9178Issues:0Issues:0

Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2909Issues:0Issues:0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:52425Issues:0Issues:0

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:3108Issues:0Issues:0

CycleGAN-VC2

Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2

Language:PythonLicense:MITStargazers:527Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:11021Issues:0Issues:0

gpt-researcher

LLM based autonomous agent that conducts in-depth web research on any given topic

Language:PythonLicense:Apache-2.0Stargazers:14498Issues:0Issues:0

gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Language:PythonLicense:Apache-2.0Stargazers:11351Issues:0Issues:0

filco

[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton

Language:PythonLicense:CC-BY-SA-4.0Stargazers:183Issues:0Issues:0

insanely-fast-whisper

Incredibly fast Whisper-large-v3

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1842Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:7732Issues:0Issues:0

Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2952Issues:0Issues:0

deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Language:PythonLicense:MPL-2.0Stargazers:8094Issues:0Issues:0

ModuleFormer

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.

Language:PythonLicense:Apache-2.0Stargazers:217Issues:0Issues:0

ml4a

A python library and collection of notebooks for making art with machine learning.

Language:PythonLicense:MITStargazers:1580Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25357Issues:0Issues:0

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language:PythonStargazers:10474Issues:0Issues:0

LLMLingua

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonLicense:MITStargazers:4521Issues:0Issues:0