Bibek Chaudhary (imbibekk)

Location: Seoul

Bibek Chaudhary's starred repositories

python-patterns

A collection of design patterns/idioms in Python

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | Stargazers: 39157 | Issues: 326 | Issues: 3590

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 36245 | Issues: 348 | Issues: 1752

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 32945 | Issues: 277 | Issues: 1089

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | Stargazers: 23876 | Issues: 253 | Issues: 292

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10688 | Issues: 140 | Issues: 343

ml-engineering

Machine Learning Engineering Open Book

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 10524 | Issues: 109 | Issues: 20

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8772 | Issues: 116 | Issues: 115

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python | License: MIT | Stargazers: 8652 | Issues: 64 | Issues: 204

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8651 | Issues: 96 | Issues: 384

so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

Language: Python | License: NOASSERTION | Stargazers: 8642 | Issues: 67 | Issues: 359

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7564 | Issues: 108 | Issues: 291

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language: Python | License: MIT | Stargazers: 4609 | Issues: 79 | Issues: 187

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4400 | Issues: 58 | Issues: 149

Awesome-GPTs

Curated list of awesome GPTs 👍.

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language: Python | License: MIT | Stargazers: 2335 | Issues: 31 | Issues: 112

usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

Language: C++ | License: Apache-2.0 | Stargazers: 2065 | Issues: 27 | Issues: 138

AudioSep

Official implementation of "Separate Anything You Describe"

Language: Python | License: MIT | Stargazers: 1547 | Issues: 64 | Issues: 21

segment-anything-fast

A batched offline inference oriented version of segment-anything

Language: Python | License: Apache-2.0 | Stargazers: 1171 | Issues: 10 | Issues: 42

Language: Python | Stargazers: 1087 | Issues: 0 | Issues: 0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language: Python | License: MIT | Stargazers: 1084 | Issues: 26 | Issues: 72

LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Language: Python | License: Apache-2.0 | Stargazers: 1075 | Issues: 11 | Issues: 55

HeyGenClone

A simple and open-source analogue of the HeyGen system

parseq

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

Language: Python | License: Apache-2.0 | Stargazers: 544 | Issues: 13 | Issues: 139

ChatGPT-in-Slack

A quick demonstration of how to build a Slack app that enables end-users to interact with a ChatGPT bot

Language: Python | License: MIT | Stargazers: 436 | Issues: 15 | Issues: 44

mustango

Mustango: Toward Controllable Text-to-Music Generation

Language: Python | License: MIT | Stargazers: 317 | Issues: 16 | Issues: 12

XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Language: Python | License: MIT | Stargazers: 292 | Issues: 10 | Issues: 21

ai-audio-datasets-list

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.

AQUA-Tk

AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)

Language: Python | License: GPL-3.0 | Stargazers: 93 | Issues: 3 | Issues: 3

sensorium

NeurIPS | 1st place solution for Sensorium 2023 Competition

Language: Python | License: MIT | Stargazers: 22 | Issues: 2 | Issues: 1