Vlad Bataev (vladbataev)

vladbataev

Geek Repo

Company:@yandex

Location:Istanbul

Github PK Tool:Github PK Tool

Vlad Bataev's starred repositories

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33932Issues:316Issues:423

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23536Issues:217Issues:3588

spotify-downloader

Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).

Language:PythonLicense:MITStargazers:15745Issues:189Issues:1448

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:10581Issues:141Issues:338

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10290Issues:107Issues:18

lyra

A Very Low-Bitrate Codec for Speech Compression

Language:C++License:Apache-2.0Stargazers:3803Issues:113Issues:125

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2480Issues:30Issues:376

wemake-python-styleguide

The strictest and most opinionated python linter ever!

Language:PythonLicense:MITStargazers:2473Issues:31Issues:1089

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Language:PythonLicense:NOASSERTIONStargazers:2346Issues:42Issues:102

audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Language:PythonLicense:MITStargazers:2331Issues:60Issues:167

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2210Issues:32Issues:268

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonLicense:NOASSERTIONStargazers:2164Issues:44Issues:66

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

eng-handbook

A developer's guide to management: an open-sourced handbook for leading software engineering teams.

ftp

FTP client package for Go

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:1264Issues:29Issues:84

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonLicense:MITStargazers:1007Issues:24Issues:52

YaFSDP

YaFSDP: Yet another Fully Sharded Data Parallel

Language:PythonLicense:Apache-2.0Stargazers:793Issues:14Issues:3

pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Language:PythonLicense:Apache-2.0Stargazers:692Issues:17Issues:72

hidet

An open-source efficient deep learning framework/compiler, written in python.

Language:PythonLicense:Apache-2.0Stargazers:635Issues:17Issues:81

audio-dataset

Audio Dataset for training CLAP and other models

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

COMET

A Neural Framework for MT Evaluation

Language:PythonLicense:Apache-2.0Stargazers:453Issues:17Issues:161

knn-vc

Voice Conversion With Just Nearest Neighbors

Language:PythonLicense:NOASSERTIONStargazers:431Issues:14Issues:35

WaveDiff

Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Language:PythonLicense:AGPL-3.0Stargazers:356Issues:12Issues:13

cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Language:PythonLicense:Apache-2.0Stargazers:215Issues:8Issues:11

DiscreteSpeechMetrics

Reference-aware automatic speech evaluation toolkit

Language:PythonLicense:MITStargazers:80Issues:4Issues:2

podcasts-dataset

dataset of podcasts and episodes

Language:PythonStargazers:13Issues:3Issues:0

sd-benchmarks

Stable Diffusion inference benchmarks

Language:PythonStargazers:10Issues:4Issues:0