yigityu's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:21685Issues:197Issues:3196

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20117Issues:193Issues:363

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:7863Issues:151Issues:528

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7396Issues:83Issues:148

hybridclr

HybridCLR是一个特性完整、零成本、高性能、低内存的Unity全平台原生c#热更新解决方案。 HybridCLR is a fully featured, zero-cost, high-performance, low-memory solution for Unity's all-platform native c# hotupdate.

Language:C++License:MITStargazers:6026Issues:83Issues:122

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3084Issues:29Issues:958

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2935Issues:47Issues:77

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2095Issues:30Issues:259

StableDiffusion-CheatSheet

A list of StableDiffusion styles and some notes for offline use. Pure HTML, CSS and a bit of JS.

Language:JavaScriptLicense:MITStargazers:1735Issues:38Issues:0

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Language:Jupyter NotebookLicense:MITStargazers:1510Issues:46Issues:251

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:1142Issues:51Issues:15

Prompt-Free-Diffusion

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

Language:PythonLicense:MITStargazers:713Issues:12Issues:25

vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Language:PythonLicense:MITStargazers:679Issues:33Issues:44

bark.cpp

Suno AI's Bark model in C/C++ for fast text-to-speech

Language:C++License:MITStargazers:618Issues:34Issues:74

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Language:PythonLicense:MITStargazers:608Issues:17Issues:42

StyleDrop-PyTorch

Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)

Language:PythonLicense:MITStargazers:552Issues:9Issues:25

Prompt-Diffusion

Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"

Language:PythonLicense:Apache-2.0Stargazers:362Issues:7Issues:13

AnimeInbet

Code and data for ICCV23 work "Deep Geometrized Cartoon Line Inbetweening"

SoundStorm

The reproduced code for Google's SoundStorm

Whispering-LLaMA

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Language:Jupyter NotebookLicense:MITStargazers:204Issues:4Issues:10

aero

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Language:PythonLicense:MITStargazers:184Issues:6Issues:24

RAVE-Latent-Diffusion

Generate new latent codes for RAVE with Denoising Diffusion models.

Language:PythonLicense:MITStargazers:156Issues:15Issues:3

Barkify

Barkify: an unoffical training implementation of Bark TTS by suno-ai

react-native-unity

React Native package to use Unity as library with RN app

Kandinsky-2-textual-inversion

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:74Issues:1Issues:0

com.unity.addressables

Addressables extended with the ability to build multiple catalogs, ideal for traditional DLC purposes.

Language:C#License:NOASSERTIONStargazers:72Issues:3Issues:10

react-native-unity2

Embed Unity content in your React Native app!

Language:JavaLicense:MITStargazers:33Issues:3Issues:24

Philia-GUI

An easy to use imageboard scraper.

bark-with-voice-clone

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:19Issues:1Issues:0

com.unity.addressables

[Mirrored from UPM, without any changes. Maintained by Needle. Not affiliated with Unity Technologies.] 📦 The Addressable Asset System allows the developer to ask for an asset via its address. Once an asset (e.g. a prefab) is marked "addressable", it generates an address which can be called from anywhere. Wherever the asset resides (local or remote), the system will locate it and its dependencies, then return it.Use Window->Asset Management->Addressables to begin working with the system.Addressables use asynchronous loading to support loading from any location with any collection of dependencies. Whether you have been using direct references, traditional asset bundles, or Resource folders, addressables provide a simpler way to make your game more dynamic. Addressables simultaneously opens up the world of asset bundles while managing all the complexity.For usage samples, see github.com/Unity-Technologies/Addressables-Sample

Language:C#License:NOASSERTIONStargazers:1Issues:0Issues:0