SetoKaiba's starred repositories
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
text-generation-inference
Large Language Model Text Generation Inference
chatgpt_system_prompt
A collection of GPT system prompts and various prompt injection/leaking knowledge.
Bert-VITS2
vits2 backbone with multilingual-bert
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
LucidDreamer
Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"
MB-iSTFT-VITS
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
UnityRuntimeNodeEditor
Unity runtime node editor using with Unity UI.
Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
Consistent4D
[ICLR 2024] Official Implementation of Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video
URP-ScreenSpaceCavity
Blender Cavity Effect for Unity
ai-town-rwkv-proxy
Run a large AI town, locally, via RWKV !
MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
simulcast-playground
single-page simulcast tests