SetoKaiba's starred repositories

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

Language:PythonLicense:AGPL-3.0Stargazers:37926Issues:328Issues:3477

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25103Issues:219Issues:450

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonLicense:MITStargazers:16295Issues:154Issues:1150

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookLicense:MITStargazers:9648Issues:85Issues:246

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9111Issues:95Issues:623

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonLicense:Apache-2.0Stargazers:8277Issues:100Issues:1157

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language:HTMLLicense:MITStargazers:7620Issues:83Issues:9

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonLicense:MITStargazers:7380Issues:83Issues:148

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:7289Issues:48Issues:0

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptLicense:Apache-2.0Stargazers:6686Issues:79Issues:489

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonLicense:MITStargazers:6437Issues:54Issues:199

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:5414Issues:67Issues:969

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonLicense:Apache-2.0Stargazers:4621Issues:38Issues:561

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Language:PythonLicense:MITStargazers:2921Issues:68Issues:195

SMPLer-X

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Language:PythonLicense:NOASSERTIONStargazers:897Issues:21Issues:61

sliders

Concept Sliders for Precise Control of Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:771Issues:12Issues:89

LucidDreamer

Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"

Language:PythonLicense:MITStargazers:700Issues:23Issues:33
Language:Jupyter NotebookLicense:MITStargazers:427Issues:5Issues:5

MB-iSTFT-VITS

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Language:PythonLicense:Apache-2.0Stargazers:399Issues:17Issues:25

UnityRuntimeNodeEditor

Unity runtime node editor using with Unity UI.

Language:C#License:MITStargazers:377Issues:11Issues:13

Awesome-Multimodal-LLM

Research Trends in LLM-guided Multimodal Learning.

xrmocap

OpenXRLab Multi-view Motion Capture Toolbox and Benchmark

Language:PythonLicense:NOASSERTIONStargazers:320Issues:8Issues:70

jle

'Jet-Lagged Engine' is a work-in-progress C++/Lua game engine supporting Windows, Linux, Mac and browsers.

Language:C++License:GPL-3.0Stargazers:238Issues:11Issues:3

Consistent4D

[ICLR 2024] Official Implementation of Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

Language:PythonLicense:Apache-2.0Stargazers:220Issues:9Issues:9

URP-ScreenSpaceCavity

Blender Cavity Effect for Unity

Language:C#License:BSD-3-ClauseStargazers:170Issues:10Issues:8

ai-town-rwkv-proxy

Run a large AI town, locally, via RWKV !

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language:PythonLicense:MITStargazers:99Issues:5Issues:16

simulcast-playground

single-page simulcast tests