Beast code in Giters

SetoKaiba's starred repositories

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

Language:PythonAGPL-3.037926 328 3477

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause25103 219 450

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonMIT16295 154 1150

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

10167 237 98

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookMIT9648 85 246

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause9111 95 623

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.08277 100 1157

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language:HTMLMIT7620 83 9

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT7380 83 148

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonAGPL-3.07289 480

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptApache-2.06686 79 489

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language:PythonMIT6437 54 199

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT5414 67 969

sub-web

Language:VueMIT4790 42 121

VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Language:PythonApache-2.04621 38 561

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Language:PythonMIT2921 68 195

SMPLer-X

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Language:PythonNOASSERTION897 21 61

sliders

Concept Sliders for Precise Control of Diffusion Models

Language:Jupyter NotebookMIT771 12 89

LucidDreamer

Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"

Language:PythonMIT700 23 33

nlp

Language:Jupyter NotebookMIT427 5 5

MB-iSTFT-VITS

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Language:PythonApache-2.0399 17 25

UnityRuntimeNodeEditor

Unity runtime node editor using with Unity UI.

Language:C#MIT377 11 13

Awesome-Multimodal-LLM

Research Trends in LLM-guided Multimodal Learning.

MIT334 16 4

xrmocap

OpenXRLab Multi-view Motion Capture Toolbox and Benchmark

Language:PythonNOASSERTION320 8 70

jle

'Jet-Lagged Engine' is a work-in-progress C++/Lua game engine supporting Windows, Linux, Mac and browsers.

Language:C++GPL-3.0238 11 3

Consistent4D

[ICLR 2024] Official Implementation of Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video

Language:PythonApache-2.0220 9 9

URP-ScreenSpaceCavity

Blender Cavity Effect for Unity

Language:C#BSD-3-Clause170 10 8

ai-town-rwkv-proxy

Run a large AI town, locally, via RWKV !

Language:TypeScript140 3 2

MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

Language:PythonMIT99 5 16

simulcast-playground

single-page simulcast tests

Language:HTML32 7 1