
Jou-ching (George) Sung's starred repositories

voice-changer

Realtime Voice Changer

Language: Python · License: NOASSERTION · 1495300

brew

🍺 The missing package manager for macOS (or Linux)

Language: Ruby · License: BSD-2-Clause · 3967900

LocalAI

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. It can generate text, audio, video, and images, and also has voice cloning capabilities.

Language: C++ · License: MIT · 2058100

amadeus

Create RP training data from a VN, using GPT-4

Language: Jupyter Notebook · License: MIT · 1300

tabbyAPI

An OAI compatible exllamav2 API that's both lightweight and fast

Language: Python · License: AGPL-3.0 · 31300

lerobot

🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in Pytorch

Language: Python · License: Apache-2.0 · 319800

openWakeWord

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Language: Jupyter Notebook · License: Apache-2.0 · 48300

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance.

Language: Jupyter Notebook · License: MIT · 273100

ui

Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.

Language: TypeScript · License: MIT · 5929100

llm-datasets

High-quality datasets, tools, and concepts for LLM fine-tuning.

86300

OpenVoice

Instant voice cloning by MyShell.

Language: Python · License: MIT · 2586700

vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Language: Python · License: MIT · 636600

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language: Python · License: MIT · 304900

streamlit-audio-recorder

Record audio from the user's microphone in apps that are deployed to the web (via the browser Media API; React-based Streamlit custom component).

Language: TypeScript · License: MIT · 34900

tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

Language: TypeScript · License: MIT · 135600

streamlit-webrtc

Real-time video and audio streams over the network, with Streamlit.

Language: Python · License: MIT · 121200

ssl-proxy

🔒 Simple zero-config SSL reverse proxy with real autogenerated certificates (LetsEncrypt, self-signed, provided)

Language: Go · License: MIT · 71700

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python · License: MPL-2.0 · 3006100

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language: Python · License: MIT · 324100

diffusion-fast

Faster generation with text-to-image diffusion models.

Language: Python · License: Apache-2.0 · 15100

nicegui

Create web-based user interfaces with Python. The nice way.

Language: Python · License: MIT · 772900

parler-tts

Inference and training library for high-quality TTS models.

Language: Python · License: Apache-2.0 · 263800

Fooocus

Focus on prompting and generating

Language: Python · License: GPL-3.0 · 3643200

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language: Jupyter Notebook · 130100

llamafile

Distribute and run LLMs with a single file.

Language: C++ · License: NOASSERTION · 1565500

tryondiffusion

PyTorch implementation of "TryOnDiffusion: A Tale of Two UNets", a virtual try-on diffusion-based network by Google

Language: Python · License: MIT · 13000

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language: Python · License: Apache-2.0 · 352000

fsgan

FSGAN - Official PyTorch Implementation

Language: Jupyter Notebook · License: CC0-1.0 · 73800

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Python · License: Apache-2.0 · 729400

moondream

Tiny vision language model

Language: Jupyter Notebook · License: Apache-2.0 · 402700

georgesung