Jou-ching (George) Sung's starred repositories
face_recognition
The world's simplest facial recognition API for Python and the command line
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry-leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
facefusion
Industry-leading face manipulation platform
nerfstudio
A collaboration-friendly studio for NeRFs
bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Stable-Diffusion-WebUI-TensorRT
TensorRT Extension for Stable Diffusion Web UI
stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
sd-webui-inpaint-anything
Inpaint Anything extension performs stable diffusion inpainting on a browser UI using masks from Segment Anything.
awesome-japanese-llm
Overview of Japanese LLMs
HCP-Diffusion
A universal Stable-Diffusion toolbox
anime-segmentation
High-accuracy segmentation for anime characters
Wuerstchen
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"