Guang YANG's starred repositories
llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models. *Check out the demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
aesthetic-predictor-v2-5
SigLIP-based Aesthetic Score Predictor
magvit2-pytorch
Implementation of the MagViT2 Tokenizer in PyTorch
Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
stable-diffusion-webui
Stable Diffusion web UI
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
stable-diffusion-webui-wd14-tagger
Labeling extension for Automatic1111's Web UI
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
generative-models
Generative Models by Stability AI
labelImg
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
SuperCLUE-Open
An Open-Domain, Multi-Turn Evaluation Benchmark for General-Purpose Foundation Models in Chinese
open_flamingo
An open-source framework for training large multimodal models.