liubo0902's repositories
a1111-sd-webui-lycoris
An extension for stable-diffusion-webui to load lycoris models.
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
ComfyUI
The most powerful and modular stable diffusion GUI with a graph/nodes interface.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Emu
Emu Series: Generative Multimodal Models from BAAI
FiT
FiT: Flexible Vision Transformer for Diffusion Model
InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds
Latte
The official implementation of Latte: Latent Diffusion Transformer for Video Generation.
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
MoAI
Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks. (Under Review)
MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
PixArt-sigma
New PixArt Model, Faster, Stronger, Better
QAnything
Question and Answer based on Anything.
ScaleLLM
A high-performance inference system for large language models, designed for production environments.
sd-webui-controlnet
WebUI extension for ControlNet
sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
ViR
Official Repository for ViR: Towards Efficient Vision Retention Backbones
xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM)