HuaZheLei's starred repositories
ControlNet
Let us control diffusion models!
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
ml-engineering
Machine Learning Engineering Open Book
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
consistencydecoder
Consistency Distilled Diff VAE
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
ControlNet-for-Diffusers
Transfer the ControlNet with any basemodel in diffusers🔥
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
ScaleCrafter
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
prompt-pretraining
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
ai-town-rwkv-proxy
Run a large AI town, locally, via RWKV !