Chenxi's repositories
Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs
ControlVideo
Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"
cog-stable-diffusion
Diffusers Stable Diffusion as a Cog model
Prompt-Free-Diffusion
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
StyleDrop-PyTorch
Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)
difformer
The offical codebase for Difformer: Empowering Diffusion Models on the Embedding Space for Text Generation
FastSAM
Fast Segment Anything
lorahub
The official repository of paper "LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition".
ProFusion
Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
recognize-anything
Code for the Recognize Anything Model (RAM) and Tag2Text Model
ResShift
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (PyTorch)
Text2Video-Zero
Text-to-Image Diffusion Models are Zero-Shot Video Generators
webie
Dataset for web-scaled information extraction.