Long Chen's starred repositories
ColossalAI
Making large AI models cheaper, faster and more accessible
anything-llm
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
T2I-Adapter
T2I-Adapter
EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
gaussian-grouping
Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.
MVDream-threestudio
3D generation code for MVDream
multimodal-garment-designer
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
laion-datasets
Description and pointers of laion datasets