longcw

Long Chen's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python | License: Apache-2.0 | 38022 378 1577

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | 34733 347 1672

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | 29944 245 1930

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | 16811 153 1306

anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

Language: JavaScript | License: MIT | 14425 117 933

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Language: Python | 9415 161 630

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | 8884 94 613

open_clip

An open source implementation of CLIP.

Language: Jupyter Notebook | License: NOASSERTION | 8627 76 439

kohya_ss

Language: Python | License: Apache-2.0 | 8445 81 1722

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language: TypeScript | License: Apache-2.0 | 7103 49 55

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python | License: Apache-2.0 | 5769 71 211

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | 5168 34 272

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language: Jupyter Notebook | License: Apache-2.0 | 4117 56 315

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

Language: Jupyter Notebook | License: Apache-2.0 | 3910 52 110

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language: Python | License: MIT | 3735 85 85

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language: Python | License: MIT | 3317 30 248

T2I-Adapter

Language: Python | License: Apache-2.0 | 3205 41 105

EditAnything

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Language: Python | License: Apache-2.0 | 3155 39 57

promptfoo

Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.

Language: TypeScript | License: MIT | 2951 16 383

TokenFlow

Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)

Language: Python | License: MIT | 1477 78 40

CVinW_Readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

1028 37 6

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language: Python | License: Apache-2.0 | 696 13 31

MVDream

Multi-view Diffusion for 3D Generation

Language: Python | License: MIT | 668 20 31

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language: Python | License: Apache-2.0 | 637 10 22

gaussian-grouping

Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.

Language: Jupyter Notebook | License: Apache-2.0 | 447 19 36

MVDream-threestudio

3D generation code for MVDream

Language: Python | License: Apache-2.0 | 441 18 26

multimodal-garment-designer

This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023

Language: Python | License: NOASSERTION | 373 28 29

OmniLMM

Large Multi-modal Models for Strong Performance and Efficient Deployment

Language: Python | License: Apache-2.0 | 371 11 24

laion-datasets

Description and pointers of laion datasets

Language: HTML | License: MIT | 213 6 8