Long Chen (longcw)

longcw

Geek Repo

Company:Tsinghua University

Location:Beijing, China

Home Page:https://longcw.github.io

Github PK Tool:Github PK Tool

Long Chen's starred repositories

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38022Issues:378Issues:1577

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:34733Issues:347Issues:1672
Language:PythonLicense:NOASSERTIONStargazers:34456Issues:309Issues:348

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:29944Issues:245Issues:1930

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:16811Issues:153Issues:1306

anything-llm

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

Language:JavaScriptLicense:MITStargazers:14425Issues:117Issues:933

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:8884Issues:94Issues:613

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8627Issues:76Issues:439
Language:PythonLicense:Apache-2.0Stargazers:8445Issues:81Issues:1722

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language:TypeScriptLicense:Apache-2.0Stargazers:7103Issues:49Issues:55

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:5769Issues:71Issues:211

GroundingDINO

Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:5168Issues:34Issues:272

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4117Issues:56Issues:315

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3910Issues:52Issues:110

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonLicense:MITStargazers:3735Issues:85Issues:85

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:3317Issues:30Issues:248

T2I-Adapter

T2I-Adapter

Language:PythonLicense:Apache-2.0Stargazers:3205Issues:41Issues:105

EditAnything

Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)

Language:PythonLicense:Apache-2.0Stargazers:3155Issues:39Issues:57

promptfoo

Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.

Language:TypeScriptLicense:MITStargazers:2951Issues:16Issues:383

TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

Language:PythonLicense:MITStargazers:1477Issues:78Issues:40

CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language:PythonLicense:Apache-2.0Stargazers:696Issues:13Issues:31

MVDream

Multi-view Diffusion for 3D Generation

Language:PythonLicense:MITStargazers:668Issues:20Issues:31

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language:PythonLicense:Apache-2.0Stargazers:637Issues:10Issues:22

gaussian-grouping

Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:447Issues:19Issues:36

MVDream-threestudio

3D generation code for MVDream

Language:PythonLicense:Apache-2.0Stargazers:441Issues:18Issues:26

multimodal-garment-designer

This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023

Language:PythonLicense:NOASSERTIONStargazers:373Issues:28Issues:29

OmniLMM

Large Multi-modal Models for Strong Performance and Efficient Deployment

Language:PythonLicense:Apache-2.0Stargazers:371Issues:11Issues:24

laion-datasets

Description and pointers of laion datasets

Language:HTMLLicense:MITStargazers:213Issues:6Issues:8