Xiaodong Wang (Wang-Xiaodong1899)

Wang-Xiaodong1899

Geek Repo

Company:Peking University

Home Page:https://wang-xiaodong1899.github.io/

Github PK Tool:Github PK Tool

Xiaodong Wang's starred repositories

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:29170Issues:357Issues:1522

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24401Issues:191Issues:3861

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:18230Issues:170Issues:1248

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

AgentVerse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

Language:JavaScriptLicense:Apache-2.0Stargazers:3932Issues:57Issues:76

pytorch-fid

Compute FID scores with PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:3275Issues:15Issues:85

UniAD

[CVPR'23 Best Paper Award] Planning-oriented Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:3182Issues:34Issues:171

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Language:PythonLicense:Apache-2.0Stargazers:3130Issues:69Issues:260

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language:PythonLicense:MITStargazers:2438Issues:34Issues:260

nuscenes-devkit

The devkit of the nuScenes dataset.

Language:PythonLicense:NOASSERTIONStargazers:2210Issues:51Issues:780

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileLicense:MITStargazers:1352Issues:23Issues:32

tomesd

Speed up Stable Diffusion with this one simple trick!

Language:PythonLicense:MITStargazers:1252Issues:19Issues:48

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language:Jupyter NotebookLicense:MITStargazers:1080Issues:14Issues:33

text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Language:PythonLicense:MITStargazers:998Issues:10Issues:31

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language:PythonLicense:Apache-2.0Stargazers:877Issues:12Issues:18

Awesome-LLM4AD

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language:PythonLicense:Apache-2.0Stargazers:736Issues:7Issues:54
Language:PythonLicense:MITStargazers:674Issues:9Issues:27

VAD

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:540Issues:27Issues:71

Awesome-Papers-Autonomous-Agent

A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.

MovieChat

[CVPR 2024] 🎬💭 chat with over 10K frames of video!

Language:PythonLicense:BSD-3-ClauseStargazers:471Issues:10Issues:71

Agent-Attention

Official repository of Agent Attention (ECCV2024)

TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Language:PythonLicense:MITStargazers:259Issues:11Issues:31

ChatEval

Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"

Language:PythonLicense:Apache-2.0Stargazers:216Issues:3Issues:8

multi-lora-fine-tune

Provide Efficient LLM Fine-Tune via Multi-LoRA Optimization

Language:PythonLicense:Apache-2.0Stargazers:190Issues:3Issues:42

ReCo

ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023

Language:Jupyter NotebookLicense:MITStargazers:112Issues:5Issues:10

InstructionGPT-4

InstructionGPT-4

Language:PythonLicense:MITStargazers:35Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:4Issues:1Issues:0