Pengxiang Li (pixeli99)

pixeli99

Geek Repo

Company:DUT IIAU

Location:Dalian, China

Github PK Tool:Github PK Tool

Pengxiang Li's starred repositories

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10717Issues:0Issues:0

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language:PythonLicense:MITStargazers:923Issues:0Issues:0

ESFT

Expert Specialized Fine-Tuning

Language:PythonLicense:MITStargazers:95Issues:0Issues:0

Kolors

Kolors Team

Language:PythonLicense:Apache-2.0Stargazers:2504Issues:0Issues:0

FlagScale

FlagScale is a large model toolkit based on open-sourced projects.

Language:PythonLicense:NOASSERTIONStargazers:103Issues:0Issues:0
Language:PythonStargazers:3Issues:0Issues:0

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:2689Issues:0Issues:0

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:280Issues:0Issues:0

OpenSTL

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

Language:PythonLicense:Apache-2.0Stargazers:667Issues:0Issues:0

Awesome-World-Model

Collect some World Models for Autonomous Driving papers.

Stargazers:308Issues:0Issues:0

Perturbed-Attention-Guidance

Official implementation of "Perturbed-Attention Guidance"

Language:Jupyter NotebookLicense:MITStargazers:224Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0
Language:PythonStargazers:8Issues:0Issues:0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:1561Issues:0Issues:0
Language:HTMLStargazers:36Issues:0Issues:0

LingoQA

Official GitHub repository for the paper "LingoQA: Video Question Answering for Autonomous Driving"

Language:PythonLicense:NOASSERTIONStargazers:92Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Language:PythonLicense:MITStargazers:3230Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:46Issues:0Issues:0

chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Language:PythonLicense:NOASSERTIONStargazers:1544Issues:0Issues:0
Stargazers:39Issues:0Issues:0

EVE

EVE: Encoder-Free Vision-Language Models from BAAI

Language:PythonLicense:MITStargazers:149Issues:0Issues:0

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:PythonStargazers:701Issues:0Issues:0

Open-MAGVIT2

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Language:PythonLicense:Apache-2.0Stargazers:325Issues:0Issues:0

Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

Language:PythonStargazers:435Issues:0Issues:0

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Language:PythonLicense:Apache-2.0Stargazers:891Issues:0Issues:0

ml-4m

4M: Massively Multimodal Masked Modeling

Language:PythonLicense:Apache-2.0Stargazers:1385Issues:0Issues:0

eye_for_an_eye

Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models

Language:PythonStargazers:13Issues:0Issues:0

MotionClone

Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Language:PythonStargazers:283Issues:0Issues:0

ctrl-x

Official implementation of "Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance"

Stargazers:72Issues:0Issues:0