Dingpx

Pengxiang ding's starred repositories

RoLD

Official code for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion

Language:Python800

3D-Diffusion-Policy

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Language:PythonMIT25600

GALA3D

[ICML 2024] GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting

Language:HTML22500

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonApache-2.0179300

LLM-Agents-Papers

A repo lists papers related to LLM based agent

Language:Python74300

llm-mcts

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Language:PythonApache-2.08400

GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Language:PythonApache-2.05900

RoboFlamingo

Code for RoboFlamingo

Language:PythonMIT24200

Mixture-of-depths

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:Python9500

Mixture-of-Depths

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:PythonMIT4200

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.0304700

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION2180600

SEED-X

Multimodal Models in Real World

Language:Jupyter NotebookNOASSERTION27700

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language:Python278400

LLM-TAMP

LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning

Language:PythonMIT2300

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT363500

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonApache-2.031900

ManiGaussian

ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation

Language:PythonMIT9400

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonApache-2.0236900

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter Notebook135900

cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Language:PythonMIT20300

Awesome-Motion-Diffusion-Models

A collection of resources and papers on Motion Diffusion Models.

MIT1200

DL-Course-2024

Official Repository for Westlake Deep Learning Course (2024)

Language:Jupyter Notebook1500

diffusion-of-thoughts

Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

Language:Python5000

Elysium

Elysium: Exploring Object-level Perception in Videos via MLLM

2100

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookMIT125300

EAI

Official code of [AAAI2024] Expressive Forecasting of 3D Whole-body Human Motions

Language:Python1900

Awesome-Mamba-Papers

Awesome Papers related to Mamba.

86000

AAAI-2024-Papers

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ⭐ experience the forefront of progress in artificial intelligence with this repository!

Language:PythonMIT24100