Pengxiang ding (Dingpx)

Dingpx

Geek Repo

Company:Zhejiang University

Github PK Tool:Github PK Tool

Pengxiang ding's starred repositories

RoLD

Official code for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion

Language:PythonStargazers:8Issues:0Issues:0
Language:PythonStargazers:21Issues:0Issues:0

3D-Diffusion-Policy

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Language:PythonLicense:MITStargazers:256Issues:0Issues:0

GALA3D

[ICML 2024] GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting

Language:HTMLStargazers:225Issues:0Issues:0

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonLicense:Apache-2.0Stargazers:1793Issues:0Issues:0

LLM-Agents-Papers

A repo lists papers related to LLM based agent

Language:PythonStargazers:743Issues:0Issues:0

llm-mcts

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Language:PythonLicense:Apache-2.0Stargazers:84Issues:0Issues:0

GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Language:PythonLicense:Apache-2.0Stargazers:59Issues:0Issues:0

RoboFlamingo

Code for RoboFlamingo

Language:PythonLicense:MITStargazers:242Issues:0Issues:0

Mixture-of-depths

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:PythonStargazers:95Issues:0Issues:0

Mixture-of-Depths

Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Language:PythonLicense:MITStargazers:42Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3047Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:21806Issues:0Issues:0

SEED-X

Multimodal Models in Real World

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:277Issues:0Issues:0

IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language:PythonStargazers:2784Issues:0Issues:0

LLM-TAMP

LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning

Language:PythonLicense:MITStargazers:23Issues:0Issues:0

VAR

[GPT beats diffusionšŸ”„] [scaling laws in visual generationšŸ“ˆ] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3635Issues:0Issues:0

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:319Issues:0Issues:0

ManiGaussian

ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation

Language:PythonLicense:MITStargazers:94Issues:0Issues:0

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:2369Issues:0Issues:0

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation šŸ”„

Language:Jupyter NotebookStargazers:1359Issues:0Issues:0

cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Language:PythonLicense:MITStargazers:203Issues:0Issues:0

Awesome-Motion-Diffusion-Models

A collection of resources and papers on Motion Diffusion Models.

License:MITStargazers:12Issues:0Issues:0

DL-Course-2024

Official Repository for Westlake Deep Learning Course (2024)

Language:Jupyter NotebookStargazers:15Issues:0Issues:0

diffusion-of-thoughts

Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

Language:PythonStargazers:50Issues:0Issues:0

Elysium

Elysium: Exploring Object-level Perception in Videos via MLLM

Stargazers:21Issues:0Issues:0

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1253Issues:0Issues:0

EAI

Official code of [AAAI2024] Expressive Forecasting of 3D Whole-body Human Motions

Language:PythonStargazers:19Issues:0Issues:0

Awesome-Mamba-Papers

Awesome Papers related to Mamba.

Stargazers:860Issues:0Issues:0

AAAI-2024-Papers

AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ā­ experience the forefront of progress in artificial intelligence with this repository!

Language:PythonLicense:MITStargazers:241Issues:0Issues:0