Jingbo (wangjingbo1219)

wangjingbo1219

Geek Repo

Company:Shanghai AI LAB

Location:Shanghai

Github PK Tool:Github PK Tool

Jingbo 's starred repositories

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:57798Issues:494Issues:102

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20337Issues:198Issues:368

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18465Issues:158Issues:1423

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5726Issues:66Issues:410

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3895Issues:114Issues:73

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3110Issues:26Issues:129

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Language:PythonLicense:Apache-2.0Stargazers:1679Issues:34Issues:121

AnimatableGaussians

Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"

Language:PythonLicense:NOASSERTIONStargazers:840Issues:40Issues:39

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Language:PythonLicense:MITStargazers:719Issues:26Issues:58

octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonLicense:MITStargazers:701Issues:18Issues:88

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

Language:PythonLicense:Apache-2.0Stargazers:411Issues:5Issues:49
Language:PythonLicense:Apache-2.0Stargazers:368Issues:9Issues:13
Language:PythonLicense:NOASSERTIONStargazers:311Issues:6Issues:16

MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Language:PythonLicense:GPL-3.0Stargazers:306Issues:13Issues:14

embodied-generalist

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Language:PythonLicense:MITStargazers:301Issues:15Issues:40

GART

GART: Gaussian Articulated Template Models

Language:PythonLicense:MITStargazers:239Issues:9Issues:19

motion-planner-reinforcement-learning

End to end motion planner using Deep Deterministic Policy Gradient (DDPG) in gazebo

MimicPlay

"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository

Language:PythonLicense:MITStargazers:193Issues:4Issues:9

ect

Consistency Models Made Easy

LIBERO

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Language:Jupyter NotebookLicense:MITStargazers:171Issues:2Issues:16

tram

TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos

Language:PythonLicense:MITStargazers:150Issues:14Issues:8

UniTraj

A Unified Framework for scalable Vehicle Trajectory Prediction

Language:PythonLicense:NOASSERTIONStargazers:136Issues:6Issues:14

droid_policy_learning

DROID Policy Learning and Evaluation

Language:PythonLicense:MITStargazers:124Issues:4Issues:21

SuperNormal

[CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration"

Language:PythonLicense:MITStargazers:123Issues:4Issues:3

mixture-density-network

Mixture density network implemented in PyTorch.

Language:PythonLicense:MITStargazers:122Issues:3Issues:6

PULSE

Official Implementation of the ICLR 2023 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control

Duolando

Code for ICLR 2024 paper "Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment"

GraspXL

This is a repository for GraspXL, which can generate objective-drive grasping motions for 500k+ objects with different dexterous hands.