Jingbo (wangjingbo1219)

Company:Shanghai AI LAB

Location:Shanghai

Jingbo's starred repositories

Language:PythonLicense:NOASSERTIONStargazers:168Issues:0Issues:0

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Stargazers:776Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3729Issues:0Issues:0

mixture-density-network

Mixture density network implemented in PyTorch.

Language:PythonLicense:MITStargazers:119Issues:0Issues:0

Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation

Language:PythonLicense:Apache-2.0Stargazers:388Issues:0Issues:0

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:48318Issues:0Issues:0

LIBERO

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Language:Jupyter NotebookLicense:MITStargazers:156Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20091Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:315Issues:0Issues:0

CogVLM

A state-of-the-art-level open visual language model | Multimodal pre-trained model

Language:PythonLicense:Apache-2.0Stargazers:5535Issues:0Issues:0

MimicPlay

"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository

Language:PythonLicense:MITStargazers:172Issues:0Issues:0

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

License:Apache-2.0Stargazers:637Issues:0Issues:0

GART

GART: Gaussian Articulated Template Models

Language:PythonLicense:MITStargazers:227Issues:0Issues:0

tram

TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos

Language:PythonLicense:MITStargazers:124Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17654Issues:0Issues:0

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Language:PythonLicense:Apache-2.0Stargazers:1569Issues:0Issues:0

momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Language:PythonLicense:MITStargazers:678Issues:0Issues:0

SuperNormal

[CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration"

Language:PythonLicense:MITStargazers:95Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3061Issues:0Issues:0

ect

Consistency Models Made Easy

Language:PythonStargazers:154Issues:0Issues:0

motion-planner-reinforcement-learning

End-to-end motion planner using Deep Deterministic Policy Gradient (DDPG) in Gazebo

Language:PythonStargazers:192Issues:0Issues:0

Duolando

Code for ICLR 2024 paper "Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment"

Language:PythonStargazers:84Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:280Issues:0Issues:0

octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonLicense:MITStargazers:606Issues:0Issues:0

embodied-generalist

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Language:PythonLicense:MITStargazers:272Issues:0Issues:0

GraspXL

This is a repository for GraspXL, which can generate objective-driven grasping motions for 500k+ objects with different dexterous hands.

Language:JavaScriptStargazers:22Issues:0Issues:0

UniTraj

A Unified Framework for scalable Vehicle Trajectory Prediction

Language:PythonLicense:NOASSERTIONStargazers:106Issues:0Issues:0

AnimatableGaussians

Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"

Language:PythonLicense:NOASSERTIONStargazers:797Issues:0Issues:0

PULSE

Official Implementation of the ICLR 2023 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control

Language:PythonStargazers:86Issues:0Issues:0

droid_policy_learning

DROID Policy Learning and Evaluation

Language:PythonLicense:MITStargazers:107Issues:0Issues:0