Jingbo (wangjingbo1219)

wangjingbo1219

Geek Repo

Company:Shanghai AI LAB

Location:Shanghai

Github PK Tool:Github PK Tool

Jingbo 's starred repositories

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:10344Issues:152Issues:156

InternLM

Official release of InternLM2 7B and 20B base and chat models. 200K context support

Language:PythonLicense:Apache-2.0Stargazers:5320Issues:50Issues:288

dust3d

Dust3D is a cross-platform 3D modeling software that makes it easy to create low poly 3D models for video games, 3D printing, and more.

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:2842Issues:29Issues:353

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonLicense:NOASSERTIONStargazers:1228Issues:24Issues:78

CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Language:PythonLicense:MITStargazers:1050Issues:20Issues:16

diffusion_policy

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion

Language:PythonLicense:MITStargazers:954Issues:11Issues:67

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language:PythonLicense:Apache-2.0Stargazers:822Issues:21Issues:48

Gaussian-Head-Avatar

[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"

Language:PythonLicense:NOASSERTIONStargazers:662Issues:63Issues:24

GeoWizard

[arXiv'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

eai-vc

The repository for the largest and most comprehensive empirical study of visual foundation models for Embodied AI (EAI).

Language:PythonLicense:NOASSERTIONStargazers:431Issues:19Issues:10

GPS-Gaussian

[CVPR 2024 Highlight] The official repo for “GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis”

Language:PythonLicense:MITStargazers:423Issues:25Issues:46

MiniGPT4-video

Official code for MiniGPT4-video

Language:PythonLicense:BSD-3-ClauseStargazers:392Issues:9Issues:26

trainable-agents

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Language:PythonLicense:Apache-2.0Stargazers:358Issues:16Issues:8

PIP

A real-time system that captures physically correct human motion, joint torques, and ground reaction forces with only 6 inertial measurement units

Language:PythonLicense:GPL-3.0Stargazers:302Issues:16Issues:40

probe3d

[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Language:PythonLicense:MITStargazers:195Issues:5Issues:4

LL3DA

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Language:PythonLicense:MITStargazers:175Issues:3Issues:18

OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

FlowMDM

[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".

Language:PythonLicense:NOASSERTIONStargazers:160Issues:11Issues:11

Paper-List

A paper list of my history reading. Robotics, Learning, Vision.

SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)

Language:PythonLicense:MITStargazers:126Issues:3Issues:4

insactor

[NeurIPS 2023] InsActor: Instruction-driven Physics-based Characters

UrbanArchitect

The official repository of our paper: "Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior"

Language:PythonStargazers:60Issues:8Issues:0

Ada-LEval

The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"

QuasiSim

Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer

Language:PythonLicense:MITStargazers:33Issues:0Issues:0

Portrait-Mode-Video

Video dataset dedicated to portrait-mode video recognition.

SkillDiffuser

[CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"

Language:PythonStargazers:19Issues:0Issues:0

PhysMoP

Code repository for Incorporating Physics Principles for Precise Human Motion Prediction

Language:PythonLicense:MITStargazers:14Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:11Issues:0Issues:0

IceFormer

Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).

Language:HTMLLicense:MITStargazers:1Issues:0Issues:0