Bo Dai (doubledaibo)

Company: Shanghai AI Laboratory

Location: Shanghai

Home Page: http://daibo.info

Twitter: @doubledaibo

Bo Dai's starred repositories

MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Language: Python; License: AGPL-3.0; Stargazers: 6498; Issues: 0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook; License: Apache-2.0; Stargazers: 8215; Issues: 0

MotionLCM

[ECCV 2024] MotionLCM: Official implementation of "MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model"

Language: Python; License: NOASSERTION; Stargazers: 194; Issues: 0

Director3D

Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text".

Language: Python; License: NOASSERTION; Stargazers: 235; Issues: 0

InterScene

[3DV 2024] Official repo of "Synthesizing Physically Plausible Human Motions in 3D Scenes"

Language: Python; Stargazers: 95; Issues: 0

PacerPlus

Official implementation of the paper "PACER+: On-Demand Pedestrian Animation Controller in Driving Scenarios" (CVPR 2024).

Language: Python; Stargazers: 48; Issues: 0

CRATE

Code for CRATE (Coding RAte reduction TransformEr).

Language: Python; License: MIT; Stargazers: 1136; Issues: 0

EpiDiff

[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion

Language: Python; License: MIT; Stargazers: 92; Issues: 0

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language: Python; License: Apache-2.0; Stargazers: 3716; Issues: 0

Language: Python; License: Apache-2.0; Stargazers: 374; Issues: 0

Make-It-Vivid

[CVPR 2024] Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text

Language: Python; Stargazers: 65; Issues: 0

GSDF

GSDF: 3DGS Meets SDF for Improved Rendering and Reconstruction

Stargazers: 196; Issues: 0

Octree-GS

Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians

Language: C++; License: NOASSERTION; Stargazers: 490; Issues: 0

LN3Diff

[ECCV 2024] LN3Diff creates a high-quality 3D object mesh from text within 8 V100-seconds.

Language: Python; License: NOASSERTION; Stargazers: 92; Issues: 0

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language: Python; License: NOASSERTION; Stargazers: 4813; Issues: 0

SC-GS

[CVPR 2024] Code for SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

Language: Python; License: MIT; Stargazers: 442; Issues: 0

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language: Python; License: NOASSERTION; Stargazers: 2576; Issues: 0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python; License: NOASSERTION; Stargazers: 4485; Issues: 0

InternLM-Math

State-of-the-art bilingual open-source math reasoning LLMs.

Language: Python; License: Apache-2.0; Stargazers: 379; Issues: 0

UniHSI

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Language: Python; Stargazers: 146; Issues: 0

opencompass

OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python; License: Apache-2.0; Stargazers: 3459; Issues: 0

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

Language: Python; Stargazers: 2870; Issues: 0

CoSeR

[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution

Stargazers: 303; Issues: 0

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language: Python; License: Apache-2.0; Stargazers: 2087; Issues: 0

HIMLoco

Learning-based locomotion control from OpenRobotLab, including Hybrid Internal Model & H-Infinity Locomotion Control

Language: Python; License: NOASSERTION; Stargazers: 222; Issues: 0

Dataset

News: the 7k dataset is ready for download.

Language: HTML; License: NOASSERTION; Stargazers: 262; Issues: 0

GaussianSplattingViewer

Tiny Gaussian Splatting Viewer

Language: Python; License: MIT; Stargazers: 277; Issues: 0

Language: Python; Stargazers: 22; Issues: 0

DiffMorpher

Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)

Language: Python; License: NOASSERTION; Stargazers: 373; Issues: 0

BerfScene

[CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

Language: Python; Stargazers: 39; Issues: 0