Herman (zhanghm1995)

zhanghm1995

Geek Repo

Company:CUHKSZ

Location:Shenzhen, China

Home Page:https://blog.csdn.net/zhanghm1995

Github PK Tool:Github PK Tool

Herman's starred repositories

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21003Issues:178Issues:420

Yi

A series of large language models trained from scratch by developers @01-ai

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7525Issues:110Issues:291

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5192Issues:38Issues:37

Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2917Issues:27Issues:108

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language:PythonLicense:NOASSERTIONStargazers:2195Issues:34Issues:102
Language:PythonLicense:NOASSERTIONStargazers:1850Issues:94Issues:37

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language:PythonLicense:Apache-2.0Stargazers:1476Issues:22Issues:65

3dv_tutorial

An Invitation to 3D Vision: A Tutorial for Everyone

Language:CMakeLicense:NOASSERTIONStargazers:1435Issues:44Issues:9

Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Language:PythonLicense:MITStargazers:1394Issues:20Issues:274

waymax

A JAX-based simulator for autonomous driving research.

Language:PythonLicense:NOASSERTIONStargazers:807Issues:14Issues:55

DriveLM

[ECCV 2024] DriveLM: Driving with Graph Visual Question Answering

Language:HTMLLicense:Apache-2.0Stargazers:741Issues:21Issues:69

viser

Web-based 3D visualization + Python

Language:PythonLicense:Apache-2.0Stargazers:626Issues:30Issues:82

4d-gaussian-splatting

[ICLR 2024] Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting

Language:PythonLicense:MITStargazers:531Issues:25Issues:44

RenderOcc

[ICRA 2024] RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. (Former version: UniOcc)

OccWorld

[ECCV 2024] 3D World Model for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:307Issues:9Issues:27

SuperFusion

[ICRA 2024] SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation

Language:PythonLicense:GPL-3.0Stargazers:297Issues:25Issues:12

ChatSim

[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration

Language:Jupyter NotebookLicense:MITStargazers:249Issues:4Issues:77

DNGaussian

[CVPR'24] DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization

Language:PythonLicense:NOASSERTIONStargazers:216Issues:10Issues:31

Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

Language:PythonLicense:BSD-3-ClauseStargazers:142Issues:9Issues:16

PaSCo

[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"

Language:PythonLicense:Apache-2.0Stargazers:131Issues:13Issues:9

DoGaussian

DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus

talk2bev

Talk2BEV: Language-Enhanced Bird's Eye View Maps (Accepted to ICRA'24)

Language:PythonLicense:BSD-3-ClauseStargazers:85Issues:2Issues:7

Reason3D-PyTorch

Reasoning 3D Segmentation - "segment anything"/grounding/part seperation in 3D with natural conversations.

Language:PythonLicense:NOASSERTIONStargazers:73Issues:10Issues:1

outdoor-nerf-depth

[ACM MM 2023] Digging into Depth Priors for Outdoor Neural Radiance Fields

Point-PEFT

(AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models

awesome-world-models-for-AD

A curated list of awesome world models for autonomous driving (continually updated)

License:Apache-2.0Stargazers:9Issues:3Issues:0