wqdong8

Company: Zhejiang University

Location: Hangzhou, China

wqdong8's starred repositories

Language: Python | License: Apache-2.0 | Stargazers: 37 | Issues: 0

lectures

Material for cuda-mode lectures

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2028 | Issues: 0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | Stargazers: 1621 | Issues: 0
Language: Python | License: MIT | Stargazers: 212 | Issues: 0

GeoLRM

Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation

Language: Python | License: Apache-2.0 | Stargazers: 85 | Issues: 0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 6049 | Issues: 0

StableNormal

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Language: Python | License: Apache-2.0 | Stargazers: 117 | Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 26434 | Issues: 0

rectified_flow_prior

Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors

Language: Python | Stargazers: 56 | Issues: 0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 7674 | Issues: 0

XCube

[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

Language: Python | License: NOASSERTION | Stargazers: 181 | Issues: 0

3Doodle

Official implementation of 3Doodle: Compact Abstraction of Objects with 3D Strokes (SIGGRAPH '24, Journal track)

Language: Python | Stargazers: 46 | Issues: 0

NKSR

[CVPR 2023 Highlight] Neural Kernel Surface Reconstruction

Language: Python | License: NOASSERTION | Stargazers: 717 | Issues: 0

OHTA

[CVPR 2024] OHTA: One-shot Hand Avatar via Data-driven Implicit Priors

Language: Python | License: MIT | Stargazers: 18 | Issues: 0

Recap-DataComp-1B

This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"

Stargazers: 108 | Issues: 0

MeshAnything

From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Language: Python | License: NOASSERTION | Stargazers: 1843 | Issues: 0
License: MIT | Stargazers: 39 | Issues: 0

UniDepth

Universal Monocular Metric Depth Estimation

Language: Python | License: NOASSERTION | Stargazers: 499 | Issues: 0

Coverage_Axis

Official code for the paper Coverage Axis: Inner Point Selection for 3D Shape Skeletonization, Eurographics 2022.

Language: C++ | Stargazers: 75 | Issues: 0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | Stargazers: 1104 | Issues: 0

GeoWizard

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Language: Python | Stargazers: 658 | Issues: 0

masa

Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything

Language: Python | License: Apache-2.0 | Stargazers: 908 | Issues: 0
Language: Python | License: MIT | Stargazers: 46 | Issues: 0

Physics3D

Official implementation of Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

Language: Python | License: MIT | Stargazers: 128 | Issues: 0

EMAP

[CVPR'24] 3D Neural Edge Reconstruction

Language: Python | License: MIT | Stargazers: 137 | Issues: 0

streamv2v

Official PyTorch implementation of StreamV2V.

Language: Python | License: NOASSERTION | Stargazers: 411 | Issues: 0

3DHighlighter

Localizing Regions on 3D Shapes via Text Descriptions

Language: Python | Stargazers: 97 | Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 8160 | Issues: 0

Reason3D-PyTorch

Reasoning 3D Segmentation - "segment anything"/grounding/part separation in 3D with natural conversations.

Language: Python | License: NOASSERTION | Stargazers: 73 | Issues: 0

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 2701 | Issues: 0