Shikun Liu (lorenmt)

Company: Meta

Location: London, UK

Home Page: shikun.io

Twitter: @liu_shikun

Organizations
dyson-robotics-lab

Shikun Liu's starred repositories

StableCascade

Official Code for Stable Cascade

Language: Jupyter Notebook | License: MIT | Stargazers: 6448 | Issues: 61 | Issues: 121

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language: Python | License: NOASSERTION | Stargazers: 4739 | Issues: 54 | Issues: 115

kubric

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2235 | Issues: 42 | Issues: 184

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 2056 | Issues: 43 | Issues: 67

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2036 | Issues: 34 | Issues: 79

ml-hypersim

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Language: Python | License: NOASSERTION | Stargazers: 1626 | Issues: 42 | Issues: 68

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language: Python | License: BSD-3-Clause | Stargazers: 1376 | Issues: 22 | Issues: 38

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language: Python | License: Apache-2.0 | Stargazers: 1275 | Issues: 17 | Issues: 46

MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Language: Python | License: NOASSERTION | Stargazers: 1129 | Issues: 14 | Issues: 106

DSINE

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 632 | Issues: 9 | Issues: 7

GeoWizard

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Language: Python | License: MIT | Stargazers: 557 | Issues: 10 | Issues: 18

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language: Python | License: MIT | Stargazers: 495 | Issues: 30 | Issues: 33

MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Language: Python | License: Apache-2.0 | Stargazers: 488 | Issues: 18 | Issues: 45

V3D

V3D: Video Diffusion Models are Effective 3D Generators

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 403 | Issues: 11 | Issues: 44

TCD

Official Repository of the paper "Trajectory Consistency Distillation"

Dataset

News: the 7k dataset is ready for download.

Language: HTML | License: NOASSERTION | Stargazers: 260 | Issues: 13 | Issues: 22

laion-3d

Collect large 3d dataset and build models

PUG

This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 224 | Issues: 8 | Issues: 2

EscherNet

[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis

Language: Python | License: NOASSERTION | Stargazers: 219 | Issues: 9 | Issues: 8

flatten

Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 172 | Issues: 8 | Issues: 3

super_primitive

[CVPR'24, Demo Track Honourable Mention] SuperPrimitive: Scene Reconstruction at a Primitive Level

Language: Python | License: NOASSERTION | Stargazers: 152 | Issues: 7 | Issues: 1

MVDiffusion_plusplus

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

gta

[ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers

Language: Python | License: MIT | Stargazers: 116 | Issues: 13 | Issues: 1

MorpheuS

[CVPR'24] MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

Language: Python | License: Apache-2.0 | Stargazers: 115 | Issues: 10 | Issues: 0

Dream2Real

[ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models

Language: Python | Stargazers: 45 | Issues: 5 | Issues: 0

T5-Textual-Inversion

Textual Inversion for DeepFloyd IF

Language: Jupyter Notebook | License: AGPL-3.0 | Stargazers: 43 | Issues: 3 | Issues: 1