Muhammed Kocabas (mkocabas)

Company: Max Planck Institute for Intelligent Systems

Home Page: https://ps.is.mpg.de/person/mkocabas

Muhammed Kocabas's starred repositories

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language: Python | License: Apache-2.0 | Stargazers: 4114 | Issues: 56 | Issues: 137

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3618 | Issues: 112 | Issues: 62

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance.

Language: Python | License: MIT | Stargazers: 3229 | Issues: 37 | Issues: 207

llm

Access large language models from the command-line

Language: Python | License: Apache-2.0 | Stargazers: 3227 | Issues: 31 | Issues: 366

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 2966 | Issues: 60 | Issues: 86

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Language: Python | License: MIT | Stargazers: 2364 | Issues: 36 | Issues: 250

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | Stargazers: 1518 | Issues: 10 | Issues: 124

Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Language: Python | License: CC0-1.0 | Stargazers: 853 | Issues: 23 | Issues: 96

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Language: Python | License: MIT | Stargazers: 780 | Issues: 17 | Issues: 28

momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Language: Python | License: MIT | Stargazers: 650 | Issues: 29 | Issues: 48

dift

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Language: Python | License: MIT | Stargazers: 517 | Issues: 8 | Issues: 21

RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Language: Python | License: NOASSERTION | Stargazers: 442 | Issues: 20 | Issues: 15

MVDiffusion

MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion, NeurIPS 2023 (spotlight)

record3d

Accompanying library for the Record3D iOS app (https://record3d.app/). Allows you to receive RGBD stream from iOS devices with TrueDepth camera(s).

Language: C | License: LGPL-2.1 | Stargazers: 368 | Issues: 16 | Issues: 81

gaussian_surfels

Implementation of the SIGGRAPH 2024 conference paper "High-quality Surface Reconstruction using Gaussian Surfels".

DiffMOT

code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction

Language: Python | License: MIT | Stargazers: 344 | Issues: 6 | Issues: 14

PIP

A real-time system that captures physically correct human motion, joint torques, and ground reaction forces with only 6 inertial measurement units

Language: Python | License: GPL-3.0 | Stargazers: 302 | Issues: 16 | Issues: 40

idisc

iDisc: Internal Discretization for Monocular Depth Estimation [CVPR 2023]

Language: Python | License: NOASSERTION | Stargazers: 279 | Issues: 13 | Issues: 23

ZipIt

A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training

Language: Python | License: MIT | Stargazers: 262 | Issues: 3 | Issues: 24

flowsam

Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

Language: Python | License: Apache-2.0 | Stargazers: 203 | Issues: 0 | Issues: 0

dot

Dense Optical Tracking: Connecting the Dots

Language: Python | License: MIT | Stargazers: 202 | Issues: 12 | Issues: 15

ml-hugs

Official repository of HUGS: Human Gaussian Splats (CVPR 2024)

Language: Python | License: NOASSERTION | Stargazers: 91 | Issues: 11 | Issues: 3

ml-4m

4M: Massively Multimodal Masked Modeling (NeurIPS 2023 Spotlight)

Language: Python | License: Apache-2.0 | Stargazers: 90 | Issues: 0 | Issues: 0

TokenHMR

[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation

SiTH

[CVPR 2024] SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion

Language: Python | License: MIT | Stargazers: 62 | Issues: 8 | Issues: 3

Language: Jupyter Notebook | Stargazers: 61 | Issues: 2 | Issues: 0

SCOPE

Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation

Language: C++ | License: MIT | Stargazers: 57 | Issues: 6 | Issues: 5

stmc

Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation 2024.

Language: Python | License: NOASSERTION | Stargazers: 52 | Issues: 8 | Issues: 1