Sudhir Yarram's starred repositories

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21794Issues:185Issues:490

awesome-computer-vision

A curated list of awesome computer vision resources

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:16589Issues:116Issues:873

StoryDiffusion

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5842Issues:85Issues:143

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:4048Issues:115Issues:80

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:2268Issues:41Issues:95

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1763Issues:21Issues:179

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

awesome-3D-generation

A curated list of awesome 3d generation papers

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Language:PythonLicense:MITStargazers:873Issues:23Issues:47

Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

GaussianObject

GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting (SIGGRAPH Asia 2024, TOG)

Language:Jupyter NotebookStargazers:852Issues:23Issues:53

Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Scaffold-GS

[CVPR 2024 Highlight] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering

Language:C++License:NOASSERTIONStargazers:746Issues:22Issues:77

EmerNeRF

PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Language:PythonLicense:NOASSERTIONStargazers:556Issues:27Issues:29

Octree-GS

Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians

Language:C++License:NOASSERTIONStargazers:539Issues:25Issues:51

MRL

Code repository for the paper - "Matryoshka Representation Learning"

Language:Jupyter NotebookLicense:MITStargazers:409Issues:7Issues:6

GaussianCube

[NeurIPS 2024] GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

DriveDreamer

[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

GeMap

[ECCV'24] Online Vectorized HD Map Construction using Geometry

Language:PythonLicense:Apache-2.0Stargazers:196Issues:8Issues:18

UrbanArchitect

The official repository of our paper: "Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior"

GeneOH-Diffusion

[ICLR'24] GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion

Language:PythonLicense:MITStargazers:84Issues:4Issues:6

Dimba

Transformer-Mamba Diffusion Models

BP-Net

Implementation of our paper 'Bilateral Propagation Network for Depth Completion'

Language:PythonLicense:MITStargazers:72Issues:4Issues:21

dynmf

(ECCV '24) DynMF: Neural Motion Factorization for Real-time Dynamic View Synthesis with 3D Gaussian Splatting

decomp_diffusion

[ICML 2024] Compositional Image Decomposition with Diffusion Models

DDMI

Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations", ICLR 2024

Language:PythonLicense:MITStargazers:20Issues:5Issues:6
Language:PythonStargazers:3Issues:1Issues:0