Eugene Zatepyakin's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66040Issues:550Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24637Issues:193Issues:3917

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

EfficientFormer

EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]

Language:PythonLicense:NOASSERTIONStargazers:972Issues:37Issues:58

pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:848Issues:18Issues:48

Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language:PythonLicense:Apache-2.0Stargazers:741Issues:7Issues:35

NextFace

A high-fidelity 3D face reconstruction library from monocular RGB image(s)

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:699Issues:24Issues:77

NeuralRecon-W

Code for "Neural 3D Reconstruction in the Wild", SIGGRAPH 2022 (Conference Proceedings)

Language:PythonLicense:Apache-2.0Stargazers:689Issues:52Issues:47

LIA

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Language:PythonLicense:NOASSERTIONStargazers:583Issues:28Issues:23

DPVO

Deep Patch Visual Odometry/SLAM

Language:C++License:MITStargazers:574Issues:19Issues:60

MICA

MICA - Towards Metrical Reconstruction of Human Faces [ECCV2022]

Language:PythonLicense:NOASSERTIONStargazers:532Issues:9Issues:60

HybVIO

HybVIO visual-inertial odometry and SLAM system

Language:C++License:GPL-3.0Stargazers:446Issues:14Issues:42

QuadTreeAttention

QuadTree Attention for Vision Transformers (ICLR2022)

Language:Jupyter NotebookStargazers:336Issues:11Issues:29

classifier-free-guidance-pytorch

Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models

Language:PythonLicense:MITStargazers:310Issues:8Issues:4

shapy

CVPR 2022 - Official code repository for the paper: Accurate 3D Body Shape Regression using Metric and Semantic Attributes.

mega

Sequence modeling with Mega.

Language:PythonLicense:MITStargazers:296Issues:128Issues:16

R-VIO2

Square-Root Robocentric Visual-Inertial Odometry with Online Spatiotemporal Calibration

Language:C++License:GPL-3.0Stargazers:227Issues:12Issues:9

SelfBlendedImages

[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376

Language:PythonLicense:NOASSERTIONStargazers:195Issues:7Issues:45

Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language:PythonLicense:MITStargazers:192Issues:6Issues:8

rpg_vision-based_slam

This repo contains the code of the paper "Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study", RA-L 2022.

Language:C++License:GPL-3.0Stargazers:191Issues:13Issues:2

SQuant

SQuant [ICLR22]

DART

DART: Articulated Hand Model with Diverse Accessories and Rich Textures (NeurIPS 2022 - Datasets and Benchmarks Track)

pnec

[CVPR 2022] README.md The Probabilistic Normal Epipolar Constraint for Frame-To-Frame Rotation Optimization under Uncertain Feature Positions

Language:C++License:BSD-3-ClauseStargazers:132Issues:18Issues:4

NIMBLE_model

repo for NIMBLE: A Non-rigid Hand Model with Bones and Muscles

Language:PythonLicense:MITStargazers:116Issues:5Issues:17

PCAccumulation

[ECCV 2022] Dynamic 3D Scene Analysis by Point Cloud Accumulation

Language:PythonLicense:MITStargazers:115Issues:7Issues:8

DisentangledColorization

A colorization framework that disentangles the color multimodality and the structural consistency via adaptively located anchors, so that both aspects can be achieved effectively. [SIGGRAPH Asia 2022]

Language:PythonLicense:MITStargazers:113Issues:6Issues:5

perceiver-ar-pytorch

Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch

Language:PythonLicense:MITStargazers:86Issues:4Issues:8

differentiable-SDF-pytorch

Implementation of Differentiable Sign-Distance Function Rendering - in Pytorch

m2d2

M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer

Retriever

[ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"