Beast code in Giters

Eugene Zatepyakin's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT66040 5500

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.024637 193 3917

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Language:Python1125 14 14

EfficientFormer

EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]

Language:PythonNOASSERTION972 37 58

pix2seq

Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)

Language:Jupyter NotebookApache-2.0848 18 48

Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language:PythonApache-2.0741 7 35

NextFace

A high-fidelity 3D face reconstruction library from monocular RGB image(s)

Language:Jupyter NotebookGPL-3.0699 24 77

NeuralRecon-W

Code for "Neural 3D Reconstruction in the Wild", SIGGRAPH 2022 (Conference Proceedings)

Language:PythonApache-2.0689 52 47

LIA

[ICLR 22] Latent Image Animator: Learning to Animate Images via Latent Space Navigation

Language:PythonNOASSERTION583 28 23

DPVO

Deep Patch Visual Odometry/SLAM

Language:C++MIT574 19 60

MICA

MICA - Towards Metrical Reconstruction of Human Faces [ECCV2022]

Language:PythonNOASSERTION532 9 60

HybVIO

HybVIO visual-inertial odometry and SLAM system

Language:C++GPL-3.0446 14 42

QuadTreeAttention

QuadTree Attention for Vision Transformers (ICLR2022)

Language:Jupyter Notebook336 11 29

classifier-free-guidance-pytorch

Implementation of Classifier Free Guidance in Pytorch, with emphasis on text conditioning, and flexibility to include multiple text embedding models

Language:PythonMIT310 8 4

shapy

CVPR 2022 - Official code repository for the paper: Accurate 3D Body Shape Regression using Metric and Semantic Attributes.

Language:Python302 15 54

mega

Sequence modeling with Mega.

Language:PythonMIT296 128 16

R-VIO2

Square-Root Robocentric Visual-Inertial Odometry with Online Spatiotemporal Calibration

Language:C++GPL-3.0227 12 9

SelfBlendedImages

[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376

Language:PythonNOASSERTION195 7 45

Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language:PythonMIT192 6 8

rpg_vision-based_slam

This repo contains the code of the paper "Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study", RA-L 2022.

Language:C++GPL-3.0191 13 2

DART

DART: Articulated Hand Model with Diverse Accessories and Rich Textures (NeurIPS 2022 - Datasets and Benchmarks Track)

Language:Python132 3 16

pnec

[CVPR 2022] README.md The Probabilistic Normal Epipolar Constraint for Frame-To-Frame Rotation Optimization under Uncertain Feature Positions

Language:C++BSD-3-Clause132 18 4

NIMBLE_model

repo for NIMBLE: A Non-rigid Hand Model with Bones and Muscles

Language:PythonMIT116 5 17

PCAccumulation

[ECCV 2022] Dynamic 3D Scene Analysis by Point Cloud Accumulation

Language:PythonMIT115 7 8

A colorization framework that disentangles the color multimodality and the structural consistency via adaptively located anchors, so that both aspects can be achieved effectively. [SIGGRAPH Asia 2022]

Language:PythonMIT113 6 5

perceiver-ar-pytorch

Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch

Language:PythonMIT86 4 8

differentiable-SDF-pytorch

Implementation of Differentiable Sign-Distance Function Rendering - in Pytorch

MIT69 17 1

m2d2

M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer

Language:Python53 2 1

Retriever

[ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"

53 18 2

inspirit