vobecant's starred repositories

POP3D

Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"

Language: Python | Stargazers: 76 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 93 | Issues: 0

S-NeRF

[ICLR 2023] S-NeRF: Neural Radiance Fields for Street Views

Language: Python | License: MIT | Stargazers: 151 | Issues: 0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | License: MIT | Stargazers: 36675 | Issues: 0

NeRF-LOAM

[ICCV2023] NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping

Language: Python | License: MIT | Stargazers: 484 | Issues: 0

SimpleOccupancy

(IEEE TIV) A Comprehensive Framework for 3D Occupancy Estimation in Autonomous Driving

Language: Python | Stargazers: 161 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 32 | Issues: 0

arxiv-latex-cleaner

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Language: Python | License: Apache-2.0 | Stargazers: 4837 | Issues: 0

Awesome-Visual-Transformer

A collection of papers about transformers in vision. Awesome Transformer with Computer Vision (CV)

Stargazers: 3273 | Issues: 0

dino

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Language: Python | License: Apache-2.0 | Stargazers: 5936 | Issues: 0

visualDet3D

Official Repo for Ground-aware Monocular 3D Object Detection for Autonomous Driving / YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection

Language: Python | License: Apache-2.0 | Stargazers: 360 | Issues: 0

vissl

VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.

Language: Jupyter Notebook | License: MIT | Stargazers: 3233 | Issues: 0

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

Language: Python | License: Apache-2.0 | Stargazers: 686 | Issues: 0

DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Language: Python | License: MIT | Stargazers: 5500 | Issues: 0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language: Jupyter Notebook | License: MIT | Stargazers: 22707 | Issues: 0

point-transformer-pytorch

Implementation of the Point Transformer layer, in Pytorch

Language: Python | License: MIT | Stargazers: 580 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9451 | Issues: 0

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language: Python | License: Apache-2.0 | Stargazers: 30126 | Issues: 0

spherecluster

Clustering routines for the unit sphere

Language: Python | License: MIT | Stargazers: 330 | Issues: 0

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language: Python | License: Apache-2.0 | Stargazers: 27130 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 126531 | Issues: 0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language: Python | License: MIT | Stargazers: 18320 | Issues: 0

imaginaire

NVIDIA's Deep Imagination Team's PyTorch Library

Language: Python | License: NOASSERTION | Stargazers: 3955 | Issues: 0

CtCI-6th-Edition-Python

Cracking the Coding Interview 6th Ed. Python Solutions

Language: Python | Stargazers: 4850 | Issues: 0