oyontalas's repositories
ubuntu_init
Ubuntu 20.04, 22.04 快捷安装所需软件
BEVFormer_
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
BoxeR_
Code release for "BoxeR: Box-Attention for 2D and 3D Transformers"
C-language-assignment
大连理工大学 c语言程序设计大作业 课程设计 高分大作业98分
CLIP_
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Deformable-DETR_
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
DINO_
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
DN-DETR_
[CVPR 2022 Oral] Official implementation of DN-DETR
LMDrive_
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
longformer_
Longformer: The Long-Document Transformer
Mask2Former_
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
MaskDINO_
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
MaskFormer_
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
mobilestereonet_
Lightweight stereo matching network based on MobileNet blocks
OccFormer_
[ICCV 2023] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction
OccupancyDETR_
OccupancyDETR: Making Semantic Scene Completion as Straightforward as Object Detection
Pytorch-UNet_
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
segment-anything_
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Swin-Transformer_
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Symphonies_
Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
tensor2tensor_
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
VoxFormer_
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
VQASynth_
Compose multimodal datasets 🎹
ZoeDepth_
Metric depth estimation from a single image