There are 2 repositories under the multi-view topic.
[ICCV 2025] Official implementation of the paper “MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control”
Cameras as Relative Positional Encoding
CVPR2023 | MVImgNet: A Large-scale Dataset of Multi-view Images
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
Attention-Guided Version of 2D UNet for Automatic Brain Tumor Segmentation
[CVPR'22 Best Paper Finalist] Official PyTorch implementation of the method presented in "Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation"
A Python package housing a collection of deep-learning multi-modal data fusion method pipelines! From data loading, to training, to evaluation - fusilli's got you covered 🌸
[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection
PyTorch implementation for COMPLETER: Incomplete Multi-view Clustering via Contrastive Prediction (CVPR 2021)
[TII 2023] A Cross-View Transformer Network for LiDAR-Based Place Recognition in Autonomous Driving Environments.
[CVPR2024] Official implementation of "RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception"
An Open-Source Package for Chinese Open-domain Conversational Chatbot (Chinese chit-chat dialogue system; one-click deployment of a WeChat chit-chat bot)
[ECCV 2024] PyTorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
The Official PyTorch implementation of "3D Human Action Representation Learning via Cross-View Consistency Pursuit" in CVPR 2021
Video recording app with sub-millisecond synchronization accuracy for multiple Android smartphones, useful for creating affordable and easy-to-setup multi-view camera systems for robotics, SLAM, 3D-reconstruction, panorama stitching
[JAMA] "Complete AI-Enabled Echocardiography Interpretation with Multitask Deep Learning" by Gregory Holste, Evangelos K. Oikonomou, Márton Tokodi, Attila Kovács, Zhangyang Wang, & Rohan Khera
Official source code to CVPR'20 paper, "When2com: Multi-Agent Perception via Communication Graph Grouping"
PyTorch implementation for Dual Contrastive Prediction for Incomplete Multi-view Representation Learning (TPAMI'22)
This repository contains the code of AirPose, our multi-view fusion network for human pose and shape estimation
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
Official Code for "Lifting Multi-View Detection and Tracking to the Bird’s Eye View"
Implementation of "Long-Range Grouping Transformer for Multi-View 3D Reconstruction" [ICCV 2023]
Blender addon to configure and render light fields
Myocardial Infarction Detection
Drug Similarity Integration Through Attentive Multi-view Graph Auto-Encoders (IJCAI 2018)
[AAAI 2024] ICMVC: Incomplete Contrastive Multi-View Clustering with High-confidence Guiding
[ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance