piaofu110's repositories
Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
BCKD
Official Implementation of Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection
CaFo
[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
CoDA_NeurIPS2023
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
detrex
detrex is a research platform for Transformer-based Instance Recognition algorithms including DETR (ECCV 2020), Deformable-DETR (ICLR 2021), Conditional-DETR (ICCV 2021), DAB-DETR (ICLR 2022), DN-DETR (CVPR 2022), DINO (ICLR 2023), H-DETR (CVPR 2023), MaskDINO (CVPR 2023), DETA (ArXiv 2022), etc.
DiffusionDet
[ICCV2023 Oral] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
DINO
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
DISC-FinLLM
DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services in financial scenarios.
insightface
State-of-the-art 2D and 3D Face Analysis Project
LAGNN
ICML'22: Local Augmentation for Graph Neural Networks
mavrec-code
This code is provided for reproducibility of results in the paper: Dual-View Drone (DVD) Dataset: Can Multi-view Improve Aerial Visual Perception?
MFDC
Multi-Faceted Distillation of Base-Novel Commonality for Few-shot Object Detection, ECCV 2022
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
MSCG-Net
Multi-view Self-Constructing Graph Convolutional Networks with Adaptive Class Weighting Loss for Semantic Segmentation
OpenCOOD
[ICRA 2022] An opensource framework for cooperative detection. Official implementation for OPV2V.
pyGAT
Pytorch implementation of the Graph Attention Network model by Veličković et. al (2017, https://arxiv.org/abs/1710.10903)
PyQt5-YOLOv5
PyQt5 implementation of YOLOv5 GUI
RegionCLIP
[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"
rescience-ijcai2017-230
[Re] Object Detection Meets Knowledge Graphs
STTran
Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
viewer
ML models and internal tensors 3D visualizer
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors