No One's starred repositories
segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
ActivityNet-Entities
A Dataset for Grounded Video Description
DemystifyLocalViT
Official code for paper "On the Connection between Local Attention and Dynamic Depth-wise Convolution" ICLR 2022 Spotlight
ControlNet
Let us control diffusion models!
X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
labelImg
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
cityscapes-to-coco-conversion
Cityscapes to CoCo Format Conversion Tool for Mask-RCNN and Detectron
llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
OccNet-Course
国内首个占据栅格网络全栈课程《从BEV到Occupancy Network,算法原理与工程实践》,包含端侧部署。Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code) 在线课程主页:http://111.229.117.200:8100/ (作者独立搭建)