yytzsy's starred repositories
TimeMarker
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
HuggingFace-Download-Accelerator
利用HuggingFace的官方下载工具从镜像网站进行高速下载。
Downstream-Dinov2
Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular depth estimation.
PolygonObjectDetection
This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.
3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
stable-diffusion-webui
Stable Diffusion web UI
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
skeleton-tracing
A new algorithm for retrieving topological skeleton as a set of polylines from binary images
contrastive_association
Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans
Panoptic-SegFormer
This is the official repo of Panoptic SegFormer [CVPR'22]
MaskFormer
Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
3D-PointCloud
Papers and Datasets about Point Cloud.
Panoptic-PolarNet
Implementation for Panoptic-PolarNet (CVPR 2021)
Rotated_IoU
Differentiable IoU of rotated bounding boxes using Pytorch
lift-splat-shoot
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.