JerryWang22's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
paper-reading
深度学习经典、新论文逐段精读
labelImg
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
WARP-Clash-API
该项目可以让你通过订阅的方式使用Cloudflare WARP+,自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic.
consistency_models
Official repo for consistency models.
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
GroundingGPT
[ACL 2024] GroundingGPT: Language-Enhanced Multi-modal Grounding Model
Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
CBLPRD-330k
China-Balanced-License-Plate-Recognition-Dataset-330k:A balanced dataset of 330,000 images featuring various types of Chinese license plates for recognition tasks, ideal for training and evaluating license plate recognition models.
DeepHDRVideo-Dataset
https://guanyingc.github.io/DeepHDRVideo/
Skin-detection-rgb-hsv-ycrcb-color-spaces
implementation and analysis of skin detection using rgb, hsv and ycrcb color spaces