Goforit's repositories
SegDrawer
Simple static web-based mask drawer, supporting semantic segmentation with Segment Anything Model (SAM) and video segmentation with XMem.
Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.
Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library
E2FGVI
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
RL4LMs
A modular RL library to fine-tune language models to human preferences
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
yolov7-segmentation
YOLOv7 Instance Segmentation using OpenCV and PyTorch
labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
whisper.cpp
Port of OpenAI's Whisper model in C/C++
whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
coco-annotator
:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints
guided-inpainting
Towards Unified Keyframe Propagation Models
gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
mmtracking
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Tracking-Solov2-Deepsort
The MOT implement by Solov2+DeepSORT with C++ (Libtorch, TensorRT).
PixelLib
Visit PixelLib's official documentation https://pixellib.readthedocs.io/en/latest/
Video-Captioning
Video Captioning is an encoder decoder mode based on sequence to sequence learning
Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
websocket-mse-demo
Stream H264 to browsers with websocket and w3 media source extensions