Goforit's repositories

SegDrawer

Simple static web-based mask drawer, supporting semantic segmentation with Segment Anything Model (SAM) and video segmentation with XMem.

Language: Python | Stargazers: 0 | Issues: 0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

License: MIT | Stargazers: 0 | Issues: 0

Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

License: AGPL-3.0 | Stargazers: 0 | Issues: 0

super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library

License: Apache-2.0 | Stargazers: 0 | Issues: 0

E2FGVI

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

License: NOASSERTION | Stargazers: 0 | Issues: 0

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

RL4LMs

A modular RL library to fine-tune language models to human preferences

License: Apache-2.0 | Stargazers: 0 | Issues: 0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License: MIT | Stargazers: 0 | Issues: 0

CodeRL

This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

License: MIT | Stargazers: 0 | Issues: 0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

License: Apache-2.0 | Stargazers: 0 | Issues: 0

yolov7-segmentation

YOLOv7 Instance Segmentation using OpenCV and PyTorch

License: GPL-3.0 | Stargazers: 0 | Issues: 0

labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

License: NOASSERTION | Stargazers: 0 | Issues: 0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

License: MIT | Stargazers: 0 | Issues: 0

whisper-playground

Build real-time speech-to-text web apps using OpenAI's Whisper: https://openai.com/blog/whisper/

License: MIT | Stargazers: 0 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

License: MIT | Stargazers: 0 | Issues: 0

coco-annotator

Web-based image segmentation tool for object detection, localization, and keypoints

License: MIT | Stargazers: 0 | Issues: 0

guided-inpainting

Towards Unified Keyframe Propagation Models

License: MIT | Stargazers: 0 | Issues: 0

gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

License: MIT | Stargazers: 0 | Issues: 0

mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Tracking-Solov2-Deepsort

A multiple object tracking (MOT) implementation using SOLOv2 + DeepSORT in C++ (LibTorch, TensorRT).

License: MIT | Stargazers: 0 | Issues: 0

PixelLib

Visit PixelLib's official documentation https://pixellib.readthedocs.io/en/latest/

License: MIT | Stargazers: 0 | Issues: 0

Video-Captioning

Video Captioning is an encoder-decoder model based on sequence-to-sequence learning

Stargazers: 0 | Issues: 0

Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

License: NOASSERTION | Stargazers: 0 | Issues: 0

websocket-mse-demo

Stream H.264 to browsers with WebSocket and W3C Media Source Extensions

Stargazers: 0 | Issues: 0