woody0105

Goforit's repositories

SegDrawer

Simple static web-based mask drawer, supporting semantic segmentation with Segment Anything Model (SAM) and video segmentation with XMem.

Language:Python000

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

MIT000

Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences.

BSD-3-Clause000

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

AGPL-3.0000

i-Code

MIT000

super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library

Apache-2.0000

E2FGVI

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

NOASSERTION000

gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

Apache-2.0000

RL4LMs

A modular RL library to fine-tune language models to human preferences

Apache-2.0000

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

MIT000

CodeRL

This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).

BSD-3-Clause000

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

MIT000

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Apache-2.0000

yolov7-segmentation

YOLOv7 Instance Segmentation using OpenCV and PyTorch

GPL-3.0000

labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

NOASSERTION000

smarttranscoding

Language:C000

whisper.cpp

Port of OpenAI's Whisper model in C/C++

MIT000

whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

MIT000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT000

coco-annotator

:pencil2: Web-based image segmentation tool for object detection, localization, and keypoints

MIT000

trlx

Apache-2.0000

guided-inpainting

Towards Unified Keyframe Propagation Models

MIT000

segmentation

Language:C000

gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

MIT000

mmtracking

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Apache-2.0000

Tracking-Solov2-Deepsort

The MOT implement by Solov2+DeepSORT with C++ (Libtorch, TensorRT).

MIT000

PixelLib

Visit PixelLib's official documentation https://pixellib.readthedocs.io/en/latest/

MIT000

Video-Captioning

Video Captioning is an encoder decoder mode based on sequence to sequence learning

000

Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

NOASSERTION000

websocket-mse-demo

Stream H264 to browsers with websocket and w3 media source extensions

000