Beast code in Giters

isrkhou's starred repositories

MAD

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

Language:PythonMIT14400

moment_detr

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Language:PythonMIT25400

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:Python2247600

reid-strong-baseline

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Language:PythonMIT223600

:bouncing_ball_person: Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial

Language:PythonMIT404000

Prompt-Can-Anything

You can do anything by sota AI with prompt ,auto AI tools , VL larger model fine and project

Language:Jupyter NotebookGPL-3.017700

All-in-One-Gait

TrackGait is a sub project of OpenGait. Implemented a gait recognition system.

Language:Python7000

PLIP

The official code of "PLIP: Language-Image Pre-training for Person Representation Learning"

Language:PythonMIT9100

UniVTG

[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Language:PythonMIT31000

SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Language:PythonBSD-3-Clause17400

DetGPT

Language:Jupyter NotebookBSD-3-Clause75000

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION815700

UniHCP

Official PyTorch implementation of UniHCP

Language:PythonMIT14600

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Language:PythonApache-2.0667300

SOLIDER

A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent

Language:PythonApache-2.0189200

PeekingDuck

A modular framework built to simplify Computer Vision inference workloads.

Language:PythonApache-2.016100

DINO

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Language:PythonApache-2.0213300

playground

A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.

Language:PythonApache-2.0107900

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonApache-2.0320200

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonCC-BY-4.0112000

isrkhou

isrkhou's starred repositories

MAD

moment_detr

insightface

reid-strong-baseline

Person_reID_baseline_pytorch

Prompt-Can-Anything

All-in-One-Gait

PLIP

UniVTG

SeViLA

DetGPT

ImageBind

UniHCP

modelscope

SOLIDER

PeekingDuck

DINO

playground

scenic

Video-ChatGPT

InternVideo

towhee

Video-LLaMA

OpenSeeFace

human

GroundingDINO

Grounded-Segment-Anything

segment-anything

deepface

mmdetection