WangTaoAs

followers

following

stars

Peking University

Tao Wang's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonGPL-3.059470 462 1281

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonBSD-3-Clause25196 223 453

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language:PythonMIT7646 32 284

ChatLaw

ChatLaw：A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

AGPL-3.06712 36 74

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonApache-2.06414 94 667

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonMIT6320 61 133

gluon-cv

Gluon CV Toolkit

Language:PythonApache-2.05787 153 828

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.05713 66 408

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookBSD-3-Clause4517 34 190

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language:PythonApache-2.04075 44 1348

ChatGPT-CodeReview

🐥 A code review bot powered by ChatGPT

Language:JavaScriptISC3842 19 76

awesome-TS-anomaly-detection

List of tools & datasets for anomaly detection on time-series data.

recognize-anything

Open-source and strong foundation image recognition models.

Language:Jupyter NotebookApache-2.02633 26 153

omnimotion

Language:PythonApache-2.02087 126 54

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonNOASSERTION1280 16 118

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:PythonApache-2.01160 29 136

T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Language:Jupyter NotebookNOASSERTION1132 18 76

YOWO

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

Language:Python832 52 98

Transformer-SSL

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

Language:PythonMIT613 6 19

DN-DETR

[CVPR 2022 Oral] Official implementation of DN-DETR

Language:PythonApache-2.0530 16 67

VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language:PythonMIT461 6 51

realtime-action-detection

This repository host the code for real-time action detection paper

Language:MATLABNOASSERTION318 23 54

CLIP-ReID

Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)

Language:PythonMIT237 4 45

SuperGlobal

ICCV 2023 Paper Global Features are All You Need for Image Retrieval and Reranking Official Repository

Language:PythonMIT180 6 18

PFD_Net

[AAAI2022] This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

Language:PythonMIT108 4 13

DeLVM

Language:Python106 2 9

tubelet-transformer

This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection

Language:PythonApache-2.066 2 20

STMixer

[CVPR 2023] STMixer: A One-Stage Sparse Action Detector

Language:Python48 1 4

LexLIP-ICCV23

Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval"

Language:PythonApache-2.037 2 6

PADE

[ICASSP 2024] Parallel Augmentation and Dual Enhancement for Occluded Person Re-identification

Language:PythonMIT13 1 2