Tao Wang (WangTaoAs)

WangTaoAs

Geek Repo

Company:Peking University

Github PK Tool:Github PK Tool

Tao Wang's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonLicense:GPL-3.0Stargazers:59470Issues:462Issues:1281

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25196Issues:223Issues:453

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language:PythonLicense:MITStargazers:7646Issues:32Issues:284

ChatLaw

ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:6414Issues:94Issues:667

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Language:PythonLicense:MITStargazers:6320Issues:61Issues:133

gluon-cv

Gluon CV Toolkit

Language:PythonLicense:Apache-2.0Stargazers:5787Issues:153Issues:828

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5713Issues:66Issues:408

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:4517Issues:34Issues:190

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:4075Issues:44Issues:1348

ChatGPT-CodeReview

🐥 A code review bot powered by ChatGPT

Language:JavaScriptLicense:ISCStargazers:3842Issues:19Issues:76

awesome-TS-anomaly-detection

List of tools & datasets for anomaly detection on time-series data.

recognize-anything

Open-source and strong foundation image recognition models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2633Issues:26Issues:153
Language:PythonLicense:Apache-2.0Stargazers:2087Issues:126Issues:54

VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language:PythonLicense:NOASSERTIONStargazers:1280Issues:16Issues:118

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:PythonLicense:Apache-2.0Stargazers:1160Issues:29Issues:136

T2T-ViT

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1132Issues:18Issues:76

YOWO

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

Transformer-SSL

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

Language:PythonLicense:MITStargazers:613Issues:6Issues:19

DN-DETR

[CVPR 2022 Oral] Official implementation of DN-DETR

Language:PythonLicense:Apache-2.0Stargazers:530Issues:16Issues:67

VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Language:PythonLicense:MITStargazers:461Issues:6Issues:51

realtime-action-detection

This repository host the code for real-time action detection paper

Language:MATLABLicense:NOASSERTIONStargazers:318Issues:23Issues:54

CLIP-ReID

Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)

Language:PythonLicense:MITStargazers:237Issues:4Issues:45

SuperGlobal

ICCV 2023 Paper Global Features are All You Need for Image Retrieval and Reranking Official Repository

Language:PythonLicense:MITStargazers:180Issues:6Issues:18

PFD_Net

[AAAI2022] This is Official implementation for "Pose-guided Feature Disentangling for Occluded Person Re-Identification Based on Transformer" in AAAI2022

Language:PythonLicense:MITStargazers:108Issues:4Issues:13

tubelet-transformer

This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection

Language:PythonLicense:Apache-2.0Stargazers:66Issues:2Issues:20

STMixer

[CVPR 2023] STMixer: A One-Stage Sparse Action Detector

LexLIP-ICCV23

Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval"

Language:PythonLicense:Apache-2.0Stargazers:37Issues:2Issues:6

PADE

[ICASSP 2024] Parallel Augmentation and Dual Enhancement for Occluded Person Re-identification

Language:PythonLicense:MITStargazers:13Issues:1Issues:2