Dangwei Li's repositories

pedestrian-attribute-recognition-pytorch

A simple baseline for pedestrian attribute recognition in surveillance scenarios

Language:PythonStargazers:1Issues:2Issues:0

Person-ReID-Core-Pytorch

Person Re-identification with Pytorch

Language:PythonStargazers:1Issues:0Issues:0

2dtan

An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI'2020).

Language:PythonStargazers:0Issues:1Issues:0

AlignPS

Code for CVPR 2021 paper: Anchor-Free Person Search

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

License:MITStargazers:0Issues:0Issues:0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

License:MITStargazers:0Issues:0Issues:0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

License:MITStargazers:0Issues:0Issues:0

DALI

A library containing both highly optimized building blocks and an execution engine for data pre-processing in deep learning applications

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

DeepCTR-Torch

【PyTorch】Easy-to-use,Modular and Extendible package of deep-learning based CTR models.

License:Apache-2.0Stargazers:0Issues:0Issues:0

DomainWordsDict

DomainWordsDict, Chinese words dict that contains more than 68 domains, which can be used as text classification、knowledge enhance task。涵盖68个领域、共计916万词的专业词典知识库,可用于文本分类、知识增强、领域词汇库扩充等自然语言处理应用。

Stargazers:0Issues:0Issues:0

FairMOT

A simple baseline for one-shot multi-object tracking

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

License:MITStargazers:0Issues:0Issues:0

GTM-Transformer

Official Implementation of paper: Well Googled is Half Done: Multimodal Forecasting of New FashionProduct Sales with Image-based Google Trends

License:MITStargazers:0Issues:0Issues:0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

License:MITStargazers:0Issues:0Issues:0

lite.ai.toolkit

🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOX, YOLOv7, YOLOv5. MNN, NCNN, TNN, ONNXRuntime.

Language:C++License:GPL-3.0Stargazers:0Issues:0Issues:0

mega.pytorch

Memory Enhanced Global-Local Aggregation for Video Object Detection, CVPR2020

License:NOASSERTIONStargazers:0Issues:0Issues:0

mmdetection

OpenMMLab Detection Toolbox and Benchmark

License:Apache-2.0Stargazers:0Issues:0Issues:0

node-fluent-ffmpeg

A fluent API to FFMPEG (http://www.ffmpeg.org)

Language:HTMLLicense:MITStargazers:0Issues:1Issues:0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

License:Apache-2.0Stargazers:0Issues:0Issues:0

pretrained-models.pytorch

Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

PyAV

Pythonic bindings for FFmpeg's libraries.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

PytorchToCaffe

Pytorch model to caffe model, supported pytorch 0.3, 0.3.1, 0.4, 0.4.1 ,1.0 , 1.0.1 , 1.2 ,1.3 .notice that only pytorch 1.1 have some bugs

License:MITStargazers:0Issues:0Issues:0

pytrends

Pseudo API for Google Trends

License:NOASSERTIONStargazers:0Issues:0Issues:0

SGG_from_NLS

Code repository for our paper "Learning to Generate Scene Graph from Natural Language Supervision" in ICCV 2021.

License:NOASSERTIONStargazers:0Issues:0Issues:0

SwinBERT

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

License:MITStargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0