Shi Pu's repositories
CVPR2022-AURL
This is the implementation of our AURL paper "Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification".
cc2dataset
Easily convert Common Crawl to a dataset of captions and documents: image/text, audio/text, video/text, ...
Language: Python | License: MIT
CLIP
Contrastive Language-Image Pretraining
Language: Jupyter Notebook | License: MIT
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Language: Python | License: BSD-3-Clause