vanpersie32's repositories
chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
CondensedMovies
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
Multigpu-Bert
This repository is an enhanced version of Google's official BERT with support for multi-GPU pretraining and validation.
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
CG-VLM
This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.
Chinese-LLaVA
An open-source, commercially usable multimodal model that supports bilingual (Chinese and English) visual-text dialogue.
CLIP
Contrastive Language-Image Pretraining
GPT2-Chinese
Chinese version of GPT2 training code, using BERT or BPE tokenizer.
laion-prepro
Get hundreds of millions of image+URL pairs from the crawling at home dataset and preprocess them
LLaVA-RLHF
Aligning LMMs with Factually Augmented RLHF
lynx-llm
paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/
MAVE
The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, multi-sourced, diverse dataset for product attribute extraction study.
mmdetection
OpenMMLab Detection Toolbox and Benchmark
pan_pp.pytorch
Official implementations of PSENet, PAN and PAN++.
PDVC
End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)
python-is-cool
Cool Python features for machine learning that I used to be too afraid to use
Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
StableStudio
StableStudio
torchnvjpeg
Decode JPEG image on GPU using PyTorch
Video-Swin-Transformer
This is an official implementation for "Video Swin Transformer".
vilbert-multi-task
Multi Task Vision and Language
VIMER
Repository of vision pre-training foundation models.
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
xtuner
A toolkit for efficiently fine-tuning LLMs (InternLM, Llama, Baichuan, Qwen, ChatGLM)