jjprincess's repositories
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
you-get
:arrow_double_down: Dumb downloader that scrapes the web
bubogpt
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
SegFormer
Official PyTorch implementation of SegFormer
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
AliceMind
ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Cream
This is a collection of our NAS and Vision Transformer work.
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Video-Captioning
Video Captioning is an encoder decoder mode based on sequence to sequence learning
VideoX
VideoX: a collection of video cross-modal models
dlrm
An implementation of a deep learning recommendation model (DLRM)
CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
OFA
Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
t5-pegasus-chinese
基于GOOGLE T5中文生成式模型的摘要生成/指代消解,支持batch批量生成,多进程
MASTER-pytorch
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
PartialLabelingCSL
Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"
CSRA
Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"
MobileModels
手机品牌型号汇总 | Mobile Models | This repository is licensed under CC BY-NC-SA 4.0
Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.