Beast code in Giters

jjprincess's repositories

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

BSD-3-Clause000

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

MIT000

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

MIT000

R2D2

Apache-2.0000

you-get

:arrow_double_down: Dumb downloader that scrapes the web

NOASSERTION000

bubogpt

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

BSD-3-Clause000

SegFormer

Official PyTorch implementation of SegFormer

NOASSERTION000

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

BSD-3-Clause000

Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

000

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

BSD-3-Clause100

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Apache-2.0000

chatGPT-multimodal-bot

MIT000

Cream

This is a collection of our NAS and Vision Transformer work.

MIT000

disco-diffusion

NOASSERTION000

UEDVC

000

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT000

Video-Captioning

Video Captioning is an encoder decoder mode based on sequence to sequence learning

000

VideoX

VideoX: a collection of video cross-modal models

NOASSERTION000

dlrm

An implementation of a deep learning recommendation model (DLRM)

MIT000

ts2_net

000

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

MIT000

mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

NOASSERTION100

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

MIT000

OFA

Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Apache-2.0000

t5-pegasus-chinese

基于GOOGLE T5中文生成式模型的摘要生成/指代消解，支持batch批量生成，多进程

MIT000

MASTER-pytorch

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

MIT000

PartialLabelingCSL

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

MIT000

CSRA

Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"

AGPL-3.0000

MobileModels

手机品牌型号汇总 | Mobile Models | This repository is licensed under CC BY-NC-SA 4.0

000

Informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

Apache-2.0000