jjprincess's repositories

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

License: BSD-3-Clause, Stargazers: 0, Issues: 0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

License: MIT, Stargazers: 0, Issues: 0

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

License: MIT, Stargazers: 0, Issues: 0
License: Apache-2.0, Stargazers: 0, Issues: 0

you-get

Dumb downloader that scrapes the web

License: NOASSERTION, Stargazers: 0, Issues: 0

bubogpt

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs

License: BSD-3-Clause, Stargazers: 0, Issues: 0

SegFormer

Official PyTorch implementation of SegFormer

License: NOASSERTION, Stargazers: 0, Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

License: BSD-3-Clause, Stargazers: 0, Issues: 0

Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Stargazers: 0, Issues: 0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

License: BSD-3-Clause, Stargazers: 1, Issues: 0

AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

License: Apache-2.0, Stargazers: 0, Issues: 0
License: MIT, Stargazers: 0, Issues: 0

Cream

This is a collection of our NAS and Vision Transformer work.

License: MIT, Stargazers: 0, Issues: 0
License: NOASSERTION, Stargazers: 0, Issues: 0
Stargazers: 0, Issues: 0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

License: MIT, Stargazers: 0, Issues: 0

Video-Captioning

Video Captioning is an encoder-decoder model based on sequence-to-sequence learning

Stargazers: 0, Issues: 0

VideoX

VideoX: a collection of video cross-modal models

License: NOASSERTION, Stargazers: 0, Issues: 0

dlrm

An implementation of a deep learning recommendation model (DLRM)

License: MIT, Stargazers: 0, Issues: 0
Stargazers: 0, Issues: 0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

License: MIT, Stargazers: 0, Issues: 0

mae

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

License: NOASSERTION, Stargazers: 1, Issues: 0

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

License: MIT, Stargazers: 0, Issues: 0

OFA

Official repository of OFA. Paper: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

License: Apache-2.0, Stargazers: 0, Issues: 0

t5-pegasus-chinese

Summarization and coreference resolution based on Google's T5 Chinese generative model, with support for batch generation and multiprocessing

License: MIT, Stargazers: 0, Issues: 0

MASTER-pytorch

Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)

License: MIT, Stargazers: 0, Issues: 0

PartialLabelingCSL

Official implementation for the paper: "Multi-label Classification with Partial Annotations using Class-aware Selective Loss"

License: MIT, Stargazers: 0, Issues: 0

CSRA

Official code of ICCV2021 paper "Residual Attention: A Simple but Effective Method for Multi-Label Recognition"

License: AGPL-3.0, Stargazers: 0, Issues: 0

MobileModels

Collection of mobile phone brands and models | Mobile Models | This repository is licensed under CC BY-NC-SA 4.0

Stargazers: 0, Issues: 0

Informer2020

The GitHub repository for the paper "Informer" accepted by AAAI 2021.

License: Apache-2.0, Stargazers: 0, Issues: 0