jim4399266's starred repositories

Retrieval-Augmented-Visual-Question-Answering

This is the official repository for Retrieval Augmented Visual Question Answering

Language:PythonLicense:GPL-3.0Stargazers:131Issues:0Issues:0
Language:PythonStargazers:25Issues:0Issues:0

vit-explain

Explainability for Vision Transformers

Language:PythonLicense:MITStargazers:790Issues:0Issues:0

Agent-Attention

Official repository of Agent Attention (ECCV2024)

Language:PythonStargazers:446Issues:0Issues:0

HAT

Implementation of our paper, 'Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.'

Language:PythonLicense:GPL-3.0Stargazers:18Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10886Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54693Issues:0Issues:0

HREM

Learning Semantic Relationship among Instances for Image-Text Matching, CVPR, 2023

Language:PythonLicense:Apache-2.0Stargazers:84Issues:0Issues:0

VSL

The code of "Image-text Retrieval via Preserving Main Semantic of Vision" in ICME 2023.

Language:PythonStargazers:12Issues:0Issues:0

grouped-query-attention-pytorch

(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)

Language:PythonLicense:MITStargazers:94Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:319Issues:0Issues:0

fitlog

fitlog是一款在深度学习训练中用于辅助用户记录日志和管理代码的工具

Language:PythonLicense:Apache-2.0Stargazers:1457Issues:0Issues:0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:52282Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9302Issues:0Issues:0

ptp

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

Language:PythonLicense:Apache-2.0Stargazers:147Issues:0Issues:0

MaaAssistantArknights

《明日方舟》小助手,全日常一键长草!| An Arknights assistant compatible with EN, JP, KR, ZH_TW clients

Language:C++License:AGPL-3.0Stargazers:12Issues:0Issues:0

Liquipedia.net-page

Liquipedia landing page

Language:PHPStargazers:26Issues:0Issues:0

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:19259Issues:0Issues:0

Pretraining-T5-PyTorch-Lightning

Collection of scripts to pretrain T5 in unsupervised text, using PyTorch Lightning. CORD-19 pretraining provided as example.

Language:PythonStargazers:29Issues:0Issues:0

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Language:PythonLicense:MITStargazers:4587Issues:0Issues:0

bert4torch-fork

参考bert4keras的pytorch实现

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

RoFormer_pytorch

RoFormer V1 & V2 pytorch

Language:PythonLicense:Apache-2.0Stargazers:448Issues:0Issues:0

nlpcda

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

Language:PythonLicense:Apache-2.0Stargazers:1729Issues:0Issues:0

kaggle-feedback-effectiveness-1st-place-solution

Winning solution for the Kaggle Feedback Prize Challenge.

Language:Jupyter NotebookLicense:MITStargazers:64Issues:0Issues:0

vpncn.github.io

2024**翻墙软件VPN推荐以及科学上网避坑,稳定好用。对比SSR机场、蓝灯、V2ray、老王VPN、VPS搭建梯子等科学上网与翻墙软件,**最新科学上网翻墙梯子VPN下载推荐,访问Chatgpt。

Language:HTMLStargazers:14961Issues:0Issues:0

DeBERTa

The implementation of DeBERTa

Language:PythonLicense:MITStargazers:1935Issues:0Issues:0

OpenBG-IMG

Baselines for CCKS 2022 Task "Link Prediction for Multimodal Product Knowledge Graph"

Language:PythonStargazers:69Issues:0Issues:0

TCL

code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Language:PythonLicense:MITStargazers:257Issues:0Issues:0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:4514Issues:0Issues:0

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27601Issues:0Issues:0