Adasunnylily's starred repositories

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Awesome-LLM-for-RecSys

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

License: MIT | 91100

llm-action

This project aims to share the technical principles and hands-on experience related to large models.

Language: HTML | License: Apache-2.0 | 875800

MESM

The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)

Language: Python | License: MIT | 2800

MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

Language: Python | License: MIT | 11900

FROSTER

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Language: Python | License: NOASSERTION | 5200

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language: Python | License: MIT | 66500

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Language: Python | License: MIT | 151100

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

License: MIT | 166800

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | 954300

Awesome-Prompting-on-Vision-Language-Model

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

33200

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

1138200

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

49800

ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Language: Python | License: NOASSERTION | 88700

ConceptDiscoveryModels

This is the official implementation of the Concept Discovery Models paper.

Language: Python | 800

diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Language: Python | 38100

MetaVL

Language: Python | License: MIT | 100

Label-free-CBM

A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled concept data

Language: Jupyter Notebook | 6700

LaBo

CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification

Language: Python | 6700

causalml

Uplift modeling and causal inference with machine learning algorithms

Language: Python | License: NOASSERTION | 495000

srv6-wtmc2022

Language: Python | 200

SRPerf

A Performance Evaluation Framework for Segment Routing

Language: Python | 1400

Shunted-Transformer

Language: Python | 20900

metaformer

MetaFormer Baselines for Vision (TPAMI 2024)

Language: Python | License: Apache-2.0 | 39400

DHVT

This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets".

Language: Python | License: Apache-2.0 | 5100

evit

Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations

Language: Python | License: Apache-2.0 | 16500

DynamicViT

[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

Language: Jupyter Notebook | License: MIT | 55000

Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Language: Python | 396300

dgmn

Language: Python | License: MIT | 5600

DAT

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

Language: Python | License: Apache-2.0 | 75700