Adasunnylily

Adasunnylily's starred repositories

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Stargazers: 817 | Issues: 0

Awesome-LLM-for-RecSys

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

License: MIT | Stargazers: 911 | Issues: 0

llm-action

This project aims to share the technical principles behind large models, along with hands-on practical experience.

Language: HTML | License: Apache-2.0 | Stargazers: 8758 | Issues: 0

MESM

The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)

Language: Python | License: MIT | Stargazers: 28 | Issues: 0

MKT

Official implementation of "Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer".

Language: Python | License: MIT | Stargazers: 119 | Issues: 0

FROSTER

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Language: Python | License: NOASSERTION | Stargazers: 52 | Issues: 0

LanguageBind

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Language: Python | License: MIT | Stargazers: 665 | Issues: 0

BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

Language: Python | License: MIT | Stargazers: 1511 | Issues: 0

FreeU

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

License: MIT | Stargazers: 1668 | Issues: 0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9543 | Issues: 0

Awesome-Prompting-on-Vision-Language-Model

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

Stargazers: 332 | Issues: 0

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Stargazers: 11382 | Issues: 0

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Stargazers: 498 | Issues: 0

ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Language: Python | License: NOASSERTION | Stargazers: 887 | Issues: 0

ConceptDiscoveryModels

This is the official implementation of the Concept Discovery Models paper.

Language: Python | Stargazers: 8 | Issues: 0

diffusion-classifier

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Language: Python | Stargazers: 381 | Issues: 0

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

Label-free-CBM

A new framework to transform any neural network into an interpretable concept-bottleneck model (CBM) without needing labeled concept data

Language: Jupyter Notebook | Stargazers: 67 | Issues: 0

LaBo

CVPR 2023: Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification

Language: Python | Stargazers: 67 | Issues: 0

causalml

Uplift modeling and causal inference with machine learning algorithms

Language: Python | License: NOASSERTION | Stargazers: 4950 | Issues: 0

Language: Python | Stargazers: 2 | Issues: 0

SRPerf

A Performance Evaluation Framework for Segment Routing

Language: Python | Stargazers: 14 | Issues: 0

Language: Python | Stargazers: 209 | Issues: 0

metaformer

MetaFormer Baselines for Vision (TPAMI 2024)

Language: Python | License: Apache-2.0 | Stargazers: 394 | Issues: 0

DHVT

This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets".

Language: Python | License: Apache-2.0 | Stargazers: 51 | Issues: 0

evit

Python code for ICLR 2022 spotlight paper EViT: Expediting Vision Transformers via Token Reorganizations

Language: Python | License: Apache-2.0 | Stargazers: 165 | Issues: 0

DynamicViT

[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

Language: Jupyter Notebook | License: MIT | Stargazers: 550 | Issues: 0

Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Language: Python | Stargazers: 3963 | Issues: 0

Language: Python | License: MIT | Stargazers: 56 | Issues: 0

DAT

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformer with Deformable Attention

Language: Python | License: Apache-2.0 | Stargazers: 757 | Issues: 0