Klay (KlayMa527)

Klay's starred repositories

Awesome-RSITR

🎮 A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR) | Remote Sensing Cross-Modal Retrieval (RSCMR) | Remote Sensing Vision-Language Models (RSVLMs)

Stargazers: 35 | Issues: 0

deep_sort_pytorch

MOT using deepsort and yolov3 with pytorch

Language: Python | License: MIT | Stargazers: 2776 | Issues: 0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9717 | Issues: 0

FSL-Mate

FSL-Mate: A collection of resources for few-shot learning (FSL).

Language: Python | Stargazers: 1684 | Issues: 0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 29262 | Issues: 0

ShareGPT4V

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Language: Python | Stargazers: 92 | Issues: 0

License: Apache-2.0 | Stargazers: 27 | Issues: 0

fast-reid

SOTA Re-identification Methods and Toolbox

Language: Python | License: Apache-2.0 | Stargazers: 3367 | Issues: 0

Awesome-Multi-Modal-Object-Re-Identification

Multi-modal Object Re-identification

Stargazers: 26 | Issues: 0

EDITOR

[CVPR 2024] Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification

Language: Python | License: MIT | Stargazers: 66 | Issues: 0

MLLA

Official repository of MLLA

Language: Python | Stargazers: 156 | Issues: 0

Remote-Sensing-ChatGPT

Chat with RS-ChatGPT and get the remote sensing interpretation results and the response!

Language: Python | Stargazers: 205 | Issues: 0

Awesome-Remote-Sensing-Multimodal-Large-Language-Model

Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey

Stargazers: 106 | Issues: 0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | Stargazers: 4265 | Issues: 0

TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 255 | Issues: 0

CONQUER

[2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval

Language: Python | Stargazers: 33 | Issues: 0

VTG-LLM

[Preprint] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Language: Python | License: Apache-2.0 | Stargazers: 43 | Issues: 0

VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 30+ benchmarks

Language: Python | License: Apache-2.0 | Stargazers: 878 | Issues: 0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 10858 | Issues: 0

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language: Python | License: Apache-2.0 | Stargazers: 2398 | Issues: 0

Awesome-Mamba-Papers

Awesome Papers related to Mamba.

Stargazers: 1038 | Issues: 0

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 12197 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 957 | Issues: 0

TubeDETR

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Language: Python | License: Apache-2.0 | Stargazers: 165 | Issues: 0

ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet). If you find it useful, please star this project, thanks~

Language: Vue | License: MIT | Stargazers: 5652 | Issues: 0

acm-mm

competition code

Language: Python | Stargazers: 3 | Issues: 0

mistral-inference

Official inference library for Mistral models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 9434 | Issues: 0

LoG

Level of Gaussians

Language: Python | License: NOASSERTION | Stargazers: 654 | Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3144 | Issues: 0

ControlLLM

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Language: Python | Stargazers: 181 | Issues: 0