abcilike's starred repositories

rschange

Change detection of remote sensing images

Language: Python | Stargazers: 41 | Issues: 0

dift

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Language: Python | License: MIT | Stargazers: 556 | Issues: 0

Doduo

Official PyTorch implementation of Doduo: Dense Visual Correspondence from Unsupervised Semantic-Aware Flow

Language: Python | License: MIT | Stargazers: 41 | Issues: 0

MaskCD

[IEEE TGRS 2024]: The official PyTorch implementation of the paper "MaskCD: A Remote Sensing Change Detection Network Based on Mask Classification"

Language: Python | License: MIT | Stargazers: 23 | Issues: 0

TokenPacker

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".

Language: Python | Stargazers: 105 | Issues: 0

Language: Python | Stargazers: 1351 | Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 25339 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 87 | Issues: 0

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 2531 | Issues: 0

OpenLRM

An open-source impl. of Large Reconstruction Models

Language: Python | License: Apache-2.0 | Stargazers: 860 | Issues: 0

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language: Python | License: Apache-2.0 | Stargazers: 2784 | Issues: 0

VCoder

VCoder: Versatile Vision Encoders for Multimodal Large Language Models, arXiv 2023 / CVPR 2024

Language: Python | License: Apache-2.0 | Stargazers: 251 | Issues: 0

deictic-segment-anything

Segment Anything with Deictic Prompting

Language: Python | License: MIT | Stargazers: 10 | Issues: 0

Sigma

Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

Language: Python | License: MIT | Stargazers: 129 | Issues: 0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 11385 | Issues: 0

LibMTL

A PyTorch Library for Multi-Task Learning

Language: Python | License: MIT | Stargazers: 1885 | Issues: 0

Language: Python | Stargazers: 32 | Issues: 0

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language: Python | Stargazers: 760 | Issues: 0

HPT

HPT - Open Multimodal LLMs from HyperGAI

Language: Python | License: Apache-2.0 | Stargazers: 303 | Issues: 0

TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Language: Python | License: Apache-2.0 | Stargazers: 509 | Issues: 0

moondream

tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4601 | Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3108 | Issues: 0

LocalMamba

Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan

Language: Python | License: Apache-2.0 | Stargazers: 176 | Issues: 0

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language: Python | License: Apache-2.0 | Stargazers: 2649 | Issues: 0

GeoAware-SC

Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"

Language: Python | Stargazers: 62 | Issues: 0

RCML

Reliable Conflictive Multi-view Learning

Language: Python | Stargazers: 49 | Issues: 0

WeakTr

WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation

Language: Python | License: MIT | Stargazers: 120 | Issues: 0

E2VPT

Official Pytorch implementation of "E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning". (ICCV2023)

Language: Python | License: NOASSERTION | Stargazers: 63 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 56 | Issues: 0

UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Language: Python | License: Apache-2.0 | Stargazers: 875 | Issues: 0