abcilike's starred repositories
TokenPacker
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
deictic-segment-anything
Segment Anything with Deictic Prompting
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
LocalMamba
Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan
GeoAware-SC
Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"
UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition