Dingkang Liang's starred repositories
VAR
[GPT beats diffusion] [scaling laws in visual generation] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
2d-gaussian-splatting
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Vision-RWKV
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
numpy-hilbert-curve
NumPy implementation of Hilbert curves in arbitrary dimensions
WidthFormer
WidthFormer: Toward Efficient Transformer-based BEV View Transformation
1d-tokenizer
This repo contains the code for our paper "An Image is Worth 32 Tokens for Reconstruction and Generation"
Vision-Mamba-A-Comprehensive-Survey-and-Taxonomy
Vision Mamba: A Comprehensive Survey and Taxonomy