First, follow the LLaVA README to create the base environment.
Then install the packages for Mamba:

```shell
pip install causal-conv1d
pip install mamba-ssm
```
Please download the 558K subset of the LAION-CC-SBU dataset with BLIP captions we use in the paper here.
Pretraining takes around 11 hours for Mamba-2.8B-LLaVA-v1.5 on 4x 3090 (24G) GPUs.
Training script without DeepSpeed and without bf16: `pretrain_fp32.sh`.
- `--mm_projector_type mlp2x_gelu`: the two-layer MLP vision-language connector.
- `--vision_tower openai/clip-vit-large-patch14-336`: CLIP ViT-L/14 336px.
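For intuition, `mlp2x_gelu` denotes a two-layer MLP with a GELU activation in between, projecting CLIP vision features into the language model's embedding space. A minimal PyTorch sketch follows; the dimensions are illustrative assumptions (CLIP ViT-L/14 emits 1024-dim patch features, and 2560 matches Mamba-2.8B's hidden size), so check the repo's config for the actual values.

```python
import torch
import torch.nn as nn

def build_mlp2x_gelu(vision_dim=1024, hidden_dim=2560):
    # Two linear layers with a GELU in between -- the "mlp2x_gelu" connector.
    # Dimensions are illustrative, not read from the repo's config.
    return nn.Sequential(
        nn.Linear(vision_dim, hidden_dim),
        nn.GELU(),
        nn.Linear(hidden_dim, hidden_dim),
    )

projector = build_mlp2x_gelu()
# A 336px image at patch size 14 yields (336/14)^2 = 576 patch tokens.
features = torch.randn(1, 576, 1024)
out = projector(features)
print(out.shape)  # torch.Size([1, 576, 2560])
```

The projector output is then concatenated with the text embeddings as the language model's input, as in LLaVA-v1.5.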
coming soon ...