lightonai / composer

Supercharge Your Model Training

Home Page: http://docs.mosaicml.com

Composer fork for Mamba models

This repository is a fork of the Composer library to train Mamba models with the following features:

  • Custom block-wise activation checkpointing
  • Custom FSDP layer wrapping for Mamba
  • The WSD (warmup-stable-decay) scheduler
  • FLOPs computation for Mamba
  • Custom and efficient dataloading
  • Improved logging

Details and instructions on how to use and train Mamba models with this codebase can be found in the dedicated mamba directory.
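
To illustrate the WSD scheduler listed above, here is a minimal sketch of a warmup-stable-decay learning-rate multiplier in plain Python. It is not the fork's implementation: the function name `wsd_multiplier`, its parameters, and the choice of a linear decay phase are all assumptions made for illustration, and the actual scheduler in the mamba directory may use a different decay shape or a different way of specifying durations.

```python
def wsd_multiplier(step, total_steps, warmup_steps, decay_steps, final_scale=0.0):
    """Hypothetical warmup-stable-decay multiplier for the base learning rate.

    Phases (all names here are illustrative assumptions):
      1. warmup: linear ramp from 0 to 1 over `warmup_steps`
      2. stable: constant 1 until the decay phase begins
      3. decay:  linear ramp from 1 down to `final_scale` over the
                 last `decay_steps` steps
    """
    if not 0 <= step <= total_steps:
        raise ValueError("step must lie in [0, total_steps]")
    if warmup_steps + decay_steps > total_steps:
        raise ValueError("warmup and decay phases must fit within total_steps")

    decay_start = total_steps - decay_steps

    # Phase 1: linear warmup (skipped entirely when warmup_steps == 0).
    if step < warmup_steps:
        return step / warmup_steps

    # Phase 2: stable plateau (also covers the no-decay case).
    if step < decay_start or decay_steps == 0:
        return 1.0

    # Phase 3: linear decay from 1.0 to final_scale.
    progress = (step - decay_start) / decay_steps
    return (1.0 - progress) + progress * final_scale


if __name__ == "__main__":
    # Print the multiplier at a few points of a 1000-step run.
    for s in (0, 50, 100, 500, 800, 900, 1000):
        print(s, wsd_multiplier(s, total_steps=1000, warmup_steps=100, decay_steps=200))
```

A function of this shape returns a multiplier rather than an absolute learning rate, so it could be wrapped in a lambda and passed to a generic scheduler such as PyTorch's `torch.optim.lr_scheduler.LambdaLR`. The appeal of the stable phase is that the total training length only needs to be known when the decay starts.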

License: Apache License 2.0

Languages: Python 99.5%, Dockerfile 0.4%, Shell 0.1%, Makefile 0.0%