Shashank Rajput's repositories
RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Language: Python; License: Apache-2.0
llm-foundry
LLM training code for MosaicML foundation models
Language: Python; License: Apache-2.0
temp_composer
temporary fork of sashaDoubov/composer
Language: Python; License: Apache-2.0
composer
Train neural networks up to 7x faster
Language: Python; License: Apache-2.0
flash-attention
Fast and memory-efficient exact attention
Language: Python; License: BSD-3-Clause
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
License: Apache-2.0