Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
Currently, test artifacts aren't being cleaned up. We probably want at least the first test to retokenize wikitext-2 from scratch; the others can use the cache.
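
One possible shape for that behavior is sketched below using only the standard library. This is not Mistral's actual test code: `tokenize_with_cache`, `run_test_session`, the `overwrite` flag, and the whitespace "tokenizer" are hypothetical stand-ins for the real wikitext-2 preprocessing. The first call is forced to rebuild the cache, later calls reuse it, and the temporary directory is removed at the end so no artifacts are left behind.

```python
import json
import shutil
import tempfile
from pathlib import Path


def tokenize_with_cache(lines, cache_dir, overwrite=False):
    """Hypothetical stand-in for dataset tokenization with an on-disk cache.

    Returns (tokens, used_cache).
    """
    cache_file = Path(cache_dir) / "tokenized.json"
    if cache_file.exists() and not overwrite:
        # Cache hit: skip tokenization entirely.
        return json.loads(cache_file.read_text()), True
    # Cache miss (or forced rebuild): tokenize and write the cache.
    tokens = [line.split() for line in lines]
    cache_file.parent.mkdir(parents=True, exist_ok=True)
    cache_file.write_text(json.dumps(tokens))
    return tokens, False


def run_test_session(lines):
    """Run a 'first test' that retokenizes and a 'later test' that uses the cache."""
    cache_dir = Path(tempfile.mkdtemp(prefix="tokenize-cache-"))
    try:
        first_tokens, first_cached = tokenize_with_cache(lines, cache_dir, overwrite=True)
        later_tokens, later_cached = tokenize_with_cache(lines, cache_dir)
    finally:
        # Clean up artifacts even if a step above fails.
        shutil.rmtree(cache_dir, ignore_errors=True)
    return {
        "first_cached": first_cached,
        "later_cached": later_cached,
        "tokens_match": first_tokens == later_tokens,
        "cache_dir_removed": not cache_dir.exists(),
    }
```

In a pytest suite the same idea would more likely live in a session-scoped fixture that creates the cache directory and removes it on teardown; that wiring is left out here to keep the sketch self-contained.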