mmarius

mmarius's starred repositories

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.015321 103 991

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language:PythonApache-2.06250 112 205

FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

Language:RMIT6208 107 7

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language:PythonApache-2.06053 103 405

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonMIT4448 53 203

OpenPrompt

An Open-Source Framework for Prompt-Learning.

Language:PythonApache-2.04259 43 256

SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Language:PythonMIT3331 27 266

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookApache-2.02487 30 377

makemore

An autoregressive character-level language model for making more things

Language:PythonMIT2365 33 8

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language:Jupyter NotebookApache-2.02168 32 102

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language:PythonApache-2.01576 19 536

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonNOASSERTION1293 24 143

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language:PythonMIT1272 22 34

bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Language:ShellNOASSERTION968 38 19

tueplots

Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.

Language:PythonMIT652 4 54

mistral

Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.

Language:PythonApache-2.0549 16 96