Takuya Makino (tma15)

Location: Kanagawa, Japan

Home Page: https://tma15.github.io

Takuya Makino's starred repositories

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

License: NOASSERTION · Stars: 26139 · Issues: 0

self_supervised

A Pytorch-Lightning implementation of self-supervised algorithms

Language: Python · License: MIT · Stars: 527 · Issues: 0

olm-training

Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset.

Language: Python · License: Apache-2.0 · Stars: 91 · Issues: 0

data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 294 · Issues: 0

olm-datasets

Pipeline for pulling and processing online language model pretraining data from the web

Language: Python · License: Apache-2.0 · Stars: 170 · Issues: 0

cramming

Cramming the training of a (BERT-type) language model into limited compute.

Language: Python · License: MIT · Stars: 1278 · Issues: 0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language: Python · License: MIT · Stars: 7663 · Issues: 0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python · License: MIT · Stars: 11485 · Issues: 0
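
A quick illustration of the encoding API (the encoding name and sample text are illustrative):

    import tiktoken

    # Load a named BPE encoding and round-trip a string.
    enc = tiktoken.get_encoding("cl100k_base")
    tokens = enc.encode("tiktoken is a fast BPE tokeniser.")
    print(tokens)              # list of integer token ids
    print(enc.decode(tokens))  # recovers the original string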

pytorch_scatter

PyTorch Extension Library of Optimized Scatter Operations

Language: Python · License: MIT · Stars: 1524 · Issues: 0
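
A minimal sketch of the core scatter reduction (tensor values and shapes are illustrative):

    import torch
    from torch_scatter import scatter

    src = torch.tensor([1., 2., 3., 4.])
    index = torch.tensor([0, 0, 1, 1])
    # Sum the entries of `src` that share an index -> tensor([3., 7.])
    out = scatter(src, index, dim=0, reduce="sum")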

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language: Python · License: Apache-2.0 · Stars: 1590 · Issues: 0
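
A sketch of sparse (BM25) retrieval with Pyserini; the prebuilt index name and query are illustrative, and the import path assumes a recent Pyserini release:

    from pyserini.search.lucene import LuceneSearcher

    # Download a prebuilt Lucene index and run a BM25 query against it.
    searcher = LuceneSearcher.from_prebuilt_index("msmarco-v1-passage")
    hits = searcher.search("what is dense retrieval?", k=10)
    for hit in hits:
        print(hit.docid, hit.score)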

TRIME

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Language: Python · Stars: 189 · Issues: 0

EMAT

Efficient Memory-Augmented Transformers

Language: Python · Stars: 34 · Issues: 0

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ · License: Apache-2.0 · Stars: 5715 · Issues: 0

torchprofile

A general and accurate MACs / FLOPs profiler for PyTorch models

Language: Python · License: MIT · Stars: 550 · Issues: 0
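
A minimal sketch of counting MACs with torchprofile (the model and input shape are illustrative):

    import torch
    from torchvision.models import resnet18
    from torchprofile import profile_macs

    model = resnet18().eval()
    inputs = torch.randn(1, 3, 224, 224)
    # Trace the model once and count multiply-accumulate operations.
    macs = profile_macs(model, inputs)
    print(f"{macs / 1e9:.2f} GMACs per forward pass")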

Transkimmer

Code for the ACL 2022 paper "Transkimmer: Transformer Learns to Layer-wise Skim"

Language: Python · Stars: 21 · Issues: 0

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device, apply SOTA compression techniques for LLMs, and run LLMs efficiently on Intel platforms ⚡

Language: Python · License: Apache-2.0 · Stars: 2089 · Issues: 0

dataloader

The Merlin dataloader lets you rapidly load tabular data for training deep learning models with TensorFlow, PyTorch, or JAX.

Language: Python · License: Apache-2.0 · Stars: 401 · Issues: 0
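
A hedged sketch of streaming a Parquet file through the PyTorch wrapper; the file path and batch size are illustrative, and the import paths assume the merlin-dataloader and merlin-core packages:

    from merlin.io import Dataset
    from merlin.loader.torch import Loader

    # Wrap a Parquet dataset and stream it in large batches for training.
    dataset = Dataset("/path/to/data.parquet")
    loader = Loader(dataset, batch_size=65536)
    for batch in loader:
        ...  # feed each batch to your PyTorch training step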

length-adaptive-transformer

Official PyTorch implementation of Length-Adaptive Transformer (ACL 2021)

Language: Python · License: Apache-2.0 · Stars: 100 · Issues: 0

SentAugment

SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in combination with self-training and knowledge distillation, or for retrieving paraphrases.

Language: Python · License: NOASSERTION · Stars: 363 · Issues: 0

voltaML

⚡ VoltaML is a lightweight library to convert and run your ML/DL models in high-performance inference runtimes like TensorRT, TorchScript, ONNX and TVM.

Language: Python · License: Apache-2.0 · Stars: 1194 · Issues: 0

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 1506 · Issues: 0
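
The "single line of code" refers to wrapping an existing Hugging Face model; a hedged sketch, assuming kernl exposes the optimize_model entry point shown in its README:

    import torch
    from transformers import AutoModel, AutoTokenizer
    from kernl.model_optimization import optimize_model  # assumed entry point

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModel.from_pretrained("bert-base-uncased").eval().cuda()

    # Swap supported submodules for fused Triton kernels in place.
    optimize_model(model)

    inputs = tokenizer("kernl speeds up transformer inference", return_tensors="pt").to("cuda")
    with torch.inference_mode():
        outputs = model(**inputs)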

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Language: Python · License: MIT · Stars: 399 · Issues: 0
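
The underlying idea is a plain parameter average; a minimal sketch of a uniform soup in PyTorch (not the repository's scripts, and the checkpoint paths are illustrative):

    import torch

    def uniform_soup(state_dicts):
        # Average parameters across fine-tuned checkpoints of the same architecture.
        return {
            key: torch.stack([sd[key].float() for sd in state_dicts]).mean(dim=0)
            for key in state_dicts[0]
        }

    paths = ["finetune_run1.pt", "finetune_run2.pt", "finetune_run3.pt"]
    souped = uniform_soup([torch.load(p, map_location="cpu") for p in paths])
    # model.load_state_dict(souped)  # load into a model with the same architecture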

transformers-bloom-inference

Fast Inference Solutions for BLOOM

Language: Python · License: Apache-2.0 · Stars: 557 · Issues: 0

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Language: Python · License: Apache-2.0 · Stars: 1817 · Issues: 0
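
A hedged sketch, assuming the pipeline-style entry point from newer MII releases; the model name and prompt are illustrative:

    import mii

    # Load a Hugging Face model behind DeepSpeed-optimized inference kernels.
    pipe = mii.pipeline("mistralai/Mistral-7B-v0.1")
    responses = pipe(["DeepSpeed-MII makes low-latency inference"], max_new_tokens=64)
    print(responses)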

AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python · License: Apache-2.0 · Stars: 4505 · Issues: 0

pytorch-lightning

Pretrain, finetune, and deploy AI models on multiple GPUs and TPUs with zero code changes.

Language: Python · License: Apache-2.0 · Stars: 27746 · Issues: 0
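
A minimal LightningModule and Trainer sketch (the toy model and random data are only there to make it self-contained):

    import torch
    from torch import nn
    from torch.utils.data import DataLoader, TensorDataset
    import pytorch_lightning as pl

    class LitRegressor(pl.LightningModule):
        def __init__(self):
            super().__init__()
            self.net = nn.Linear(16, 1)

        def training_step(self, batch, batch_idx):
            x, y = batch
            return nn.functional.mse_loss(self.net(x), y)

        def configure_optimizers(self):
            return torch.optim.Adam(self.parameters(), lr=1e-3)

    # Random data just to make the example runnable end to end.
    data = TensorDataset(torch.randn(256, 16), torch.randn(256, 1))
    trainer = pl.Trainer(max_epochs=1, accelerator="auto")
    trainer.fit(LitRegressor(), DataLoader(data, batch_size=32))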