berlino / seq-test

Understand and test language model architectures on synthetic tasks.

Setup

pip install torch==2.2.0 opt_einsum pythomata
pip install -U git+https://github.com/sustcsonglin/flash-linear-attention
pip install -e .
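
After installing, it can help to confirm that the pinned dependencies actually landed in the active environment. The snippet below is a minimal sketch, not part of the repository: the `REQUIRED` list and the `installed_versions` helper are hypothetical names, and the list covers only the three packages named in the first pip command (the distribution name of flash-linear-attention is not stated here, so it is left out).

```python
from importlib import metadata

# Assumption: distribution names copied from the first pip command above.
REQUIRED = ["torch", "opt_einsum", "pythomata"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if missing."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed in this environment.
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(REQUIRED).items():
        print(f"{name}: {version or 'NOT INSTALLED'}")
```

If torch reports a version other than 2.2.0, rerunning the first pip command should restore the pinned version.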

Running

python -m zoology.launch zoology/experiments/mqar/mha.py

Sync with Upstream

git checkout -b HazyResearch-main main
git pull https://github.com/HazyResearch/zoology.git main
git checkout main
git merge --no-ff HazyResearch-main
git push origin main

License

Apache License 2.0

Languages

Python 96.1%, Cuda 2.0%, Jupyter Notebook 0.9%, C++ 0.5%, Shell 0.5%