bob80333 / investigating_extrapolation

investigating sequence length extrapolation in transformer language models across different positional embeddings

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

investigating sequence length extrapolation in transformer language models across different positional embeddings

License:MIT License


Languages

Language:Jupyter Notebook 95.9%Language:Python 4.1%