sliorde / conv-decoder

causal decoder based on convolutions only (no attention): can be applied to ubbounded sequence lengths; the prediction of the next token depends on *all* previous tokens; allows autoregressive sampling; highly gpu-parralellizable; trained with teacher forcing;

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

causal decoder based on convolutions only (no attention): can be applied to ubbounded sequence lengths; the prediction of the next token depends on *all* previous tokens; allows autoregressive sampling; highly gpu-parralellizable; trained with teacher forcing;


Languages

Language:Python 100.0%