xingdi-eric-yuan / deep-language-networks

We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

xingdi-eric-yuan/deep-language-networks Issues

No issues in this repository yet.