h3ndrik / streaming-llm

Efficient Streaming Language Models with Attention Sinks

Home Page:https://arxiv.org/abs/2309.17453

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

h3ndrik/streaming-llm Watchers