arrmansa / Basic-UI-for-GPT-Neo-with-low-vram

A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Basic-UI-Gpt-Neo-low-vram

A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)

Expected speed on pcie-3 with 3gb vram is 0.8s/token or 20s for 25 tokens
Expected speed on pcie-3 with 8gb vram is 0.4s/token or 10s for 25 tokens
(with a 2000 token input)

About

A basic ui for running gpt neo 2.7B on low vram (3 gb Vram minimum)

License:Apache License 2.0


Languages

Language:Jupyter Notebook 100.0%