A fork of llama-cpp-python that contains some modifications and provides a much simpler inference API and demo. Integrated with my ChatBots.