anakin87 / llama2-haystack

Using Llama2 with Haystack, the NLP/LLM framework.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

llama2-haystack

(Experimental) Using Llama2 with Haystack

The notebook contains my hacky experiments in which I try to load and use Llama2 with Haystack, the NLP/LLM framework.

It's nothing official or well refined, but perhaps it may be useful to other people experimenting.

llama2-haystack

Notebook summary

  1. Installed Transformers from the main branch (and other libraries) ๐Ÿ“š
  2. Loaded Llama-2-13b-chat-hf on Colab using 4-bit quantizazion, thanks to the great material shared by Younes Belkada ๐Ÿ™Œ
  3. Disabled Tensor Parallelism, which caused some issues ๐Ÿ› ๏ธ
  4. Installed a minimal version of Haystack
  5. Found a hacky way to load the model in Haystack's PromptNode
  6. Had a llama-zing chat session, from ๐ŸŽง๐ŸŽถ David Guetta to Don Matteo โ›ช๐Ÿ“ฟ (an Italian TV series)!

About

Using Llama2 with Haystack, the NLP/LLM framework.


Languages

Language:Jupyter Notebook 100.0%