Fuzzy-Search / realtime-bakllava

llama.cpp with the BakLLaVA model describes what it sees

🍰 Bakllava Llama C++ Tutorial 🦙

Welcome to the delicious world of Bakllava Llama with C++! Follow these steps to get your code running and indulge in AI sweetness! 😋

🚨 Properly tested only on Apple silicon

YouTube installation guide

Similar relevant project: "Be My Eyes" web app

🚀 Step 1: Install Llama C++

First things first, let's get llama.cpp installed.

🔗 Clone the repository from GitHub:

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp

On Linux & macOS:

🛠 Build with make:

make

πŸ— Or, if you prefer cmake:

cmake --build . --config Release

📦 Step 2: Download the Model!

  1. 📥 Download these 2 files from Hugging Face (mys/ggml_bakllava-1):
  • 🌟 ggml-model-q4_k.gguf (or any other quantized model; only one is required)
  • 🧊 mmproj-model-f16.gguf
  2. ✂️ Copy the paths of those 2 files.

  3. 🏃‍♂️ Run this in the llama.cpp repository (replace YOUR_PATH with the path to the files you downloaded):

    macOS

    ./server -m YOUR_PATH/ggml-model-q4_k.gguf --mmproj YOUR_PATH/mmproj-model-f16.gguf -ngl 1

    Windows

    server.exe -m YOUR_PATH\ggml-model-q4_k.gguf --mmproj YOUR_PATH\mmproj-model-f16.gguf -ngl 1

  4. 🎉 The llama server is now up and running!

    ⚠️ NOTE: Keep the server running in the background.

  5. 📹 Now run a demo script to use the webcam or send a single picture (see Step 3).
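
For a feel of what talking to the running server looks like, here is a minimal sketch of a client that sends one picture and asks for a description. It is not taken from the demo scripts. It assumes the server listens on the default 127.0.0.1:8080 and accepts the multimodal request shape documented for the llama.cpp server example at the time (a /completion endpoint with an image_data list and an [img-N] placeholder in the prompt); other llama.cpp builds may expect a different format. The names build_payload and describe_image are made up for this example.

```python
import base64
import json
import urllib.request

# Assumption: the server was started as in Step 2 with its default host and port.
SERVER_URL = "http://127.0.0.1:8080/completion"


def build_payload(image_bytes, question="Describe the image.", image_id=1):
    """Build the JSON body for one image question (hypothetical helper)."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        # The [img-N] placeholder marks where the image with id N is injected.
        "prompt": f"USER:[img-{image_id}]{question}\nASSISTANT:",
        "n_predict": 128,
        "image_data": [{"data": encoded, "id": image_id}],
    }


def describe_image(path, url=SERVER_URL):
    """Send the picture at `path` to the server and return the generated text."""
    with open(path, "rb") as f:
        payload = build_payload(f.read())
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=120) as response:
        # Assumption: the reply is JSON with the generated text under "content".
        return json.loads(response.read())["content"]


if __name__ == "__main__":
    # Requires the server from Step 2 to be running.
    print(describe_image("src/sample_pic.png"))
```

Only the standard library is used here so the sketch runs without extra installs; the demo scripts in Step 3 bring their own requirements.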

πŸƒβ€β™€οΈ Step 3: Running the Demo

Open a new terminal window and clone the demo app:

git clone https://github.com/Fuzzy-Search/realtime-bakllava.git
cd realtime-bakllava

🛠 (Optional) Create and activate a new Python virtual environment, then install the requirements:

python3 -m venv bakllava-venv
source bakllava-venv/bin/activate
pip3 install -r requirements.txt
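
Both scripts below need the server from Step 2 to still be running. A small check like the following can confirm that before you launch them. This is only a sketch: it assumes the server uses the default 127.0.0.1:8080, and server_is_up is a hypothetical helper, not part of the demo app.

```python
import socket


def server_is_up(host="127.0.0.1", port=8080, timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False


if __name__ == "__main__":
    if server_is_up():
        print("llama server is reachable")
    else:
        print("llama server not reachable; start it as shown in Step 2")
```

This only checks that the port is open, not that the model has finished loading.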

🎥 Webcam Script

To start streaming from your webcam:

python3 src/video_stream.py

⚠️ NOTE: If you have a problem with the FFmpeg library, download the source code and modify the second line of src/video_stream.py.

🖼 Simple Picture Drop

pip install -r picture_requirements.txt
python src/picture_drop.py --path src/sample_pic.png

πŸ“ Enjoy your adventure with Llama C++! πŸš€πŸ¦™
