tomasmcm / cog-replicate-llama-gguf

How to push any GGUF LLM to Replicate

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to push any GGUF LLM to Replicate

  1. Download a GGUF model from HuggingFace and place it in /models
  2. Update the <model_name> in predict.py to match the file in /models
  3. Create a model on Replicate (https://replicate.com/docs/guides/push-a-transformers-model)
  4. Run cog login
  5. Run cog push r8.im/<your-username>/<your-model-name>

About

How to push any GGUF LLM to Replicate


Languages

Language:Python 100.0%