tomasmcm / cog-replicate-llama-gguf

How to push any GGUF LLM to Replicate

How to push any GGUF LLM to Replicate

Download a GGUF model from HuggingFace and place it in /models
Update the <model_name> in predict.py to match the file in /models
Create a model on Replicate (https://replicate.com/docs/guides/push-a-transformers-model)
Run cog login
Run cog push r8.im/<your-username>/<your-model-name>

About

How to push any GGUF LLM to Replicate

Languages

Language:Python 100.0%