alexrozanski / LlamaChat

Chat with your favourite LLaMA models in a native macOS app

Home Page: https://llamachat.app

Add Metal/GPU support for running model inference

singularitti opened this issue · comments

I am no expert in this, but inference seems to be running entirely on the CPU, which can cause significant heat generation.

@singularitti I'm adding support for this in llama.swift to start with (see alexrozanski/llama.swift#8). This will be coming to LlamaChat v2, which is still a WIP!
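For context, llama.cpp (which llama.swift wraps) exposes GPU offload on Apple Silicon through its Metal backend, controlled by an `n_gpu_layers` parameter on the model params. The sketch below is illustrative only, not the actual llama.swift or LlamaChat implementation: it assumes llama.cpp's C API is bridged into Swift as a module named `llama`, and uses the C-API calls `llama_model_default_params` and `llama_load_model_from_file`.

```swift
// Minimal sketch: load a GGUF/GGML model with layers offloaded to the GPU.
// Assumes llama.cpp's C headers are bridged into Swift as a `llama` module.
import llama

func loadModelWithMetal(at path: String) -> OpaquePointer? {
    var params = llama_model_default_params()
    // Number of transformer layers to offload to the GPU via Metal;
    // 0 keeps inference entirely on the CPU. llama.cpp clamps this
    // to the model's actual layer count, so a large value offloads all.
    params.n_gpu_layers = 99
    return llama_load_model_from_file(path, params)
}
```

Keeping the layer count configurable rather than hard-coded would let the app fall back to CPU-only inference on machines where Metal offload is unavailable or undesirable.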