[Question] What is the status of Vulkan backend?
DanielMazurkiewicz opened this issue
Vulkan may not be the best/fastest/easiest solution for inference, but it is probably the most portable GPU-acceleration approach.
Is anyone actively working on adding support for it? If so, what is the status/progress? If not, is it planned?
There are ggerganov/llama.cpp#2059 and ggerganov/llama.cpp#2039.
Yeah, I'm working on it. Let me know if you have any questions. It's a big project, but I'm making progress.
Is there any low hanging fruit a newcomer to the project could help with?
@Calandiel If you have experience with Vulkan, maybe. Otherwise probably not.
I have. I've written Vulkan-based render pipelines professionally and made toy neural networks in Vulkan trained with SGD. I've been working with it in at least some capacity for the last 4 years or so.
Oh cool, I'd be glad to work something out. If you have Discord, send me a message (_occam); otherwise, send me an email and we'll find another way.
Will do, see you on Discord!
I think nomic-ai have a functional Kompute backend for llama.cpp right now:
https://github.com/nomic-ai/llama.cpp
And GPT4All is plenty fast on my 7900 XTX via Vulkan.
But I am not sure how to integrate this into ggml, as I am not a programmer.