typio / nervous

Deep Learning Framework in WebGPU (For Education)

Home Page:https://nervous.vercel.app

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Solve WebGPU OP exceeding max compute workgroups

typio opened this issue · comments

Potential Solutions

1. Settle with current limits
(only guarantee <=2047x2047, result of workgroup_size(64))

1. Utilize x,y maybe z at 64
(cons: verbose wgsl with conversions from 2d/3d to 1d indicies)

2. Loops in wgsl to extend
(cons: technically lessens parallelization, how much would be enough??)

3. Chunk tensor op?
(cons: buffer memory will overflow anyway and this would add a much more complexity)