Solve WebGPU OP exceeding max compute workgroups
typio opened this issue · comments
Thomas Huber commented
Potential Solutions
1. Settle with current limits
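(with workgroup_size(64) and the default maxComputeWorkgroupsPerDimension of 65535, a 1D dispatch covers at most 65535 x 64 = 4,194,240 elements, so square tensors are only guaranteed up to 2047x2047)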
2. Utilize the x, y, and maybe z dispatch dimensions at workgroup_size(64)
(cons: verbose WGSL with conversions from 2D/3D to 1D indices)
3. Loops in WGSL to extend the range each invocation covers
(cons: technically lessens parallelization; how much would be enough?)
4. Chunk the tensor op?
(cons: buffer memory will overflow anyway, and this would add much more complexity)
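
To make the 2047x2047 figure and the x/y option concrete, here is a rough host-side sketch. It assumes the WebGPU default limit of 65535 workgroups per dimension (real code should read `device.limits.maxComputeWorkgroupsPerDimension`), and `maxElements1D`, `planDispatch2D`, and `flatIndex` are hypothetical helper names, not anything in the current codebase.

```typescript
// Assumption: WebGPU default limit; query device.limits in real code.
const MAX_WORKGROUPS_PER_DIM = 65535;
// Matches the @workgroup_size(64) mentioned in this issue.
const WORKGROUP_SIZE = 64;

// Largest element count that a 1D dispatch can cover.
function maxElements1D(
  maxGroups: number = MAX_WORKGROUPS_PER_DIM,
  size: number = WORKGROUP_SIZE
): number {
  return maxGroups * size;
}

// Hypothetical helper: spread the required workgroups over x and y
// so that neither dimension exceeds the per-dimension limit.
function planDispatch2D(
  numElements: number,
  maxGroups: number = MAX_WORKGROUPS_PER_DIM,
  size: number = WORKGROUP_SIZE
): [number, number] {
  const groups = Math.max(1, Math.ceil(numElements / size));
  const y = Math.ceil(groups / maxGroups);
  if (y > maxGroups) {
    throw new Error("op too large even for a 2D dispatch; would need z as well");
  }
  const x = Math.ceil(groups / y);
  return [x, y];
}

// Host-side mirror of the 2D -> 1D conversion the shader would need.
// gidX and gidY stand in for global_invocation_id.x and .y.
function flatIndex(
  gidX: number,
  gidY: number,
  groupsX: number,
  size: number = WORKGROUP_SIZE
): number {
  return gidY * groupsX * size + gidX;
}

const limit = maxElements1D();
console.log(limit, 2047 * 2047 <= limit, 2048 * 2048 <= limit);
console.log(planDispatch2D(2048 * 2048));
```

In WGSL the matching index would be something like `gid.y * (num_workgroups.x * 64u) + gid.x`, followed by a bounds check against the element count, since the rounded-up grid can overshoot the tensor size.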