mit-han-lab / torchsparse

[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

Home Page: https://torchsparse.mit.edu

Can sparse convolution benefit from the pre-existing weights of dense convolution?

ZzTodd22 opened this issue

Thank you for sharing your work! I have a question: I converted an identical model from dense convolution to sparse convolution and carefully loaded the weights of every layer after the conversion, but I have not yet observed a significant improvement. Could the weights of the original dense convolution still influence the performance of the sparse convolution? In other words, can sparse convolution benefit from pre-existing dense convolution weights?

Hi @ZzTodd22! Thanks for your interest in TorchSparse! This is a very good question!

We do have some unit tests demonstrating that sparse convolution can be mapped to dense convolution in a layer-wise comparison. However, transferring dense pre-trained weights to a sparse model may be more challenging, since the properties of sparse workloads can be very different. While the question is really interesting, we think the answer remains open for further exploration.
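
For reference, a minimal sketch of such a layer-wise weight transfer is shown below. The parameter name `kernel`, its `(kernel_volume, in_channels, out_channels)` layout, and the kernel-offset ordering are assumptions that can vary across TorchSparse versions, so please verify them against the unit tests mentioned above before relying on this.

```python
import torch
import torch.nn as nn
import torchsparse.nn as spnn


def copy_dense_conv3d_to_sparse(dense: nn.Conv3d, sparse: spnn.Conv3d) -> None:
    """Copy weights from a dense nn.Conv3d into a TorchSparse Conv3d.

    Assumes the sparse layer stores its weights in a parameter named
    ``kernel`` with shape (kernel_volume, in_channels, out_channels).
    Both the attribute name and the kernel-offset ordering are
    version-dependent assumptions; check them against the library's
    dense-vs-sparse unit tests.
    """
    w = dense.weight.data                      # (out, in, kD, kH, kW)
    out_c, in_c = w.shape[0], w.shape[1]
    # Flatten the spatial kernel and move it to the leading dimension:
    # (out, in, kD, kH, kW) -> (kD * kH * kW, in, out).
    # (Some versions store 1x1x1 kernels as (in, out); handle that case
    # separately if needed.)
    w = w.reshape(out_c, in_c, -1).permute(2, 1, 0).contiguous()
    with torch.no_grad():
        sparse.kernel.copy_(w)                 # assumed parameter name
        if dense.bias is not None and getattr(sparse, "bias", None) is not None:
            sparse.bias.copy_(dense.bias.data)
```

Note that even if each layer maps exactly, end-to-end behavior may still differ, because the sparse model only convolves active (non-empty) sites, so activation statistics can deviate from what the dense weights were trained on; some fine-tuning after the transfer is usually advisable.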

Best regards,
Shang