Segment operations for matrix multiplication instead of reduction

Question

Segment operations for matrix multiplication instead of reduction

sidnb13 opened this issue 8 months ago · comments

Sidharth Baskaran commented 8 months ago

I would like to implement an optimized operation to perform what segment_coo or segment_csr does, but apply matrix multiplication instead of the available reductions. Here is an example, and how I'm currently doing it. It is the fastest way I could conceive so far, but I would like parallelize across the number of available weights instead of looping over them.

import torch

input_dim, output_dim = 4, 8
weights = [torch.randn(input_dim, output_dim) for _ in range(3)]
# assigns each feature to a weight
indptr = torch.tensor([0, 0, 1, 1, 2, 2, -1, -1]) # -1 means a padding index so can use any specified weight
features = torch.randn(indptr.shape[0], input_dim)

out = torch.zeros(features.shape[0], output_dim)
for weight in weights:
    out_ = features @ weight
    out += out_[indptr != i, :]

Would appreciate any guidance on how implement a custom operation/kernel to do the above.

Matthias Fey · Answer 1 · Thu Sep 14 2023 13:03:20 GMT+0800 (China Standard Time)

Take a look at https://pyg-lib.readthedocs.io/en/latest/modules/ops.html#pyg_lib.ops.segment_matmul :)

Sidharth Baskaran · Answer 2 · Thu Sep 14 2023 22:20:29 GMT+0800 (China Standard Time)

Great, this is exactly what I needed!