facebookresearch / theseus

A library for differentiable nonlinear optimization

Documentation for SymbolicDecomposition in simple Ax=b use case

IdRatherBeCoding opened this issue · comments

Hello.

I'm interested in using your PyTorch wrapper of BaSpaCho for solving multiple Ax = b problems (as opposed to least squares). I can see tests using SymbolicDecomposition, but I don't understand how to prepare the arguments.

It would be really appreciated if you could provide an example of how to use SymbolicDecomposition to solve a simple linear problem Ax = b for, say, a batch of 2 input scipy CSR matrices with the same sparsity pattern.

I am also interested in this. In fact, there is a related discussion here:
#609

The LU solver at least is rather simple to use and can be called with something along the lines of the following snippet (a simple adaptation is needed for batched input):

# CusolverLUSolver comes from theseus's compiled CUDA extensions
# (theseus.extlib.cusolver_lu_solver), so this assumes a CUDA build.
from theseus.extlib.cusolver_lu_solver import CusolverLUSolver

def test_solve_lu(A, b):
    # A: torch sparse CSR tensor (n x n) on the GPU; b: dense vector of length n
    batch_size = 1
    solver = CusolverLUSolver(batch_size, A.shape[1], A.crow_indices(), A.col_indices())
    # factor() expects the values batched to shape (batch_size, nnz)
    singularities = solver.factor(A.values().unsqueeze(0))
    print("singularities:", singularities)
    x = b.clone().unsqueeze(0)  # (batch_size, n); solve() overwrites x in place
    solver.solve(x)
    return x.squeeze()
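For the batched case asked about above (several scipy CSR matrices sharing one sparsity pattern), the adaptation is mainly about stacking the value arrays. A minimal sketch of that preprocessing step; `stack_csr_batch` is a hypothetical helper name, not part of theseus:

```python
import numpy as np
import scipy.sparse as sp

def stack_csr_batch(mats):
    # Verify all matrices share one sparsity pattern, then stack their
    # value arrays into a (batch_size, nnz) array. The result can be
    # moved to torch with torch.from_numpy(...) and passed to factor(),
    # mirroring the unsqueeze(0) in the single-matrix snippet above.
    ref = mats[0].tocsr()
    for m in mats[1:]:
        m = m.tocsr()
        assert np.array_equal(m.indptr, ref.indptr)
        assert np.array_equal(m.indices, ref.indices)
    vals = np.stack([m.tocsr().data for m in mats])
    return ref.indptr, ref.indices, vals
```

With batch_size = len(mats), the solver would then be constructed once from the shared indptr/indices and factor all batch members in a single call.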

If the backward function is required as well, I am not sure this can be done directly with Theseus, as the higher-level wrapper LUCudaSolveFunction actually solves the normal equations. You could, however, use CusolverLUSolver in combination with the approach we use here:
https://github.com/cai4cai/torchsparsegradutils/blob/main/torchsparsegradutils/sparse_solve.py#L223
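In a nutshell, that approach treats the factorization as a black box in the forward pass and implements the backward pass with one extra solve against A^T. Here is a CPU-only sketch of the idea, using scipy's splu as a stand-in for the CUDA solver and, for brevity, only differentiating with respect to b:

```python
import numpy as np
import scipy.sparse as sp
import scipy.sparse.linalg as spla
import torch

class SparseSolve(torch.autograd.Function):
    # x = A^{-1} b, with A treated as a constant. Since dx/db = A^{-1},
    # the backward pass is grad_b = A^{-T} grad_x: one more triangular
    # solve, reusing the factorization computed in the forward pass.
    @staticmethod
    def forward(ctx, A, b):
        lu = spla.splu(sp.csc_matrix(A))
        ctx.lu = lu
        return torch.from_numpy(lu.solve(b.detach().cpu().numpy()))

    @staticmethod
    def backward(ctx, grad_x):
        g = ctx.lu.solve(grad_x.cpu().numpy(), trans="T")  # A^{-T} grad_x
        return None, torch.from_numpy(g)  # no gradient for A
```

Extending this to gradients w.r.t. A's values (as torchsparsegradutils does) uses grad_A[i, j] = -grad_b[i] * x[j], evaluated on the nonzero pattern only.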

Putting this together is something I want to look into at some point:
cai4cai/torchsparsegradutils#49

@tvercaut Thanks for your answer - we will try this too!

Hi @IdRatherBeCoding and @tvercaut. @maurimo (the main author of Baspacho) wrote a similar example on using Baspacho in 2c6a8ce. Hopefully you can find this useful as well. Let me know if you have any questions. (ps: @tvercaut sorry for the delay, he was only able to find time for this the past weekend).

Considering the interest, I'm likely to make a clean API exposing differentiable versions of these solvers the next major feature we add. It's mostly a matter of finding the time, as other projects are consuming most of our bandwidth.

The support is appreciated, thank you. For our use case the example code with 1x1 blocks shows impressive scaling with batch size, but the setup overhead and the relatively slow solve are too costly for us. I have tried larger (uniform-size) blocks, but this doesn't change much. Do you happen to have any advice on approaches to block sizing for general matrices (e.g. Laplacians)?

Otherwise we look forward to some of the efficiency improvements mentioned in the baspacho README :)

@IdRatherBeCoding Unfortunately, as @maurimo himself pointed out, setup is slow when using the default 1x1 blocks, and BaSpaCho currently has no automatic way to determine a block structure, so it is still up to the user to set up an appropriate one.

@maurimo do you perhaps have any advice on block sizing for general matrices?

Hello, sorry for the late reply!

So, BaSpaCho is built around the matrix being block-structured. Of course this is a double-edged sword, as it will be (a bit...) counterproductive if the matrix doesn't have a block structure. But in general (and especially when doing optimization) matrices actually are block-structured, with blocks corresponding to variable pairs, and variables often being 3D, or at least 2D, vectors.

I also have to say that I made sure factor was as optimized as possible (in my use case it is invariably the bottleneck), while I put little attention on solve and even less on setup; contributors welcome :-p. Also, for Theseus the setup is done only once, so this is a win more often than not. I hope to eventually find some time to work on improving those ops too.

That said, if your matrix is a Laplacian, don't you have at least 2D variables, making the blocks 2x2? If the matrix really isn't block-structured, there isn't much you can do other than using 1x1 blocks (or zero fill-in, but that is likely to make things worse).

If you are not building the matrix yourself, you can "discover" a block structure using a hashing trick: build a list of random "Zobrist" keys Z_j (as many as the order N of the matrix), and for each i compute H_i as the sum of Z_j over all pairs (i, j) such that M_{i,j} != 0 or M_{j,i} != 0. Then if H_i = H_k, those columns/rows (almost surely) belong to the same block, and you can apply a permutation to make them consecutive and define a non-trivial block structure. Sorry if this isn't very clear; let me know if the trick might be useful to you and I will provide proof-of-concept code!
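A proof-of-concept of the hashing trick described above, in NumPy/SciPy (the function name and details are my own sketch, not BaSpaCho code):

```python
import numpy as np
import scipy.sparse as sp

def discover_blocks(M, seed=0):
    # One random "Zobrist" key Z[j] per index; H[i] sums Z[j] over all j
    # with M[i, j] != 0 or M[j, i] != 0. Equal hashes (almost surely)
    # mean identical symmetric sparsity patterns, i.e. the same block.
    n = M.shape[0]
    rng = np.random.default_rng(seed)
    Z = rng.integers(1, 2**62, size=n, dtype=np.int64)
    S = (M != 0).astype(np.int64)
    S = S + S.T + sp.identity(n, dtype=np.int64, format="csr")
    S.data[:] = 1          # keep the pattern only, drop summed counts
    H = S @ Z              # H[i] = sum of Z[j] over row i's nonzeros
    # A stable sort by hash is a permutation making blocks consecutive.
    perm = np.argsort(H, kind="stable")
    _, block_ids = np.unique(H, return_inverse=True)
    return perm, block_ids
```

For example, on a matrix built as a Kronecker product with dense 2x2 diagonal blocks, rows belonging to the same block hash to the same value and end up adjacent under the returned permutation.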

Thanks for this input, @maurimo. I tried your suggestion to identify blocks, but it did not come up with anything; maybe I misunderstood something. Instead, I tried converting the matrix to a banded structure and was able to get some improvement using block size > 1 for the banded matrix.
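For reference, one common way to obtain a banded structure like the one mentioned here is a reverse Cuthill-McKee reordering (this is a guess at the approach; the poster's exact method is not shown):

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.csgraph import reverse_cuthill_mckee

def to_banded(M):
    # Permute a symmetric sparse matrix so its nonzeros cluster near the
    # diagonal; the permuted matrix can then be tiled into uniform
    # blocks along the band.
    M = sp.csr_matrix(M)
    perm = reverse_cuthill_mckee(M, symmetric_mode=True)
    return M[perm][:, perm], perm

def bandwidth(M):
    rows, cols = M.nonzero()
    return int(np.max(np.abs(rows - cols)))
```

On a scrambled path-graph (tridiagonal) matrix, for instance, RCM recovers an ordering with bandwidth 1.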