auto MPI nonblocking
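A very naive idea: use an LLVM pass to automatically replace blocking MPI collective calls with their nonblocking counterparts.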
Run
mkdir build && cd build
cmake ..
make
cd ..
mpicc -S main.cpp -emit-llvm
opt -f -load-pass-plugin=./build/libReplaceMPIColl.so -passes=replace-mpi-coll -S main.ll -o m.ll
TODO
- support all the MPI collective calls
- try to get data-flow information and reorder the IR
- find stencil patterns and achieve separate inner & outer computation.
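
The inner/outer split in the last item can be sketched in plain C++ without MPI. This is a minimal sketch under stated assumptions, not something the pass does today: it assumes a 1D 3-point averaging stencil, a vector `u` with owned cells at indices 1..n and one ghost cell at each end (so `u.size() >= 2`), and the hypothetical helper names `apply_stencil`, `compute_inner` and `compute_outer`.

```cpp
#include <cstddef>
#include <vector>

// Update cells first..last (inclusive) of `out` with a 3-point average of `u`.
// `out` must have the same size as `u`.
void apply_stencil(const std::vector<double>& u, std::vector<double>& out,
                   std::size_t first, std::size_t last) {
    for (std::size_t i = first; i <= last; ++i)
        out[i] = (u[i - 1] + u[i] + u[i + 1]) / 3.0;
}

// Inner cells 2..n-1 read only owned cells, so they do not depend on the
// ghost cells and could run while a halo exchange is still in flight.
void compute_inner(const std::vector<double>& u, std::vector<double>& out) {
    const std::size_t n = u.size() - 2;  // number of owned cells
    if (n >= 3) apply_stencil(u, out, 2, n - 1);
}

// Outer cells 1 and n read the ghost cells, so they have to run after the
// halo exchange has completed.
void compute_outer(const std::vector<double>& u, std::vector<double>& out) {
    const std::size_t n = u.size() - 2;  // number of owned cells
    if (n == 0) return;
    apply_stencil(u, out, 1, 1);
    if (n > 1) apply_stencil(u, out, n, n);
}
```

Calling `compute_inner` and then `compute_outer` writes the same values as one `apply_stencil(u, out, 1, n)` call over all owned cells. In an MPI program the intended order would be: start the nonblocking halo exchange, run the inner part, wait, then run the outer part.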