Issue with removing omega_tol

Question

Issue with removing omega_tol

jamesgoulet opened this issue 5 months ago · comments

I have completed the implementation and tested the new formulation for all the mixture-based activation functions in all four files

activation.cpp
activation_fun_cpu.cpp
activation_fun.cu
activation_cuda.cu
that can be found in the branch: New-formulation-mixture-activations

I have found a bug in the existing MixtureSigmoid() where the ma = ma/2 should have been done as a separate step. The result is that the number of epochs in the LSTM example test_lstm.py needs to be reduced to avoid NaNs.

I have re-updated the unit tests that were all minimally changed by the new formulation.
I have tested with test.py all activation functions with CPU and GPU.

The issue that I am running into is that I am unable to remove omega_tol from all the classes' inputs. I tried searching for all occurence in the cuTAGI repo and removing them, but I am then unable to compile and I do not have the experience to understand what I am doing wrong. If you can provide me with guidance with how to do so, I can finish this implementation. You will find below a screenshot of the errors I am running into when trying to compile after having remove the omega_tol everywhere:

Luong-Ha Nguyen · Answer 1 · Thu Apr 04 2024 05:18:37 GMT+0800 (China Standard Time)

@jamesgoulet Did you push the latest code to your branch New-formulation-mixture-activations? It compiled just fine on my side

James-A. Goulet · Answer 2 · Thu Apr 04 2024 05:31:10 GMT+0800 (China Standard Time)

@lhnguyen102 No, have deleted because I was afraid I would break something... I can redo it if you want?

Luong-Ha Nguyen · Answer 3 · Thu Apr 04 2024 05:43:47 GMT+0800 (China Standard Time)

@jamesgoulet could you push the latest code that caused this error to your New-formulation-mixture-activations? Otherwise I wont be able to troubleshoot the issues. As long as we dont merge to main, it wont break main branch

James-A. Goulet · Answer 4 · Thu Apr 04 2024 06:13:32 GMT+0800 (China Standard Time)

I re-did it from scratch and now it compiles. I will prepare the PR.