aarch64: volk_32fc_s32f_power_spectrum_32f fail
marcusmueller opened this issue · comments
Marcus Müller commented
Found during a Fedora scratch build.
build.log.txt
Sum of poly is already covered in #650 , I'm surprised I didn't see that:
suspiciously fast in neonvert, and also wrong:
72/139 Test #72: qa_volk_32f_x3_sum_of_poly_32f .......................***Failed 0.01 sec
RUN_VOLK_TESTS: volk_32f_x3_sum_of_poly_32f(131071,1)
generic completed in 0.216802 ms
a_neon completed in 0.328402 ms
neonvert completed in 0.0616 ms
offset 0 in1: -8937.34 in2: -9080.31 tolerance was: 0.01
volk_32f_x3_sum_of_poly_32f: fail on arch a_neon
Best aligned arch: neonvert
Best unaligned arch: neonvert
95/139 Test #95: qa_volk_32fc_s32f_power_spectrum_32f .................***Failed 0.01 sec
RUN_VOLK_TESTS: volk_32fc_s32f_power_spectrum_32f(131071,1)
generic completed in 0.472402 ms
neon completed in 0.286401 ms
offset 114951 in1: -48.291 in2: -48.291 tolerance was: 1e-06
volk_32fc_s32f_power_spectrum_32f: fail on arch neon
Best aligned arch: neon
Best unaligned arch: neon
Clayton Smith commented
This one fails in CI too:
- https://github.com/gnuradio/volk/actions/runs/6603595907/job/17936864893
- https://github.com/gnuradio/volk/actions/runs/6603892571/job/17937473189
97: RUN_VOLK_TESTS: volk_32fc_s32f_power_spectrum_32f(131071,1)
97: generic completed in 14.0574 ms
97: neon completed in 11.6342 ms
97: offset 108359 in1: -48.2963 in2: -48.2963 tolerance was: 1e-06
97: volk_32fc_s32f_power_spectrum_32f: fail on arch neon
Clayton Smith commented
As an experiment, I tried increasing the tolerance to 2e-06
and no failures occurred on my Raspberry Pi after a full day running the power_spectrum
test.
Clayton Smith commented
So probably just a small amount of error from the approximate log calculation. I'm not sure how much error is acceptable here.