Testing NNLib / Lux / Flux

Question

Testing NNLib / Lux / Flux

gdalle opened this issue 6 months ago · comments

Guillaume Dalle commented 6 months ago

Lower hanging fruit: NNLib.jl, because there are less weird structs, mostly arrays

Cross-referencing:

Guillaume Dalle · Answer 1 · Tue Mar 26 2024 23:19:00 GMT+0800 (China Standard Time)

Slow

Replace several calls to grad_test with a vector of scenarios, like in softmax.jl and then scatter.jl

Fast

Replace grad_test and watch the world crumble: https://github.com/FluxML/NNlib.jl/blob/master/test/test_utils.jl

Avik Pal · Answer 2 · Thu Mar 28 2024 01:58:43 GMT+0800 (China Standard Time)

If we want to be adventurous, you can change https://github.com/LuxDL/LuxTestUtils.jl and all downstream CPU tests in Lux will be triggered (and we just need to copy one of the buildkite files from LuxLib to trigger the CUDA + AMDGPU tests)

Guillaume Dalle · Answer 3 · Thu Mar 28 2024 02:00:21 GMT+0800 (China Standard Time)

Don't tempt me Avik

Avik Pal · Answer 4 · Thu Mar 28 2024 02:04:58 GMT+0800 (China Standard Time)

On a serious note though, I had to write it to mostly deal with arrays or at least convert structures to arrays https://github.com/LuxDL/LuxTestUtils.jl/blob/143a51f0d2fb4cbc75ea583c706ff5194be103d2/src/LuxTestUtils.jl#L387-L398, so that could be helpful to writing your test suite. (But this is also terribly inefficient and only tests correctness and definitely don't combine @test_gradients with @jet)

Guillaume Dalle · Answer 5 · Thu Mar 28 2024 02:09:09 GMT+0800 (China Standard Time)

Are the tests of LuxTestUtils already interesting to run locally, or should we wait for the Downstream CI every time?

Avik Pal · Answer 6 · Thu Mar 28 2024 02:10:45 GMT+0800 (China Standard Time)

no the tests there do nothing practically, it is all via the downstream CI

Avik Pal · Answer 7 · Thu Mar 28 2024 02:12:01 GMT+0800 (China Standard Time)

but the Lux test suite doesn't take long -- 10 mins on a nicer machine (like the buildkite ones) but github actions ones take longer ~30 mins

If you want to test locally, set RETESTITEMS_NWORKERS and it will be much faster

Guillaume Dalle · Answer 8 · Thu Mar 28 2024 02:17:04 GMT+0800 (China Standard Time)

So the workflow is to:

fork LuxTestUtils.jl and Lux.jl
put my own gradient callers in LuxTestUtils.jl
dev LuxTestUtils.jl into the test environment of Lux.jl
test Lux.jl

right?

Avik Pal · Answer 9 · Thu Mar 28 2024 02:37:17 GMT+0800 (China Standard Time)

If you want to test locally yes.

Guillaume Dalle · Answer 10 · Thu Mar 28 2024 03:10:56 GMT+0800 (China Standard Time)

Any suggestions on dealing with multiple arguments? Is wrapping them in a ComponentVector always gonna work, or are there non-array structs in the mix?

Guillaume Dalle · Answer 11 · Thu Mar 28 2024 03:11:06 GMT+0800 (China Standard Time)

DifferentiationInterface only accepts a single input

Guillaume Dalle · Answer 12 · Thu Mar 28 2024 03:31:40 GMT+0800 (China Standard Time)

I'm thinking https://docs.julialang.org/en/v1/base/base/#Base.splat on a ComponentVector

Avik Pal · Answer 13 · Thu Mar 28 2024 03:36:30 GMT+0800 (China Standard Time)

Based on how the tests are written, for multiple arguments, I assume any non-array is non-differentiable (this is a testing package so I can assume that) so these get filtered out in https://github.com/LuxDL/LuxTestUtils.jl/blob/143a51f0d2fb4cbc75ea583c706ff5194be103d2/src/LuxTestUtils.jl#L357-L383. After that there are 2 possibilities -- 1) backend supports multi args so in that case it just forwards it 2) all other cases use a componentarray and create a closure which unflattens the componentarray to provide the correct args.

Guillaume Dalle · Answer 14 · Thu Mar 28 2024 03:40:16 GMT+0800 (China Standard Time)

I'll see what I can do once our own testing interface stabilizes. Step one would be to replace your gradient calls, but we can actually aim to replace your entire testing macro

Guillaume Dalle · Answer 15 · Thu Mar 28 2024 03:41:03 GMT+0800 (China Standard Time)

Our function https://gdalle.github.io/DifferentiationInterface.jl/dev/api/#DifferentiationInterfaceTest.test_differentiation does something very similar

Avik Pal · Answer 16 · Thu Mar 28 2024 03:55:12 GMT+0800 (China Standard Time)

I'll see what I can do once our own testing interface stabilizes. Step one would be to replace your gradient calls, but we can actually aim to replace your entire testing macro

correct. I had planned to replace the API with something like skip = [AutoTracker(), ...] and broken = [AutoReverseDiff()...]. But eventually we might use DI