pointfreeco / swift-snapshot-testing

📸 Delightful Swift snapshot testing.

Home Page: https://www.pointfree.co/episodes/ep41-a-tour-of-snapshot-testing

Tests run successfully locally and in Xcode on the CI runner, but fail when using fastlane run_tests

goodboygb1 opened this issue

I set the test precision to 95% and tried to run the tests:

  • Xcode locally: success
  • Xcode on the CI runner: success
  • terminal on the CI runner using fastlane run_tests: failed with the error -> snapshot does not match reference

How can I solve this?
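
For context, this is roughly how those precision values are passed to SnapshotTesting (a minimal sketch; the issue does not say whether the 95% is pixel or perceptual precision, so both parameters are shown, and the view is hypothetical):

import SnapshotTesting
import UIKit
import XCTest

final class MyViewTests: XCTestCase {
  func testMyView() {
    // Hypothetical view under test.
    let view = UIView(frame: CGRect(x: 0, y: 0, width: 100, height: 100))
    view.backgroundColor = .systemBlue

    // precision: the fraction of pixels that must match exactly.
    // perceptualPrecision: how closely each pixel must match perceptually
    // (Lab ΔE); values below 1 route the comparison through
    // perceptuallyCompare, discussed later in this thread.
    assertSnapshot(
      of: view,
      as: .image(precision: 0.95, perceptualPrecision: 0.98)
    )
  }
}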

If your local machine is Apple Silicon (arm64) and the CI runner is Intel, this is a known architecture issue.

@adozenlines The author specifically mentioned that the tests are being run on CI using both fastlane and Xcode - an architecture difference between those two runs should not exist (unless, e.g., it is an M1 CI machine and Xcode is running through Rosetta).

@goodboygb1 Have you tried running fastlane locally too?

TL;DR: Try running the tests on a different simulator, or reset that simulator.

Full Story:
I am experiencing a similar issue.
A test that should fail (after changing the compared view to something completely different, or removing test data so the view has definitely changed) ends up as a success in Xcode, while it fails as expected using fastlane.
This is on an M2 Max MBP 14 with macOS 14.0, Xcode 15.0.1, and swift-snapshot-testing 1.15.1.
I downgraded back to 1.15.0 but the results are the same.

I started debugging and it seems there is some kind of issue in the perceptuallyCompare function (UIImage.swift:193), and more precisely in ThresholdImageProcessorKernel (UIImage.swift:245).
It seems the process function was never called, hence the generated thresholdOutputImage was just an empty image.
The strange thing is that no error was thrown and nothing else happened.
At the same time the delta image was showing obvious differences.
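
For anyone else digging in, this is roughly the shape of that pipeline (a sketch, not the library's exact code): the delta image comes from Core Image's built-in CILabDeltaE filter, and an image's average pixel value can be read back through CIAreaAverage. The averagePixelValue helper below is hypothetical.

import CoreImage

// Per-pixel Lab ΔE between the reference and the new image; this is the
// kind of delta image perceptuallyCompare inspects (identical images
// produce an all-zero output).
func deltaE(_ old: CIImage, _ new: CIImage) -> CIImage {
    old.applyingFilter("CILabDeltaE", parameters: ["inputImage2": new])
}

// Hypothetical helper: average pixel value of an image, obtained by
// rendering the 1x1 output of CIAreaAverage.
func averagePixelValue(of image: CIImage, context: CIContext) -> Float {
    let averaged = image.applyingFilter(
        "CIAreaAverage",
        parameters: [kCIInputExtentKey: CIVector(cgRect: image.extent)]
    )
    var pixel = [UInt8](repeating: 0, count: 4)
    context.render(
        averaged,
        toBitmap: &pixel,
        rowBytes: 4,
        bounds: CGRect(x: 0, y: 0, width: 1, height: 1),
        format: .RGBA8,
        colorSpace: CGColorSpaceCreateDeviceRGB()
    )
    return Float(pixel[0]) / 255
}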

In the end it turned out to be some simulator (Metal?) issue. I changed the simulator from iPhone 15 Pro (iOS 17.0.1) to iPhone 15 (iOS 17.0.1) and it worked (fastlane was already using the iPhone 15).

[Attached screenshots: deltaOutputImage and thresholdOutputImage]

Maybe it is possible to add some additional safeguard check, e.g. around

guard actualPixelPrecision < pixelPrecision else { return nil }

could we add additional validation that deltaOutputImage is actually an empty image before returning success?
Also, why do we compute the actual pixel precision from the thresholdOutputImage, which is generated using perceptualPrecision?
That does not seem right to me, but maybe I am missing something? 😬
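
Something like the following is what I have in mind - purely a sketch that assumes perceptuallyCompare's surrounding variables (deltaOutputImage, context, actualPixelPrecision, pixelPrecision) and reuses the hypothetical averagePixelValue helper from above:

// Before declaring success, double-check that the delta image really is
// (close to) empty; if pixel precision passes while the delta image shows
// differences, the threshold kernel probably never ran.
if actualPixelPrecision >= pixelPrecision {
    let deltaAverage = averagePixelValue(of: deltaOutputImage, context: context)
    // NOTE: a real check would tolerate deltas below the configured ΔE
    // threshold rather than demanding an exactly zero average.
    guard deltaAverage == 0 else {
        return "Pixel precision passed, but the delta image is non-empty; "
            + "the perceptual comparison may not have run."
    }
    return nil // success, no failure message
}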

FYI: @stephencelis @mbrandonw
I think this is a critical issue, as it can cause false positives. We may need to investigate it more deeply. 🤔
I can try to have a look later this week if no one gets there before me - I can also try it on my "broken" simulator.

> If your local machine is Apple Silicon (arm64) and the CI runner is Intel, this is a known architecture issue.

Local and CI runner are both Apple Silicon.

I already reset and changed the simulator, but I saw that the resulting image from my local run and from the CI runner is not the same: the resolution and image size are different, so I think this is the root cause. I also ran fastlane locally and it succeeded, but it failed with fastlane on the runner.
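
One mitigation worth trying for the resolution mismatch (a sketch; whether it helps depends on the setup) is pinning the rendering traits so the snapshot's pixel dimensions do not depend on whichever simulator the tests land on. The scale value here is an assumption; it should match the existing references:

import SnapshotTesting
import UIKit
import XCTest

final class PinnedScaleTests: XCTestCase {
  func testView() {
    let view = UIView(frame: CGRect(x: 0, y: 0, width: 100, height: 100))
    // Fix the display scale so a 2x vs 3x host simulator does not change
    // the recorded image size.
    assertSnapshot(
      of: view,
      as: .image(traits: UITraitCollection(displayScale: 2))
    )
  }
}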

@goodboygb1
A different image resolution indicates that you may be running a different simulator locally and in CI. This can occur when the tests run on a 2x device in one place and a 3x device in the other. I would check that first.
In order to have reliable snapshot tests you need to strictly control the environment (a fail-fast check is sketched below):

  • Xcode version (and hence the iOS base SDK the app is built against)
  • iOS version in the simulator (different simulator OS versions render the same content differently)
  • iPhone model (different screen resolutions can cause issues, see 2x/3x Retina screens)
  • Host platform - I believe in the past there were issues with tests run across Intel and Apple processor architectures

In my personal experience, keeping the same simulator model and OS version had the biggest impact.
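
A fail-fast environment check could look roughly like the sketch below; the expected values and the model identifier are assumptions to be adapted to whatever setup the references were recorded on.

import UIKit
import XCTest

// Hedged sketch: fail fast when the simulator drifts from the one the
// reference snapshots were recorded on, instead of producing confusing
// image diffs. All expected values are assumptions.
func assertSnapshotEnvironment(
    expectedOSVersion: String = "17.0.1",
    expectedScale: CGFloat = 3,
    expectedModel: String = "iPhone15,4", // hypothetical: iPhone 15
    file: StaticString = #file,
    line: UInt = #line
) {
    XCTAssertEqual(
        UIDevice.current.systemVersion, expectedOSVersion,
        "References were recorded on iOS \(expectedOSVersion)",
        file: file, line: line
    )
    XCTAssertEqual(
        UIScreen.main.scale, expectedScale,
        "References were recorded on a \(Int(expectedScale))x screen",
        file: file, line: line
    )
    // SIMULATOR_MODEL_IDENTIFIER is set by the iOS simulator runtime.
    XCTAssertEqual(
        ProcessInfo.processInfo.environment["SIMULATOR_MODEL_IDENTIFIER"],
        expectedModel,
        "References were recorded on \(expectedModel)",
        file: file, line: line
    )
}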