softwaremill / ox

Safe direct style concurrency and resiliency for Scala on the JVM

Home Page: https://ox.softwaremill.com


Mocked time

scf37 opened this issue · comments

Add delay and now functions. Add a test runtime with an emulated clock, to test code with delays without spending actual wall-clock time on them.

This would be invaluable for testing asynchronous code. I'm not sure if this is even possible with the current Loom design, but it might be worth a try.

There is a similar feature in cats-effect, a separate effect evaluator with emulated time: https://typelevel.org/cats-effect/docs/core/test-runtime#mocking-time

I agree, that would be a great feature to have. One problem that I'm currently not sure is solvable is detecting when time can be "pushed forward". This should be done only when all threads of execution have suspended (either on a delay or on some condition, e.g. reading from a blocking queue). In cats-effect or ZIO that's possible, as you are in control of the runtime, so you know when all of your fibers are suspended, and none can make progress unless the time advances. With Loom, you'd have to somehow inspect the state of all currently running virtual threads. I don't think there's an API for that currently, and even if there were, there would also be the problem of scoping the checks: we only really want to wait until the virtual threads created by some parent scope are suspended, not necessarily all in the whole JVM.

A project with similar goals (though no longer developed / maintained) is https://github.com/devexperts/time-test, but it probably doesn't work with Loom.

IMHO this can only work if getting the current time and waiting for something come together, e.g.:

trait Clock {
  // Returns the simulated time
  def currentTime: Instant
  // Waits for a specific amount of time
  def wait(duration: FiniteDuration): Unit
  // Waits until the given instant is reached
  def waitUntil(i: Instant): Unit
}

In my opinion, it shouldn't be necessary to track the suspension state of all other threads, as long as the clock is monotonically increasing, because other threads can't make any stronger assumptions. And if something crashes because the time advanced, well, that's exactly what a test should uncover.

On the other hand, all (testable) access to time/waiting must then go through this clock, maybe via some scoped value?

Yes, we can make this available via a scoped value, but what would the wait method do then? It wouldn't know whether it's fine to advance the clock (and "pretend" that e.g. 10 seconds have passed), as other threads might be runnable and have something to do at the "current" time. So such a test would behave very differently from one run using wall-clock time.
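For reference, the "scoped value" part alone could be sketched with `scala.util.DynamicVariable` (backed by an `InheritableThreadLocal`, so threads forked inside the scope inherit the clock). All names here (`Clock`, `WallClock`, `withClock`) are hypothetical, not ox APIs; `wait` is renamed to `waitFor` only to avoid confusion with `Object.wait`:

```scala
import java.time.Instant
import scala.concurrent.duration._
import scala.util.DynamicVariable

trait Clock {
  def currentTime: Instant
  def waitFor(duration: FiniteDuration): Unit
}

// Default implementation: wall clock, really sleeps.
object WallClock extends Clock {
  def currentTime: Instant = Instant.now()
  def waitFor(duration: FiniteDuration): Unit = Thread.sleep(duration.toMillis)
}

object Clock {
  // DynamicVariable is backed by an InheritableThreadLocal, so threads
  // forked inside withClock see the same (possibly simulated) clock.
  private val current = new DynamicVariable[Clock](WallClock)

  def get: Clock = current.value

  // Runs `body` with all clock access redirected to the given clock.
  def withClock[T](clock: Clock)(body: => T): T =
    current.withValue(clock)(body)
}
```

This only solves the injection side; as noted above, it says nothing about when a simulated `waitFor` may safely advance time.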

The idea is to write tests assuming that execution of non-blocking code takes zero time, which is true in most practical cases involving wait(duration) calls.

I think it depends on the test scope. But if wait returns immediately and records the advanced time, it should work in some cases. For "smaller tests" this could already be useful, e.g.:

class RetryPolicy(val retryTime: FiniteDuration) {
  def retry3Times[T](f: => T): T = { /* ... */ ??? }
}

// The test
val retryPolicy = RetryPolicy(/* configure */)

intercept[CouldNotRetryAnymore] {
  retryPolicy.retry3Times { throw RuntimeException() }
}

clock.waitedTime shouldBe retryPolicy.retryTime * 3
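A minimal sketch of such a clock, where waits return immediately and only advance the simulated time (the `waitedTime` accessor mirrors the assertion above; all names are hypothetical, not an ox API):

```scala
import java.time.Instant
import scala.concurrent.duration._

// Simulated clock: wait() does not block; it only advances the simulated
// time, so retry/backoff tests run in zero wall-clock time.
class SimulatedClock(start: Instant = Instant.EPOCH) {
  private var now: Instant = start
  private var waited: FiniteDuration = Duration.Zero

  def currentTime: Instant = synchronized { now }

  // Total simulated time spent waiting, for assertions in tests.
  def waitedTime: FiniteDuration = synchronized { waited }

  def wait(duration: FiniteDuration): Unit = synchronized {
    now = now.plusMillis(duration.toMillis)
    waited += duration
  }

  def waitUntil(i: Instant): Unit = synchronized {
    if (i.isAfter(now)) {
      waited += java.time.Duration.between(now, i).toMillis.millis
      now = i
    }
  }
}
```

This is exactly the "non-blocking code takes zero time" assumption: the clock jumps forward on every wait, and a single-threaded test can then assert on the accumulated waiting time.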

Ah, good to share some examples :) Something that I had in mind:

val i = AtomicInteger(0)
fork {
  i.getAndAdd(1)
}

sleep(10.seconds)
assert(i.get() == 1)

In (almost ;) ) every "normal" execution that test would pass. But if sleeps don't sleep, it would fail.

Isn't that good rather than bad? :-)
This is a data race "solved" by a sleep call instead of joining the forked thread. Both linearizations are valid, and the test just uncovers the illegal synchronization. Therefore my point is: emulated sleeps do not introduce new linearizations, they only shift their probabilities.

Yes, you're right in principle, but then ... what are the usual cases for having sleeps in your code in the first place?

One non-artificial example that comes to my mind is sending pings over a web socket:

fork {
  forever {
    ws.send(Ping())
    sleep(1.second)
  }
}

ws.send(msgsToSendChannel.receive())

With the sleep-doesn't-sleep approach, the test might end up sending an infinite stream of pings, and no messages at all. Or vice versa.

I see two alternatives in your example:

  • We wait a minimal time (like 5ms), so that a parallel thread can check its exit assumptions; but that wouldn't make for very good unit tests, as we'd need statistical results.
  • The clock has some upper boundary for the simulated time, and we stop the child threads once this time is over. Then no real waiting has to be done:

val clock = SimulatedClock(simulatedTime = 1.minute)
clock.run {
  // Adam's example
}
pingReceived.count shouldBe 60
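The second alternative could be sketched roughly as follows, with a hypothetical `BoundedClock` whose wait throws once the simulated budget is exhausted (real child threads would instead be interrupted; the exception is a single-threaded stand-in for that):

```scala
import scala.concurrent.duration._

// Sketch: a clock with an upper bound on simulated time. Once the budget
// is reached, wait() signals the code under test to stop; no real waiting
// ever happens.
class BoundedClock(budget: FiniteDuration) {
  private var elapsed: FiniteDuration = Duration.Zero

  final class TimeBudgetExceeded extends RuntimeException

  def wait(duration: FiniteDuration): Unit = synchronized {
    // Stop once the simulated time reaches the budget.
    if (elapsed + duration >= budget) throw new TimeBudgetExceeded
    elapsed += duration
  }

  // Runs `body` until the simulated budget is exceeded, then returns.
  def run(body: => Unit): Unit =
    try body
    catch { case _: TimeBudgetExceeded => () }
}
```

With a one-minute budget, a ping-every-second loop then performs exactly 60 iterations before `run` returns.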

In the end it looks like multiple clock implementations are worth discussing, depending on the testing approach.

Cases I've encountered (and written tests for!) are:

  • retries
  • rate control (call remote service with at most 10 rps)
  • scheduling
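Rate control, for instance, becomes directly testable once the limiter takes its waits from an injectable clock. A sketch, with all names (`RateLimiter`, `acquire`) being hypothetical:

```scala
import scala.concurrent.duration._

// Hypothetical rate limiter: at most `maxPerSecond` permits per second,
// waiting via an injectable sleep function instead of Thread.sleep.
class RateLimiter(maxPerSecond: Int, sleep: FiniteDuration => Unit) {
  private var usedThisSecond = 0

  def acquire(): Unit = {
    if (usedThisSecond == maxPerSecond) {
      sleep(1.second) // wait for the next one-second window
      usedThisSecond = 0
    }
    usedThisSecond += 1
  }
}
```

In a test, `sleep` only records how long the limiter would have waited: 25 acquisitions at 10 rps must cross two window boundaries, i.e. accumulate 2 simulated seconds of waiting, without the test itself taking any time.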

Additionally, from my experience of writing emulated-time tests (for Future, though, which is much easier): all waits are coordinated by the test. When the code under test calls the wait() function, it blocks indefinitely until:
a) the test calls the tick() function, which releases the single wait call with the closest deadline, or
b) the test calls the waitUntil(time) function, which releases all waits before the given timestamp.

edit: here is some inspiration: https://gist.github.com/scf37/4071839f25e197e38e4b070cfbed977b
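The tick()/waitUntil() coordination described above could be sketched like this (a rough hypothetical implementation, not an ox API; the test-side release-all operation is named `advanceTo` here to avoid clashing with the waiter-side `waitUntil` from the earlier Clock sketch):

```scala
import java.time.Instant
import java.util.concurrent.locks.ReentrantLock
import scala.collection.mutable
import scala.concurrent.duration._

// Test-coordinated clock: code under test blocks in wait(); the test
// thread releases waiters with tick() (wakes the waiter with the earliest
// deadline) or advanceTo(t) (wakes every waiter with deadline <= t).
class CoordinatedClock(start: Instant = Instant.EPOCH) {
  private val lock = new ReentrantLock
  private val woken = lock.newCondition()
  private var now = start
  // Pending deadlines, earliest first (PriorityQueue dequeues the "max").
  private val pending =
    mutable.PriorityQueue.empty[Instant](Ordering.by[Instant, Long](_.toEpochMilli).reverse)

  def currentTime: Instant = { lock.lock(); try now finally lock.unlock() }

  // Called by code under test: blocks until the test advances the clock
  // past this wait's deadline.
  def wait(duration: FiniteDuration): Unit = {
    lock.lock()
    try {
      val deadline = now.plusMillis(duration.toMillis)
      pending.enqueue(deadline)
      while (now.isBefore(deadline)) woken.await() // loop guards spurious wakeups
    } finally lock.unlock()
  }

  // Called by the test: release the single wait with the closest deadline.
  def tick(): Unit = {
    lock.lock()
    try if (pending.nonEmpty) { now = pending.dequeue(); woken.signalAll() }
    finally lock.unlock()
  }

  // Called by the test: release all waits with deadlines up to `t`.
  def advanceTo(t: Instant): Unit = {
    lock.lock()
    try {
      while (pending.nonEmpty && !pending.head.isAfter(t)) pending.dequeue()
      now = t
      woken.signalAll()
    } finally lock.unlock()
  }
}
```

The key property is that simulated time only moves when the test says so, which linearizes the interesting schedules from the test's point of view, even though the waiters themselves run on real (virtual) threads.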

@scf37 this sounds more or less like mocking the clock to me, so that the test itself is capable of releasing waits.

This is a perfect fit for the complicated cases, which involve multiple threads.

In my small-unit-test thinking, I would like to get rid of threads altogether to get deterministic behaviour. We can run a function which itself calls wait/waitFor and has some stopping criteria (e.g. when the simulated time is out, or some other explicit behaviour).

Consequently this leads to multiple clock implementations.

I would like to get rid of threads altogether to get deterministic behaviour

The provided mock does exactly that: it performs the linearization by adding chunks of code to a single queue executed by the test.

Unfortunately there is no API in Loom to access such "chunks" (a.k.a. coroutines/continuations), therefore I don't know how to implement deterministic tests outside of Futures. Still, emulated delays are much better than nothing.