Conditionally perform assertions, allow tests to commit to the results

Question

Conditionally perform assertions, allow tests to commit to the results

danr opened this issue 6 years ago · comments

Dan Rosén commented 6 years ago

Please see #1692 (comment) for the accepted proposal and #1692 (comment) for follow-up.

Original issue follows:

Description: add observeTest and commitToResult

Right now we have a function test of which the essence has this type signature:

const test: (t: TestContext) => PromiseLike<void> | Iterator<any> | Observable | void

It would be helpful for library writers if there was a function:

const observeTest: 
  (test: (t: TestContext) => PromiseLike<void> | Iterator<any> | Observable | void) => 
  {success: boolean, test_result: TestResult}

edit: fixed the type signature

For my purposes TestResult can be a completely opaque type.
The use for it is to make this a result for the main driver. For the lack of a better name I will call it commitToResult for now. This could either be a function exported by AVA (or a method on AVA's exported test), or a method on TestContext:

const commitToResult: (test_result: TestResult) => void

or

interface TestContext {
    ...
    // stop the execution of the test now reporting this result:
    commitToResult: (test_result: TestResult) => void
}

Use case: property based testing

The use case for this is property-based testing, which was first introduced in Haskell with the package QuickCheck.

There are quite a few implementations for JavaScript, for example jsverify, testcheck-js. The idea is to generate random data and test if functions satsify properties run on these data to approximate universal quantification. If the test fails it is common practice in these libraries to try to shrink the generated data. Then the user can be presented with a small (and hopefully minimal) counterexample.

The library writer of a property-based testing would then run tests written using asserts as in AVA-style, but behind the hoods it would be run with observeTest, and if it fails start a loop trying to shrink the randomly generated input until it cannot be shrunk anymore and then report the smallest possible failing TestResult using commitToResult.

There is intergration for AVA in
testcheck-js/AVA-check, but it has to use the AVA internals and is thus a pain to maintain, see for example this commit in a PR by me: leebyron/testcheck-js@65ef263 So there would be clear benefits for library writers if AVA could expose these two functions.

Mark Wubben · Answer 1 · Wed Feb 07 2018 20:14:27 GMT+0800 (China Standard Time)

@danr interesting!

AVA doesn't let you create new tests once tests have started running. I can't quite tell how testcheck-js gets around this. But as you say, this is about using AVA's assertion library. Perhaps we can support creating a new ExecutionContext (the t argument) that can be passed to the user's test implementation.

Taking this example:

const test = require('ava')
const { check, gen } = require('ava-check')

test('addition is commutative', check(gen.int, gen.int, (t, numA, numB) => {
  t.true(numA + numB === numB + numA)
}));

What if check() could do (in pseudocode):

function check(genA, genB, implementation) {
  return async t => {
    while (true) {
      const result = await t.try(implementation, genA(), genB())
      if (result.passed || cannotMinimizeFurther()) {
        return result.commit()
      }
    }
  }
}

With a rough type definition of:

interface ExecutionContext {
  try (implementation: (t: ExecutionContext, ...args: any[]) => void, ...args: any[]): Promise<TryResult>
}

interface TryResult {
  passed: boolean
  assertionErrors: AssertionError[]
  commit (): boolean
}

And in actual English: t.try() can be used to invoke a test implementation. It returns a promise with the result of the invocation. If passed is true then no assertions failed. Otherwise assertionErrors is populated with one or more errors. The usual rules around requiring assertions and awaiting asynchronous ones apply. commit() either passes the test or fails it with the assertionErrors. Not calling commit() results in a test failure unless another assertion is used.

I like how this is low-level, which means we can support interesting higher-order tests.

@avajs/core what do you think?

Dan Rosén · Answer 2 · Wed Feb 07 2018 23:07:38 GMT+0800 (China Standard Time)

@novemberborn: Thanks for your response and the write up.

Your try improves my observeTest but I would simplify the type to:

interface ExecutionContext {
  try (implementation: (t: ExecutionContext) => void): Promise<TryResult>
}

The call to t.try in your pseudo-code then becomes:

      const result = await t.try(t2 => implementation(t2, genA(), genB()))

Rationale: improved type-safety and smaller type signature for try.

(A name that is not a javascript keyword must be used instead of try)

Your commit improves my commitToResult becauses it is easier to use as a method.

I am a bit skeptical about this behaviour:

Not calling commit() results in a test failure unless another assertion is used.

Is this not overly restrictive? There should be no particular reason to make this a test failure by default.

Mark Wubben · Answer 3 · Thu Feb 08 2018 02:48:16 GMT+0800 (China Standard Time)

Your try improves my observeTest but I would simplify the type to:

Perhaps. We have a pattern of supporting additional arguments that get passed to the implementation, seems nice to retain that.

(A name that is not a javascript keyword must be used instead of try)

It should work as a property.

I am a bit skeptical about this behaviour:

Not calling commit() results in a test failure unless another assertion is used.

Is this not overly restrictive? There should be no particular reason to make this a test failure by default.

That's how AVA behaves out of the box. Tests must use assertions. IMHO not committing a t.try is akin to not actually using assertions.

Dan Rosén · Answer 4 · Thu Feb 08 2018 03:03:53 GMT+0800 (China Standard Time)

Alright! I retract all those remarks then :)

Dan Rosén · Answer 5 · Wed Feb 14 2018 00:51:58 GMT+0800 (China Standard Time)

Any further comments on this?

Is it likely that this would be accepted upstreams?

If so can anyone give me some pointers or how to implement what @novemberborn and I have discussed?

Mark Wubben · Answer 6 · Wed Feb 14 2018 01:58:31 GMT+0800 (China Standard Time)

@sindresorhus what do you think about my proposal?

Sindre Sorhus · Answer 7 · Sun Feb 18 2018 02:52:05 GMT+0800 (China Standard Time)

@novemberborn I like it. It's simple and can enable a lot of interesting use-cases. 👍 from me.

Mark Wubben · Answer 8 · Sun Feb 18 2018 22:55:48 GMT+0800 (China Standard Time)

Expanding on my earlier comment #1692 (comment) there's the question of what to do with t.context.

We do support t.context being created in .before() hooks. We then shallowly clone it for use by each test. We're hoping this strikes a balance between letting tests (or .beforeEach() hooks) make modifications without impacting other tests.

Perhaps we should do the same when passing the execution context to the t.try() implementation? Regardless I think we can agree that assigning t.context should not affect the "parent".

Should we allow recursively calling t.try()? I suppose ideally there is no difference between t.try() implementations and test implementations. This might just be how it shakes out anyhow — so let's assume we do allow this and reconsider if it proves difficult.

Does t.try() count as one assertion for t.plan() or is it transparent? Conversely can t.try() implementations use t.plan()? I'm leaning towards it being transparent, which means that implementations mustn't be able to call t.plan(), and even when the result is committed the implementation still must have at least one passing assertion.

Ordinarily assertions are implemented in lib/assert.js but I don't think try() can do its work there. It should be added to ExecutionContext in lib/test.js instead.

Test needs to support a new assertion type, along the lines of addPendingAssertion. pendingAssertionCount should be incremented so the test fails if the promise returned by t.try() is not awaited. We need to instantiate a new ExecutionContext, but we can't pass it the Test instance. Perhaps inside the ExecutionContext we should rename "test" to "host". That way t.try() can create a new host object. There could be a Host class which is extended by Test and which implements the assertion counting and snapshot-supporting methods.

This is the point where my architecting runs into "this should really be played with in code" so I'm going to stop. @danr let me know if you want to take this on, happy to review code and give pointers!

Mark Wubben · Answer 9 · Tue Feb 20 2018 00:32:10 GMT+0800 (China Standard Time)

Does t.try() count as one assertion for t.plan() or is it transparent? Conversely can t.try() implementations use t.plan()? I'm leaning towards it being transparent, which means that implementations mustn't be able to call t.plan(), and even when the result is committed the implementation still must have at least one passing assertion.

I've thought about this some more and I think it's wrong. Potentially a third-party module is used which then calls a user-provided implementation. Users may want to plan the number of assertions in their implementation. The third-party module wouldn't be able to do this.

Instead we should not allow t.try() when t.plan() is used, but allow t.plan() inside an implementation.

Dan Rosén · Answer 10 · Tue Feb 20 2018 01:08:08 GMT+0800 (China Standard Time)

Hey @novemberborn, great to see your interest in this. I'm swamped time-wise right now so I can't contribute anything code-wise atm.

Instead we should not allow t.try() when t.plan() is used, but allow t.plan() inside an implementation.

I agree with this. Consider the quick-check usage: the test that is started using try is supposed to look just like a normal test for the end-user, so ideally they should be able to use plan. Also: in the quick-check case there is no point in running plan outside the try. (edit: used incorrect idiom)

Mark Wubben · Answer 11 · Mon Aug 27 2018 19:28:49 GMT+0800 (China Standard Time)

We should allow the test result to be discarded, perhaps without even waiting for the promise returned by t.try() to be fulfilled. Perhaps by adding a discard() method on the promise. This would allow calling code to manage timeouts, see #1918 (comment).

dashuser33 · Answer 12 · Sat Sep 22 2018 22:31:12 GMT+0800 (China Standard Time)

I arrived at this issue searching for a way to retry assertions until they succeed, similar to how Scala Specs2 does it https://github.com/etorreborre/specs2/blob/master/tests/src/test/scala/org/specs2/matcher/EventuallyMatchersSpec.scala

in pseudo-code:

test('random < 0.1', t => {
     t.eventually(
	t.true(Math.random() < 0.1)
     );
});

In spec2, by default it retries 40 times with a delay of 100ms, but both are configurable and have in the past needed to use with delay of 0ms and delay of 1000ms. One example of where this is interesting. This allows you to avoid adding sleep for testing slow async operations. It will keep retrying until ready or max retries is reached.

Is there any wait do that today?

dashuser33 · Answer 13 · Sun Sep 23 2018 04:36:02 GMT+0800 (China Standard Time)

To help anyone wanting the same feature, here is how I implemented it:

const test = require('ava');

function sleep(ms) {
  return new Promise(resolve => {
    setTimeout(resolve, ms);
  });
}

async function retry(t, fn) {
  for (let i = 0; i < 10; i++) {
    if (fn()) {
      t.pass();
      return;
    }
    await sleep(1000);
  }

  t.fail('failed after 10 attempts');

}


test('random < 0.1', async t => {
  await retry(t, () => Math.random() < 0.1);
});

Of course this has the drawback of only supporting true assertion. I might try to generalize this at some later point.

UPDATE: this is what I did: https://github.com/dashuser33/darkcoin-client/blob/master/src/lib/async.ts#L20

Vladimir Starkov · Answer 14 · Thu Jan 10 2019 20:31:39 GMT+0800 (China Standard Time)

I like property based testing too and used jsverify as in ava-jsverify by @imranolas. it worked nice but also augments Ava internals (I believe). so integration can be harsh sometimes.

Vladimir Starkov · Answer 15 · Thu Jan 10 2019 20:32:34 GMT+0800 (China Standard Time)

I was thinking about utilising macros for this purpose. do you think it is feasible?

Nikita Volodin · Answer 16 · Thu Jan 10 2019 23:09:15 GMT+0800 (China Standard Time)

@iamstarkov There is a work in progress solution sitting in the #1947. I've been using it on my project and it works for now. There will be changes before it will get merged into master, but if you really would like to, you can check it out at that specific commit bde2be1.

I also use jsverify but I modified the connector between ava and jsverify: https://gist.github.com/qlonik/9a297285284d71f7da47022f120ef4ad. The modification handles the logging slightly better, provides typings, and works with try-commit/discard pull request. It is written to work in a way similar to the original ava-jsverify. Maybe later on, it could be modified to be a macro.

Pierre Havelaar · Answer 17 · Fri Mar 22 2019 19:05:11 GMT+0800 (China Standard Time)

Please consider my pull request #2078

Mark Wubben · Answer 18 · Mon Sep 16 2019 02:10:29 GMT+0800 (China Standard Time)

We've landed @qlonik's PR and it'll be released soon. For follow-up issues see https://github.com/orgs/avajs/projects/1.