Reuse `mypy_primer` to check issue reproductions?

Question

A5rocks opened this issue 3 years ago · comments

As the title says, mypy_primer could totally detect diffs in mypy output for the repros attached to mypy issues. This could help find issues that a PR closes. However, it would require keeping an up-to-date listing of those repros, which is (or will be) a challenge.

Perhaps as an MVP, just get every code block from every open issue tagged "bug" and run mypy over them...?
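A minimal sketch of that MVP, assuming issue bodies are already available as Markdown strings and that mypy is installed in the current environment. The names `extract_python_blocks` and `run_mypy` are made up for illustration and are not part of mypy_primer. Only blocks explicitly tagged as Python are picked up, so untagged fences and pasted mypy output are skipped.

```python
from __future__ import annotations

import re
import subprocess
import sys
import tempfile
from pathlib import Path

# Built this way so the example can itself live inside a fenced block.
FENCE = "`" * 3

# Matches fences opened with a python / python3 / py info string (case-insensitive).
PYTHON_BLOCK_RE = re.compile(
    rf"^{FENCE}[ \t]*(?:python3?|py)[ \t]*\n(.*?)^{FENCE}[ \t]*$",
    re.DOTALL | re.MULTILINE | re.IGNORECASE,
)


def extract_python_blocks(issue_body: str) -> list[str]:
    """Return the contents of every Python-tagged fenced block in an issue body."""
    body = issue_body.replace("\r\n", "\n")  # normalise Windows-style line endings
    return [match.group(1) for match in PYTHON_BLOCK_RE.finditer(body)]


def run_mypy(code: str) -> str:
    """Type-check one snippet with the installed mypy and return its stdout."""
    with tempfile.TemporaryDirectory() as tmp:
        (Path(tmp) / "repro.py").write_text(code, encoding="utf-8")
        # Run from inside the temp dir so the reported path is always "repro.py",
        # which keeps the output comparable between two mypy versions.
        result = subprocess.run(
            [sys.executable, "-m", "mypy", "repro.py"],
            cwd=tmp,
            capture_output=True,
            text=True,
            check=False,
        )
    return result.stdout
```

Running `run_mypy` on each extracted block under the old and the new mypy and comparing the two outputs would give the kind of diff described above.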

Jelle Zijlstra · Answer 1 · Mon Nov 29 2021 10:53:59 GMT+0800 (China Standard Time)

@Akuli at some point ran a script that tried to repro open issues by running mypy on code samples. Not sure if it would be feasible to generalize that into something we run in mypy-primer.

EXPLOSION · Answer 2 · Mon Nov 29 2021 11:12:03 GMT+0800 (China Standard Time)

Yeah, I was originally thinking "this should be a script" but realized it would probably be more useful in mypy-primer (no need to tell false positives from false negatives for an issue, and it also adds more (and weirder!) lines of code to check for regressions).

Shantanu · Answer 3 · Mon Nov 29 2021 11:34:46 GMT+0800 (China Standard Time)

This sounds useful, although people's code blocks often aren't self-contained, so it might need some more heuristics in practice.
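One possible heuristic, sketched here as an assumption rather than anything the thread settled on: parse the snippet with `ast` and flag names that are read but never bound anywhere in it. The function name `unresolved_names` is hypothetical. The check ignores scoping and ordering, so it is only a rough filter for snippets that lean on code the reporter did not paste.

```python
from __future__ import annotations

import ast
import builtins

# Names mypy understands without an import, common in repros.
MYPY_MAGIC = {"reveal_type", "reveal_locals"}


def unresolved_names(code: str) -> set[str] | None:
    """Return names that are read but never bound in the snippet.

    Returns None when the snippet is not valid Python at all.
    """
    try:
        tree = ast.parse(code)
    except SyntaxError:
        return None

    bound: set[str] = set(dir(builtins)) | MYPY_MAGIC
    used: set[str] = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Name):
            if isinstance(node.ctx, ast.Load):
                used.add(node.id)
            else:  # Store or Del: the snippet binds this name itself
                bound.add(node.id)
        elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            bound.add(node.name)
        elif isinstance(node, ast.arg):
            bound.add(node.arg)
        elif isinstance(node, (ast.Import, ast.ImportFrom)):
            for alias in node.names:
                # "import os.path" binds "os"; "import x as y" binds "y"
                bound.add((alias.asname or alias.name).split(".")[0])
        elif isinstance(node, ast.ExceptHandler) and node.name:
            bound.add(node.name)
    return used - bound
```

A snippet for which this returns an empty set is a reasonable candidate to run as-is; `None` or a non-empty set suggests it needs manual attention or should be skipped.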

I think the best way to implement this is to create a separate repo with a GitHub Action that scrapes mypy issues and commits the code samples back to the repo in files named by issue number. We can then add this repo to mypy_primer. This would keep mypy_primer fast and relatively self-contained, and would avoid problems arising from GitHub rate limits.
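The "files named by issue number" step could look roughly like the sketch below. It assumes the scraping has already happened (for example via the GitHub REST endpoint that lists a repository's issues, filtered by label and state) and that the result is a mapping from issue number to code samples. The function name `write_snippets` and the `issue_<number>.py` naming scheme are assumptions for illustration; the `issue_` prefix is only there so each file name is a valid Python module name.

```python
from __future__ import annotations

from pathlib import Path


def write_snippets(snippets_by_issue: dict[int, list[str]], out_dir: Path) -> list[Path]:
    """Write each scraped code sample to a file named after its issue number."""
    out_dir.mkdir(parents=True, exist_ok=True)
    written: list[Path] = []
    for issue_number, blocks in sorted(snippets_by_issue.items()):
        for index, code in enumerate(blocks):
            # One sample: issue_123.py. Several samples: issue_123_0.py, issue_123_1.py, ...
            suffix = "" if len(blocks) == 1 else f"_{index}"
            path = out_dir / f"issue_{issue_number}{suffix}.py"
            path.write_text(code, encoding="utf-8")
            written.append(path)
    return written
```

A scheduled workflow in the separate repo could call this and commit whatever changed, so mypy_primer only ever sees a plain checkout and never talks to the GitHub API itself.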