tc39 / proposal-bigint-math

Draft specification for supporting BigInts in JavaScript’s Math methods.

Home Page: https://tc39.es/proposal-bigint-math/


Polymorphic Math methods, backwards compatibility, and polyfills

brad4d opened this issue

Suppose code was written with the knowledge that the Math methods will throw exceptions if given BigInt values.

// example
function myNumberOnlyApiMethod(numArray) {
  // This method should throw an exception if any of the values are BigInts
  const maxNum = numArray.reduce((prev, cur) => Math.max(prev, cur))
  // do further Number-only logic here
}

If the implementations of some methods change to allow BigInt values instead, that code will not behave as designed.
If it were necessary to call a new method to get the new behavior, then the potential for this bug would be eliminated.

It would be good to explicitly consider whether it is better to incur this risk than to add a new method.

An additional concern with changing the existing methods is that it puts transpilers/polyfillers in the awkward position of needing to defensively clobber all these builtin functions with bigint-aware versions. These polyfills need to be included even if nothing directly references bigint in the code, since an external API could produce bigints (e.g. Math.abs(externalFunction()) needs to be able to handle bigints in case externalFunction returns one).

This is made worse because any usages of Math.pow will now need to be polyfilled for bigint, and the polyfill can't be reasonably used in ES 2015 (or earlier) contexts, since it really wants to use ** syntax.

Thanks for the comments! 😄

Suppose code was written with the knowledge that the Math methods will throw exceptions if given BigInt values.
If the implementations of some methods change to allow BigInt values instead, that code will not behave as designed. If it were necessary to call a new method to get the new behavior, then the potential for this bug would be eliminated.
It would be good to explicitly consider whether it is better to incur this risk than to add a new method.

This situation requires two things to be simultaneously true:

  1. That a codebase is already supplying both Numbers and BigInts to the same function.
  2. That the function’s returning a BigInt (instead of throwing a TypeError) would materially change the codebase’s behavior.

I would be quite surprised if the first condition were true in any real-world codebase, and I would be even more surprised if both the first and second conditions were simultaneously true in any real-world codebase (actually breaking something). Having said that, I do accept that someone theoretically might have done something weird and brittle…

But this is the same as extending web APIs with new type overloading. Many web APIs’ functions have been extended to accept broader inputs without web-compatibility problems. Examples include (thanks @annevk):

  • Extending XMLHttpRequest’s send function to accept ArrayBuffers, ArrayBufferViews, Blobs, FormData, and URLSearchParams. (Note that some of these types were added before this function’s types were overloaded, e.g., ArrayBuffer was added in Firefox 4 and XMLHttpRequest’s send started accepting ArrayBuffers in Firefox 9.)
  • Extending addEventListener to accept an object instead of a boolean as its third argument. (This in fact was riskier than the BigInt Math change and the XMLHttpRequest changes, since supplying an object argument had previously not thrown a TypeError but rather was silently coerced to a boolean.)
  • Extending Canvas’s fill, stroke, clip, isPointInPath, and isPointInStroke functions to accept Path2Ds.
  • Extending numerous CSS properties to accept broader input values.

Such function-input changes have generally been considered “web compatible”. I don’t think there is anything unique about extending Math functions in this manner that differs from extending web-API functions in the same way.

So I would explicitly consider this risk to be small, the ergonomics (avoiding yet more globals for BigMath and DecMath) to be worth that small risk, and the change to be in keeping with precedents set by many previous changes to web-platform APIs’ functions. I will add this to the explainer when I have time.

However, if any browser team remains concerned about web compatibility, then use-counter data would be welcome, of course.


An additional concern with changing the existing methods is that it puts transpilers/polyfillers in the awkward position of needing to defensively clobber all these builtin functions with bigint-aware versions. These polyfills need to be included even if nothing directly references bigint in the code, since an external API could produce bigints (e.g. Math.abs(externalFunction()) needs to be able to handle bigints in case externalFunction returns one).

I would imagine that any polyfilling would continue to be opt-in. Libraries generally advertise any dependencies on language features they have, e.g., “This library requires promises.” I’m not sure how extending Math would be different—if a codebase has a dependency on a library that requires Promise, then it needs to include a Promise polyfill; if it has no such dependency, then it does not need to include a polyfill. The same would go for BigInt Math: “This library requires BigInt Math.” A codebase that has no dependency on a library requiring BigInt Math would not need a BigInt Math polyfill.

This is made worse because any usages of Math.pow will now need to be polyfilled for bigint, and the polyfill can't be reasonably used in ES 2015 (or earlier) contexts, since it really wants to use ** syntax.

BigInts were added to the language after **, so they cannot be used in ES2015 at all anyway. All engine versions that support BigInts also support **. (However, BigInt pow would be reimplementable without ** anyway.)
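For illustration, here is a square-and-multiply sketch of such a reimplementation (the helper name is hypothetical; BigInt(…) calls are used instead of literals so the code stays parseable at ES2015 syntax targets):

// Computes base to the power of exponent for BigInts without **.
function bigintPow(base, exponent) {
  const ZERO = BigInt(0);
  const ONE = BigInt(1);
  if (exponent < ZERO) throw new RangeError('Exponent must be non-negative');
  let result = ONE;
  while (exponent > ZERO) {
    if (exponent & ONE) result *= base; // multiply in for odd bits
    base *= base; // square
    exponent >>= ONE;
  }
  return result;
}

bigintPow(BigInt(2), BigInt(10)); // 1024n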

Thanks again for the comments! 😄

It is always the case that language builtins may stop throwing exceptions; code that relies on those exceptions is brittle. If you want to throw on BigInts reliably, you must check for them explicitly; that has always been the case for any type.
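For example, such an explicit check might look like this (hypothetical helper):

// Throws eagerly on BigInts (or anything else that isn't a Number),
// instead of relying on Math methods to keep doing so.
function assertIsNumber(value) {
  if (typeof value !== 'number') {
    throw new TypeError(`Expected a Number, got ${typeof value}`);
  }
  return value;
}

// e.g. numArray.reduce((prev, cur) => Math.max(prev, assertIsNumber(cur)))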

@shicks every single language change likely requires that; as one of the prominent polyfill authors, that’s not a burden, that’s just the way it works.

Slow rollout of engines with BigInt support (still only ~90%! Who wants to leave 10% of their potential users out in the cold? That's millions of people!), combined with the lack of polyfillability of overloaded operators, is a reason why many developers still use libraries (e.g. these) instead of native BigInts. Overloading Math.* functions is going to make that approach a whole lot messier. Adding new functions as e.g. BigInt.max() would be:

  • easier to implement for libraries (no need to monkey-patch Math)
  • faster to execute (no need to have a userland type check on every Math.* function call, which aside from its direct overhead would be almost guaranteed to disable compiler optimizations)
  • easier to transition off of when the time comes (when native support has reached >99% or whatever a given app picks as their threshold; note that reaching x% native BigInt support will likely happen years before reaching the same x% native support for whatever this proposal will end up being, so there'll likely have to be a staged migration away from polyfills/libraries).

I don’t see why it would make anything messier. It’s very easy to feature-detect whether a function works with BigInt or not, and the fewer-than-five authors who write these polyfills will handle that for everyone else.
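For instance, detection could look something like this (a hypothetical helper; it assumes the proposal’s semantics, under which a BigInt-aware Math.max would return a BigInt instead of throwing):

// Using BigInt(1) rather than the 1n literal keeps this parseable by
// pre-BigInt engines, where the call simply returns false.
function mathMaxSupportsBigInt() {
  try {
    return typeof Math.max(BigInt(1)) === 'bigint';
  } catch (err) {
    // Today: "TypeError: Cannot convert a BigInt to a number".
    return false;
  }
}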

It’s certainly true that support for BigInts and BigInt Math will not reach >99% prevalence for many years, and that BigInt polyfills will be needed for a long time. But I agree with @ljharb: I’m not sure why monkey-patching Math is different from monkey-patching the many web APIs that have already been gradually broadened.

As you say, any codebase that uses BigInts (but wants to accommodate the 10% of browsers that do not support them) cannot use a transparent polyfill anyway. (This is, of course, because the native BigInt API depends on syntax, like + and instanceof.) The codebase must instead use a non-polyfill library like Google’s JSBI, with the intent to later transpile their code into native BigInt code whenever they decide that browsers are ready for it, years later. So the codebase already would look like this:

JSBI.multiply(JSBI.BigInt(-2), JSBI.add(JSBI.BigInt(x), JSBI.BigInt(1)))

…with the intent to later be converted, years later, into:

-2n * (BigInt(x) + 1n)

However, the situation with transpiling operators is little different from having to transpile other functions like abs, pow, and such. Just as people don’t use + with JSBI BigInts, people aren’t going to use Math.abs with JSBI BigInts; JSBI instead would provide its own abs function (just as it provides its own add function). There’s no reason to monkey-patch Math when using JSBI (or whatever library):

JSBI.multiply(JSBI.BigInt(-2), JSBI.abs(JSBI.add(JSBI.BigInt(x), JSBI.BigInt(1))))

Years later, when the JSBI-using codebase gets transpiled to native BigInt syntax, calls to JSBI.abs etc. would be transpilable to Math calls:

-2n * Math.abs(BigInt(x) + 1n)

So to summarize what you just said: people won't be able to actually write Math.abs(some_bigint) for many years. You don't think that's a reason to prefer a different design, where the new stuff could actually be used soon?

I'm quite familiar with JSBI: I wrote it. And it's exactly what informs my opinion. Do you think that the whole situation around needing to use JSBI is desirable and should be repeated for more features? I think the contortions that people have to go through to use BigInts today, which are so much more cumbersome than most new JS features that can be polyfilled reasonably, are a lesson learned: don't spec future features in similar non-polyfillable ways.
(Related side note: it turns out that even the "transpile-JSBI-away" idea is, unfortunately, much easier said than done in a scenario where multiple libraries/components of a large system use BigInts and pass them between each other. Figuring out how to accomplish an atomic mass code migration, or whether an incremental transition could possibly be feasible, is a thing that developer teams actually worry about today.)

Monkey-patching comes into play once native BigInt support is sufficiently widely available that JSBI is no longer needed for emulating BigInts themselves (which brings a performance benefit).

Monkey-patching builtins is widely considered bad practice for a variety of reasons (not the least of which: compatibility issues which then impede TC39's ability to standardize future features, as happened before), but there's nuance:
  • Adding a nonexistent function (i.e., the pattern window.foo = window.foo || function FooPolyfill() { ... }) may be deemed acceptable for lack of a good alternative, and at least won't affect code that doesn't use window.foo.
  • Overwriting an existing function, however, does affect existing code that uses the function in question: it makes it slower (see the sketch below), both by adding type checks and by disabling compiler optimizations that replace the original function with a specialized instruction sequence (in a quick test with Math.pow, I'm seeing up to a 30× slowdown!). If you assume that code calling Math functions is typically performance-sensitive, incurring such a slowdown on existing code just so that some new code can rely on a polyfilled feature is likely unacceptable. (Another drawback of monkey-patching builtins becomes visible here: if you import two libraries, where one does the monkey-patching and the other relies on high performance, you have a problem.)
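For concreteness, the kind of overwrite being described looks roughly like this (an assumed shape, not any particular polyfill):

// Every Number-only call now pays a type check, and engines can no
// longer replace the call site with a specialized instruction sequence.
const nativePow = Math.pow;
Math.pow = function pow(base, exponent) {
  if (typeof base === 'bigint' && typeof exponent === 'bigint') {
    return base ** exponent; // ** itself is a syntax error below ES2016 targets
  }
  return nativePow(base, exponent);
};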

So yeah, you can demand existence of JSBI.abs (or MyShinyBigMathPolyfill.abs for that matter), and force developers to rewrite/transpile eventually. But at that point, why not simply spec it as BigInt.abs? Where's the drawback of that?

@js-choi About opt-in polyfilling, we have found this to be unscalable. The library is the thing that knows what it depends on, so relying on the application (maybe transitively) depending on it to aggregate all these "extra dependencies" via some sort of documentation side-channel doesn't really work, for the same reason that NPM doesn't leave installing all the transitive dependency libraries up to the end user. We've had great success with Closure Compiler as a central point of detection for which polyfills (both of language syntax and of standard runtime library) are required, allowing library authors to add polyfill dependencies as seamlessly as they can add ordinary library dependencies, without any work required by application authors (which would be a significant blocker in a "one version" monorepo).

While it may seem that ** is not a problem due to the order in which it came into the language, looking deeper reveals this as false security. As Jakob suggested, the problem of "how do you get a large codebase off of JSBI" is actually a lot more difficult than you might assume. JSBI is not interoperable with native BigInt, and data flow across application and library boundaries is often two-way, effectively requiring an infeasible atomic migration. The alternative is to add a compatibility shim that feature-detects and routes calls to either JSBI or native operators/methods depending on browser availability. (This shim can be compiled out of builds targeting modern browsers, but must remain in the fallback build, which may still see native BigInts in practice). When the fallback build targets ES 2015 or lower, ** becomes a syntax error, so it needs to be transpiled, even though (as we just established) it may end up running into native bigints. These sorts of transpilation/polyfill issues continue even past ES 2015 due to the performance impact of monkey-patching the builtins.
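Such a shim might look roughly like this (the module shape is hypothetical; JSBI.exponentiate is part of JSBI's actual API):

import JSBI from 'jsbi';

// Route to native BigInt where available, otherwise to JSBI. In an
// ES2015 fallback build, `a ** b` is itself a syntax error, so even
// the native branch must be transpiled or avoid the operator.
const hasNativeBigInt = typeof BigInt === 'function';

export const pow = hasNativeBigInt
  ? (a, b) => a ** b
  : (a, b) => JSBI.exponentiate(a, b);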

Our hope has been that, once native BigInt adoption reaches 99.x% and the final "highest-requirement" service finally stops demanding support for pre-BigInt browsers, then (and only then) we'll be able to actually get rid of JSBI and start using native BigInts everywhere. But if that would introduce these polyfill requirements on Math, then that basically moves the goalpost and pushes the GA date for BigInt back another 3-4 years.

don't spec future features in similar non-polyfillable ways

The committee has repeatedly rejected polyfillability as a constraint on language design; much of the language post-ES6 would have been designed differently, both before and after BigInt, if maintaining polyfillability were a constraint.

@ljharb It is surprising to me that, as a polyfill library maintainer, you aren't concerned by the performance implications of providing BigInt polyfills for Math.

There is no reason to overload the methods for BigInt; it doesn't help anyone. It just placates someone's desire not to have to decide on a new namespace (or to limit the number of namespaces).

Yes, the committee cannot be constrained by polyfills; otherwise you couldn't add features like WeakMap, WeakRef, or other new capabilities. But that doesn't mean the committee should not consider what the implications of those polyfills are for the ecosystem.

@concavelenz I don’t agree that those implications are particularly problematic, personally. Either way, it’s a better design for methods under “Math” to accept all mathematical data types. It very much helps me as a language user that Math methods work with all numeric primitives, and it would hurt me to needlessly increase the separation between BigInt and Number (and potentially Decimal in the future).

@ljharb The problem is that there already is a separation between BigInt and Number. The fact that Math and Number are two different namespaces is perhaps unfortunate, but as it currently stands, Math and Number operate only on Numbers. That's a pretty clean invariant to maintain. Extending this so that BigInt methods operate only on bigints (and eventually Decimal methods operate only on decimals) is, I would argue, more consistent than a situation where the Math namespace is a grab bag in which some functions work on bigints and others don't, and some can mix types but others can't.

I think that consistency is a subjective and variable thing, and it’s highly inconsistent that something named Math only works with one of the numeric primitives.

The separation between BigInt and Number isn’t complete; there are a number of operators that work on both.

First, I’d like to express gratitude towards everyone bringing their experience and insight about backwards compatibility, especially from Google Closure Compiler and from JSBI. Your concerns are important issues: thank you for raising them, and thank you for your patience in explaining them.

This topic was originally about backwards compatibility and polyfillability, so I think that discussing what benefit polymorphic Math methods would have over separate methods probably belongs in another issue (i.e., #14). My mental model as a developer has always been that Math methods are conceptually an “extension” of the math operators; I expect many other developers to share this mental model (@ljharb and @sarahghp seem to as well), and polymorphic Math methods would match it. Of course, as @ljharb says, consistency is subjective, but I think the benefits would be real. But, again, this part probably belongs on #14.

Back on topic: There are concerns about the transition period for a codebase after it switches to native BigInt primitives but still before it can use native polymorphic Math methods.

A polyfill that monkey-patches the Number-only native Math methods would bring performance problems for all Number-only uses of the Math methods. And it would be difficult to keep the polyfill opt-in only; keeping track of the documentation of transitive dependencies is unscalable. These are all certainly true.

These problems already have a solution from the web-API space: standalone polyfill implementations (non-shim polyfills).

During this transition period, standalone implementations would provide their own BigInt-supporting math methods separately from monkey-patching the Math methods. We would discourage any monkey-patching or shims of Math. Eventually, many years from now, when native polymorphic Math functions are ready, the codebase would switch its calls from the standalone polyfill implementation to the native Math functions. That is:

// During the transition period, native BigInts are available but native BigInt Math is not.
// Later, once the userbase is ready, all bigMax calls would be replaced with Math.max calls.
import { bigMax } from 'standalone-library-that-does-not-monkey-patch-anything';

bigMax(0n, 1n, 2n, 3n);
Math.max(0, 1, 2, 3); // Not affected by the library.

I think that discouraging Math monkey-patching / shims and encouraging “standalone implementations” (temporarily, until native BigInt Math support is widespread enough) may address concerns about the performance of polyfilling polymorphic Math.

Thank you all again for your insight and patience in explaining your concerns. Hopefully, encouraging standalone implementations rather than monkey-patching or shimming will address concerns about performance during that transition period.

[Edit 1: Replaced with alternative terminology as requested by @ljharb]

[Edit 2: See also tc39/proposal-decimal#31. I do agree with @littledan and @ljharb, although I appreciate @jakobkummerow‘s points.]

Let's please avoid usage of that particular term, though :-) The es-shims ecosystem calls a spec-compliant implementation a "polyfill" and installing that in the environment "shimming". We don't have to use the es-shims terms, of course, but I'd prefer we only popularize professional terms.

it’s highly inconsistent that something named Math only works with one of the numeric primitives

I think this is a wrong use of the word "inconsistent", but I mostly agree with the comment. Another term would be "unintuitive", but there's a contradiction: having only some methods work with BigInts may also be unintuitive. So a better term is "misleading": why shouldn't Math be more mathematical? Mathematics does symbolic computation; Math doesn't, since it works on positional notation in base/radix 2 (i.e., binary numerals), and that's "mathematical enough". But intentionally preventing Math from being generalized/extended is not mathematical at all. Then again, the purpose of Math was never to be truly mathematical, only "good enough" for most arithmetic operations with minimal-to-no algebra (Math does implicit algebra, in that logarithms solve exponential equations by numeric rather than symbolic computation).

But we haven't talked about bike-shedding: if Decimal ever gets added, ALL Math methods could be extended to operate on it; even transcendental ones like sin can be optimized by doing modular normalization/reduction.

Another point: the cognitive load of remembering which methods work with BigInt could be equivalent to the cognitive load of dealing with multiple "copies" of the same function across 2, 3, or even 4 different namespaces. I have no proof of this, but some polls might help check it.