No error_id loaded when catching an non-leaf::exception

Question

No error_id loaded when catching an non-leaf::exception

vector-of-bool opened this issue 3 years ago · comments

This one took a while to track down, but I have narrowed it down to this small snippet. This is admittedly convoluted in appearance, but I was able to run into this behavior on my own with a much more spread-out code.

Here are two test cases that exhibit two related (but unexpected(?)) behaviors:

Error Object is Dropped:

auto fun = [] {
    return leaf::try_catch([]() -> std::pair<int, int> {
        auto augment = leaf::on_error(info2{1729});  // [1]
        leaf::try_catch(
            [] { 
                throw my_exception(12);
            },
            [](const my_exception& e) {
                leaf::current_error().load(info1{42});  // [2]
                throw;
            });
        // unreachable
    }, [](const my_exception& e, info1* v1, info2* v2) {
        // Return the pair of {info1, info2}
        return std::make_pair(v1 ? v1->value : -1, v2 ? v2->value : -1);
    });
};
auto pair = fun();
BOOST_TEST_EQ(pair.first, 42);
BOOST_TEST_EQ(pair.second, 1729);
pair = fun();
BOOST_TEST_EQ(pair.first, 42);
BOOST_TEST_EQ(pair.second, 1729);

In this case

the bare throw my_exception does not initialize a new error_id in the current thread.
The handler will now try to .load() into the current in-flight error at [2].
- Since there is no new error_id in flight, it will attach info1 to whatever error_id just happened to be loaded in the current thread (Possibly just throwing the info away).
The exception is then re-thrown with a bare throw;. Still, no new error_id is generated.
The augment object's destructor at [1] will detect a new exception in-flight, but also detect that no new error_id has been created. It will then call new_error() (via error_monitor::check_id()) and attach an info2 to that error. The value of info1 is now inaccessible to the intended handler immediately below.

Additional quirk: If one moves the on_error object into the innermost throwing-lambda expression, then it's destructor will call new_error() (as expected!) before the exception is caught and this code will work.

Result differences:

auto fun = [](bool use_leaf_exception) {
    return leaf::try_catch([&]() -> std::pair<int, int> {
        auto augment = leaf::on_error(info2{1729}); // [1]
        leaf::try_catch(
            [&] { 
                if (use_leaf_exception) {
                    throw leaf::exception(my_exception(12));
                } else {
                    throw my_exception(12);
                }
            },
            [](const my_exception& e) {
                leaf::current_error().load(info1{42});  // [2]
                throw;
            });
        // unreachable
    }, [](const my_exception& e, info1* v1, info2* v2) {
        // Return the pair of {info1, info2}
        return std::make_pair(v1 ? v1->value : -1, v2 ? v2->value : -1);
    });
};
auto pair = fun(false);
BOOST_TEST_EQ(pair.first, 42);
BOOST_TEST_EQ(pair.second, 1729);
pair = fun(true);
BOOST_TEST_EQ(pair.first, 42);
BOOST_TEST_EQ(pair.second, 1729);

As a side effect of the prior quirk, the result will differ depending on whether leaf::exception() is called. In this case, if use_leaf_exception is true, then the correct result appears, otherwise it will fail

Emil Dotchevski · Answer 1 · Mon Nov 22 2021 11:17:50 GMT+0800 (China Standard Time)

You probably have a complete program, can you put it here or maybe on Godbolt? I'll take a closer look to confirm things work as intended, but at first glance I don't see a bug: current_error is documented as "returns the error ID from the last time new_error was called", which may very well have been handled already.

The correct technique is described here: https://www.boost.org/doc/libs/release/libs/leaf/doc/html/index.html#tutorial-interoperability. Essentially, you should use error_monitor, or better yet -- just use on_error.

If you think that there is a better way to support this use case, I'm all ears.

vector-of-bool · Answer 2 · Tue Nov 23 2021 07:10:59 GMT+0800 (China Standard Time)

It is correct that this behaves as documented, but I feel like there's an unexpected interaction between disparate parts. I can't say that the behavior is necessarily "wrong", but that it felt counterintuitive once I actually understood why code was behaving as it was.

The actual program itself is fairly large and WIP, but I've tweaked my code to behave "as expected."

I suppose it may be better to show more closely what I was trying to do, and why I was very confused initially:

void do_stuff() {
  auto augment = on_error(e_some_data{data});
  return try_catch(
    [] { return do_stuff_inner(); }, // Might throw
    [] (catch_<some_exception> e) {
      current_error().load(e_some_extra_info{e.matched.something});
      throw;
    });
}

If do_stuff_inner exits with a non-leaf::exception-wrapped some_exception thrown, then the current_error().load() will load into the thread's prior error. The augment object will then call new_error because there is a new exception in-flight without a new error_id having been set for the thread, thus throwing away the e_some_extra_info. My intention is that e_some_extra_info is only loaded when a some_exception is thrown, but otherwise not loaded.

If, however, the augment object is omitted, then the "correct" behavior is seen because new_error() will not be called (although this is admittedly suspicious because current_error() will return some arbitrary prior error_id that is possibly stale).

On the other hand, if the do_stuff_inner() throws with leaf::exception, then new_error() will have been called and everything behaves correctly. This is completely non-suspicious and is the ideal situation.

As a final note, if the code is changed as such:

void do_stuff() {
  return try_catch(
    [] {
      auto augment = on_error(e_some_data{data});
      return do_stuff_inner(); 
    },
    [] (catch_<some_exception> e) {
      current_error().load(e_some_extra_info{e.matched.something});
      throw;
    });
}

(That is: augment occurs inside of the TryBlock), then new_error will be called as the exception is on its way out of the block and before catch_ occurs, and thus again shows the "correct" behavior. In actuality, it might be the case that there is an on_error somewhere inside of do_stuff_inner(), so it can't be immediately known whether new_error() will be called before the catch_, unless you have total knowledge of the control flow path inside of do_stuff_inner().

I suppose my final point is that: You cannot locally reason about whether new_error() has been called, despite being in an error-handling scope.

One possible "solution" to such a quirk would be to have try_catch() behave similarly to an on_error/error_monitor and call new_error() if-and-only-if it detects a new exception in-flight with no new error_id set for the thread. However, this could be a breaking change if much code is already written to assume otherwise.

Thoughts?

Emil Dotchevski · Answer 3 · Tue Nov 23 2021 07:39:50 GMT+0800 (China Standard Time)

Does this behave the way you want? https://godbolt.org/z/Kj3e7dxzM

vector-of-bool · Answer 4 · Tue Nov 23 2021 08:18:04 GMT+0800 (China Standard Time)

That gets close, and I thought about that, but it presents another issue in that it unconditionally calls new_error() from within the handler. See: https://godbolt.org/z/afGPqrKT5: The info<3> from do_stuff_inner is lost because the augment in the catch_ block calls new_error() again.

Emil Dotchevski · Answer 5 · Tue Nov 23 2021 09:08:11 GMT+0800 (China Standard Time)

Yeah. My example also breaks if you change it to throw using leaf::exception: https://godbolt.org/z/5n4jd8K64

Emil Dotchevski · Answer 6 · Tue Nov 23 2021 11:56:05 GMT+0800 (China Standard Time)

I don't like this but I wonder if this is the correct thing to do? Please confirm that it fixes this particular problem.

e7f0089

What I'm doing is, in the on_error destructor, if there is a std::current_exception, check if it carries an error_id and only call new_error if it does not.

vector-of-bool · Answer 7 · Tue Nov 23 2021 13:14:24 GMT+0800 (China Standard Time)

That will allow the in-handler on_error to work, but seems to have a strange side effect: If I change the bare throw; into a throw leaf::exception(catch_.matched), now the outer augment for e_info<1> gets dropped and only the e_info<2> gets loaded.

I think I can boil down the question even simpler:

void foo() {
  auto augment = on_error(info1{});
  leaf::try_catch(
    [] { do_stuff(); },
    [] (catch_<something>) {
      current_error().load(info2{});
      throw;
    });
}

Given only this snippet, is it reasonable to expect that both info1 and info2 will be loaded onto the same error_id? Or are we required to know more about what do_stuff() might be doing?

Without any changes to try_catch or on_error, one might use a wrapping function to get a desired effect:

auto new_error_on_throw(auto fn) {
  error_monitor mon;
  try { return fn(); }
  catch (...) { mon.check_id(); throw; }
}

That is: Call fn, and if it exits via exception and there is no new error_id on the current thread, call new_error and re-throw:

void foo() {
  auto augment = on_error(info1{});
  leaf::try_catch(
    [] { new_error_on_throw(do_stuff); },
    [] (catch_<something>) {
      current_error().load(info2{});
      throw;
    });
}

vector-of-bool · Answer 8 · Tue Nov 23 2021 13:48:48 GMT+0800 (China Standard Time)

If you change the definition of do_stuff() in exception_on_error_test.cpp to this:

template <class Thrower>
void do_stuff( Thrower thrower )
{
  auto augment = leaf::on_error(e_info<1>{1});
  return leaf::try_catch(
    [&]
    {
        thrower();
        BOOST_ERROR("thrower must throw");
    },

    [ ]( leaf::catch_<some_exception> e)
    {
        auto augment = leaf::on_error(e_info<2>{2});
        throw leaf::exception(e.matched);
    } );
}

then you end up with a test failure:

leaf::verbose_diagnostic_info for Error ID = 5
Exception dynamic type: boost::leaf::leaf_detail::exception<some_exception const>
std::exception::what(): std::exception:
[with Name = e_info<2>]: 2
../test/exception_on_error_test.cpp(80): test 'r == 0' ('2' == '0') failed in function 'int main()'
1 error detected.

The second test (which uses a non-leaf::exception throw) still passes, so I'm not even sure what's going on in this case.

Emil Dotchevski · Answer 9 · Tue Nov 23 2021 14:27:29 GMT+0800 (China Standard Time)

The "solution" I was exploring is pretty bad, it is not a good idea to try/catch in on_error.

Does this work for you? https://godbolt.org/z/1x7EzMP1G

Emil Dotchevski · Answer 10 · Fri Nov 26 2021 01:40:24 GMT+0800 (China Standard Time)

Actually can I get more context? To me it doesn't make sense to want to augment only a particular kind of error with some info. The info is typically independent of the error, it is additive information applicable to any error.

vector-of-bool · Answer 11 · Sat Nov 27 2021 07:23:01 GMT+0800 (China Standard Time)

I've also been thinking about this issue as well. Here's what it looked like originally, which has since been cleaned up somewhat:

widget foo::parse_widget_json(json_lib::json_data data) {
  auto _ = on_error(e_parse_widget_json_data{data});
  return try_catch(
    [&] { return walk_json_as_widget(data); }
    [] (const json_lib::walk_exception& exc) {
      current_error().load(e_parse_widget_json_error{exc.what()});
      throw;
    });
}

The idea here was that the walk_exception was kept as the active exception along with any inner error information in-flight, but when that particular error was thrown, to add a e_parse_widget_json_error object into the mix, because a walk_exception wouldn't be enough that an upper handler would understand the error (this is arguably untrue, because of the e_parse_json_widget_data object in the other scope can disambiguate the error).

The issue I was seeing is that sometimes the walk_exception was thrown (by my own code) with a new_error (or an on_load causing a new_error), but sometimes the json_lib (which is unaware of LEAF) would just throw walk_exception without any LEAF-ism to invoke the new_error. e.g.:

void handle_data(json_data d) {
  if (some_condition(d)) {
    BOOST_LEAF_THROW_EXCEPTION(walk_exception{"Ouch"});
  } else {
    throw walk_exception{"Oof"};  // No new_error()
  }
}

(In reality, I could only control the true-branch, and the else-branch was part of the external library, but they both threw the same exception type.)

So the issue boiled down to: When I reach the exception handler for walk_exception, I couldn't be sure whether or not new_error had ever been called.

My (incorrect) expectation was that leaf_detail::try_catch_ would end up invoking new_error on the case of a new exception appearing without a new error having by set (which is the case for on_error).

Here's a hack that I just thought of that can ensure new_error is called:

void handle_data(json_data d) {
  auto _ = on_error(std::monostate{});
  if (some_condition(d)) {
    BOOST_LEAF_THROW_EXCEPTION(walk_exception{"Ouch"});
  } else {
    throw walk_exception{"Oof"};
  }
}

Because on_error will call new_error during stack-unwinding-on-exception if-and-only-if the current_error() == error_id_ (via error_monitor), it doesn't the work to ensure that new_error() is always called only once.

With this you might be able to hack together a semi-useful abstraction:

struct [[nodiscard]] new_error_on_throw {
  decltype(on_error(std::monostate{})) _ = on_error(std::monostate{});
};

void handle_data(json_data d) {
  new_error_on_throw _;
  if (some_condition(d)) {
    BOOST_LEAF_THROW_EXCEPTION(walk_exception{"Ouch"});
  } else {
    throw walk_exception{"Oof"};
  }
}

Emil Dotchevski · Answer 12 · Sat Nov 27 2021 11:21:52 GMT+0800 (China Standard Time)

I don't understand why is it necessary to grab the .what() and instead communicate it as a e_parse_widget_json_error. The .what() is available when you handle the error later on, right?

Emil Dotchevski · Answer 13 · Tue Dec 14 2021 07:09:05 GMT+0800 (China Standard Time)

@vector-of-bool I'm closing this but feel free to reopen it if you think I've missed something.

vector-of-bool · Answer 14 · Tue Dec 14 2021 08:27:37 GMT+0800 (China Standard Time)

Sorry to go quiet on it. I was able to work around the behavior pretty easily, but I may revisit if I can think of a clean solution to the more general confusion.

Emil Dotchevski · Answer 15 · Wed Dec 15 2021 02:24:33 GMT+0800 (China Standard Time)

I'm curious about your workaround.