Feature Request: Aggregate and/or Calculated Metrics

thomascleberg opened this issue 3 months ago

It would be fantastic to be able to aggregate and/or apply logic over Metrics in a custom way.

For example, if you had a requirement that context be both relevant and faithful, it would be nice to be able to implement the individual metrics context-relevant and context-faithful, a thresholded combination (context-relevant > n AND context-faithful > m), a product (context-relevant * context-faithful), et cetera.
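
To make the request concrete, here is a minimal Python sketch of the kind of combination described above. It is not promptfoo's API: the function names (`both_above`, `product`, `derive`) and the derived metric names are hypothetical, and it assumes each metric is already available as a float score keyed by name.

```python
from typing import Callable, Dict

Scores = Dict[str, float]
Derived = Callable[[Scores], float]


def both_above(n: float, m: float) -> Derived:
    """Hypothetical helper: 1.0 if both metrics clear their thresholds, else 0.0."""
    def check(scores: Scores) -> float:
        passed = scores["context-relevant"] > n and scores["context-faithful"] > m
        return 1.0 if passed else 0.0
    return check


def product(scores: Scores) -> float:
    """Hypothetical helper: multiply the two base metrics."""
    return scores["context-relevant"] * scores["context-faithful"]


def derive(scores: Scores, derived: Dict[str, Derived]) -> Scores:
    """Return the base scores plus each derived metric computed from them."""
    out = dict(scores)
    for name, fn in derived.items():
        # Derived metrics see only the base scores, not each other.
        out[name] = fn(scores)
    return out


base = {"context-relevant": 0.8, "context-faithful": 0.5}
result = derive(base, {
    "relevant-and-faithful": both_above(n=0.7, m=0.4),
    "relevant-times-faithful": product,
})
```

Here `result` keeps the two base scores and adds one entry per derived metric, so the thresholded check and the product can be reported side by side.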

Ian Webster · Answer 1 · Sat Apr 13 2024 04:04:22 GMT+0800 (China Standard Time)

Thanks for the suggestion @thomascleberg. Definitely interested in implementing this - will follow up once I open a PR

Ian Webster · Answer 2 · Mon Apr 15 2024 11:01:01 GMT+0800 (China Standard Time)

@thomascleberg WIP here: #670

Ian Webster · Answer 3 · Wed Apr 17 2024 23:04:31 GMT+0800 (China Standard Time)

This is now supported as of 0.53.0. Documentation here: https://promptfoo.dev/docs/configuration/expected-outputs/#creating-derived-metrics