How to detect bias and fairness

Question

How to detect bias and fairness

FJLopezGarcia opened this issue 3 months ago · comments

Javi commented 3 months ago

Hi @typpo

Is there any metric to detect bias and fairness? is there any example?

Thanks!

Ian Webster · Answer 1 · Fri Apr 12 2024 02:44:22 GMT+0800 (China Standard Time)

Yes, have a look at classifier grading.

To set up a bias metric, you can use the classifier assertion type with a model such as d4data/bias-detection-model.

For example:

assert:
  - type: classifier
    provider: huggingface:text-classification:d4data/bias-detection-model
    value: 'Biased'
    threshold: 0.5 # score for "Biased" must be greater than or equal to this value

Let me know if you have any questions!

Javi · Answer 2 · Tue Apr 16 2024 01:18:45 GMT+0800 (China Standard Time)

HI @typpo Is there any example code to follow? what are the steps we need to do to run the eval using promptfoo.

Ian Webster · Answer 3 · Tue Apr 16 2024 10:08:38 GMT+0800 (China Standard Time)

@FJLopezGarcia Have you been able to follow the guide here? If so, feel free to share the config and I can help you get it working.

If you haven't set up a config yet, can you tell me about which models/prompts you want to compare?

Javi · Answer 4 · Tue Apr 16 2024 21:49:44 GMT+0800 (China Standard Time)

Hi @typpo
Yes, below my "promptfooconfig.yaml" file.

prompts: [
          prompts/prompt1.txt, 
          prompts/prompt2.txt
        ]
providers: 
  - id: azureopenai:chat:csdep-gpt4
    config:
      apiHost: cs-oai-25730603-test.openai.azure.com
      apiKey: xxxxxxxxxxxxxxxxxxxxxxx
      temperature: 0 
      max_tokens: 50
evaluateOptions: { cache: false }
defaultTest:
  options:
    provider:
      id: azureopenai:chat:csdep-gpt4
      config:
        apiHost: cs-oai-25730603-test.openai.azure.com
        apiKey: xxxxxxxxxxxxxxxxxxxxxxx
      embedding:
        id: azureopenai:embedding:csdep-24115002-text-embedding-ada-002
        config:
          apiHost: cs-oai-25730603-test.openai.azure.com
          apiKey: yyyyyyyyyyyyyyyyyyy
  assert: 
    - type: javascript
      value: Math.max(0, Math.min(1, 1 - (output.length - 100) / 900))
tests: [
        tests/test_classifier_grading_bias.yaml,
        tests/test_deterministic.yaml,
        tests/test_graded_output_based.yaml #Enbeding required
        ]

and this is the test case i am creating -> test_classifier_grading_bias.yaml

- description: 'Test BIAS assertion'
  vars:
    query: Which companies were mentioned as leading the growth in the realm of growth clients?
    context: file://docs/financial.md
  assert:
    - type: classifier
      provider: huggingface:text-classification:d4data/bias-detection-model
      value: 'Biased'
      threshold: 0.5 # score for "Biased" must be greater than or equal to this value

I have created the HF_API_TOKEN from https://huggingface.co/settings/tokens
In which part of the config yaml do I need to provide the token?

Thanks a lot for you support!!

Javi · Answer 5 · Thu Apr 18 2024 22:28:34 GMT+0800 (China Standard Time)

Hi @typpo any help here will be appreciated. Thankls a lot"!

Ian Webster · Answer 6 · Thu Apr 18 2024 22:32:04 GMT+0800 (China Standard Time)

It looks ok at a glance. Is there a specific error you're encountering?

…

On Thu, Apr 18, 2024, 7:28 AM Javi ***@***.***> wrote: Hi @typpo <https://github.com/typpo> any help here will be appreciated. Thankls a lot"! — Reply to this email directly, view it on GitHub <#657 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AACLYJUB55OXGDWHBKNPIGDY57KCPAVCNFSM6AAAAABGCLUSFGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANRUGAYTCMJXG4> . You are receiving this because you were mentioned.Message ID: ***@***.***>

Javi · Answer 7 · Mon Apr 22 2024 23:24:07 GMT+0800 (China Standard Time)

Hi @typpo Here you have the error I am getting when i run my test using above configuration:

"test_classifier_grading_bias.yaml"

I have tried several things without success:

adding my HF_API_TOKEN in the environment variables:
adding my HF_API_TOKEN as apikey in the code:

The error:

Any idea?? Thanls a lot!!

Javi · Answer 8 · Wed Apr 24 2024 01:27:33 GMT+0800 (China Standard Time)

Hi @typpo any update on this? thanks

Ian Webster · Answer 9 · Thu Apr 25 2024 12:56:51 GMT+0800 (China Standard Time)

That error message would only happen if the API key is not actually set in the environment. Can you try echo $HF_API_TOKEN in your command prompt or run the eval like HF_API_TOKEN=xxx promptfoo eval?

If you want to put the credential in the yaml, you'd have to specify the provider like so:

provider:
  id: huggingface:...
  config:
    apiKey: xxx

Javi · Answer 10 · Thu Apr 25 2024 19:19:01 GMT+0800 (China Standard Time)

Hi @typpo thanks a lot for your reply!! I was able to run the eval using HF_API_TOKEN=xxx promptfoo eval

Reviewing the output it appears cutoff. Do you know the reason?
Regarding the score. 0,5 what exactly means?
Do you know if it works with different lenguajes (multi-lenguaje)?
What about fairness, is this model analizing bias and faircess at the same time?

Regarding the eval execution... I have tryed to setup my promptfooconfig.yaml in the following way, passing the provider and apikey (HF_API_TOKEN). And it works!!

This is my test:

Javi · Answer 11 · Mon Apr 29 2024 19:03:59 GMT+0800 (China Standard Time)

hi @typpo did you have the chance to take a look? thanks a lot

Javi · Answer 12 · Wed May 08 2024 15:42:13 GMT+0800 (China Standard Time)

Hi @typpo did you have the chance to take a look? best regards

Ian Webster · Answer 13 · Wed May 08 2024 15:58:58 GMT+0800 (China Standard Time)

Hey @FJLopezGarcia,

Reviewing the output it appears cutoff. Do you know the reason?

You've set max_tokens to 50, which limits the number of output tokens.

Regarding the score. 0,5 what exactly means?

The classifier outputs a score between 0.0 and 1.0 which indicates the level of bias. For details you'd have to look at the paper cited (https://github.com/dreji18/Fairness-in-AI). In general these are just continuous scores and I recommend you determine a threshold empirically by testing it on your own inputs.

Do you know if it works with different lenguajes (multi-lenguaje)?

According to the above link, it's trained on an English dataset.

What about fairness, is this model analizing bias and faircess at the same time?

Again I would refer you to the paper. I suppose "fairness" describes the result of an unbiased output.

Javi · Answer 14 · Thu May 09 2024 18:20:55 GMT+0800 (China Standard Time)

Hi @typpo Thanks a lot for your responses!!!

Shoudnt be the threshold behaviour the opposite?
I have setup my threshold = 0.5 and when I run the eval I get that the two first colums has 0.99 and 0.98 Biased and both appears as PASS. shouldnt be a FAIL?
Same for the 3er column, It has a 0.37 and shows a FAIL, shouldnt be a PASS?

Another question:
Instead of running the eval passing the HF_API_TOKEN I would like to add the HF_API_TOKEN in the config file.
I have tryed following your instrucctions here #657 (comment) but it doesnt work.

It doesnt work:

It works ruinning the eval in this way:
HF_API_TOKEN=hf_YYYYYYYYYYYYYY promptfoo eval

Javi · Answer 15 · Thu May 16 2024 19:03:28 GMT+0800 (China Standard Time)

Hi @typpo Any update on above two queries? thanks a lot for all your help!!

Javi · Answer 16 · Thu May 23 2024 15:41:06 GMT+0800 (China Standard Time)

Hi @typpo Any update on above two queries? thanks a lot for all your help!!

please review and answer remining queries. it would be great to be able to have the hf key in config and understand the bias ourput score.

best regards!!

Ian Webster · Answer 17 · Thu May 23 2024 16:05:47 GMT+0800 (China Standard Time)

If you want to invert the threshold behavior, change the assertion type to not-classifier instead of classifier
The API token issue should be fixed with #809

Thanks for flagging!

Javi · Answer 18 · Fri May 24 2024 06:40:57 GMT+0800 (China Standard Time)

The API token issue should be fixed with fix: huggingface api key handling #809

Thanks for flagging!

Hi @typpo still not able to successfully run promptfooconfig.yaml with the API token. Could you please provide an example? or should be fixed with next version (i have 0.59.1)?

Ian Webster · Answer 19 · Fri May 24 2024 06:48:14 GMT+0800 (China Standard Time)

The fix was recently merged, you'll have to either build from source or wait for the next release :) (which will be soon)

…

On Thu, May 23, 2024 at 3:41 PM Javi ***@***.***> wrote: - The API token issue should be fixed with fix: huggingface api key handling #809 <#809> Thanks for flagging! Hi @typpo <https://github.com/typpo> still not able to successfully run promptfooconfig.yaml with the API token. Could you please provide an example? — Reply to this email directly, view it on GitHub <#657 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AACLYJX7VRWHWYLBIMULTEDZDZWA7AVCNFSM6AAAAABGCLUSFGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCMRYGE2TCNJXGE> . You are receiving this because you were mentioned.Message ID: ***@***.***>