promptfoo / promptfoo

Test your prompts, agents, and RAGs. Redteaming, pentesting, vulnerability scanning for LLMs. Improve your app's quality and catch problems. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

https://www.promptfoo.dev/

promptfoo/promptfoo Issues

OpenRouter integration errors
Closed 4 months ago1
Use promptfoo for output evaluation/visualisation but not inferencing?
Closed 4 months ago6
Mistral provider is hard-coded to https://api.mistral.ai/v1/chat/completions
Closed 4 months ago1
Self-Hosting via provided Dockerfile does not work
Closed 4 months ago3
Feature Request: Add max_tokens config to Azure OpenAI provider
Closed 4 months ago3
How to generate separate report or a breakdown report per test suite instead of just one single report for all test suites?
Closed 4 months ago2
Feature request: Load var from all *.txt files in a folder
Closed 4 months ago2
Use the field "description" when loading tests from CSV
Closed 4 months ago1
Sample js for custom embedding
Closed 4 months ago6
Feature Request: Add tools and tool_choice config to Azure Openai provider
Closed 4 months ago3
response error while using replicate's Private model
Closed 4 months ago3
Web Viewer error: `e.score.toFixed` is not a function
Closed 4 months ago15
dd
Closed 4 months ago
Support Messages api in Bedrock for anthropic v3+
Closed 4 months ago3
example in assertion overview document
Closed 5 months ago1
can not load txt file in vars
Closed 5 months ago2
v0.45 breaks input data files
Closed 5 months ago1
`additionalOption` is not a dictionary in the `call_api` function from a python script
Closed 5 months ago1
version 0.45.0 breaks python providers
Closed 5 months ago6
Huggingface providers does not need apiKey to get the response
Closed 5 months ago1
ModuleNotFound error with custom python provider
Closed 5 months ago5
Claude3 Support
Closed 5 months ago5
HTTP 500 error upon clicking "Run Evaluation" on the UI
Closed 5 months ago3
Command line usage with -r --providers
Updated 5 months ago
parsing wrong path when test by list of path
Closed 5 months ago2
assertion pass when it should'nt
Closed 5 months ago2
Huggingface Api key through config results in error call
Closed 5 months ago3
Change eval name in UI
Closed 5 months ago2
Persist changes on self-deployed UI without sharing a new link
Closed 5 months ago2
How do you set a name for a provider when it's coming from a separate file?
Closed 4 months ago3
Can not replicate consistants behavior of "promptfoo view" in self hosted app.
Closed 5 months ago1
Could we get nunjucks and nunjucksFilters enabled for rubricPrompt?
Closed 5 months ago4
can we use repliacte's private models on promptfoo?
Closed 5 months ago1
Multiple prompts within one python file
Closed 5 months ago3
Failure to parse response from factuality checker results in mysterious `Invalid option: )` test failure
Closed 5 months ago1
Feature Request: How to evaluate Azure Open AI Assistant
Closed 4 months ago4
Failed to evaluate ollama model
Closed 5 months ago6
Problem using is-valid-openai-tools-call assertion
Closed 5 months ago2
Cannot set environment variables in self-hosted share server
Closed 5 months ago2
HuggingFace api key token configuration
Closed 5 months ago1
Context relevance test broken
Closed 5 months ago
error with `promptfoo view`
Closed 5 months ago2
Make the Config description visible on the view page
Closed 5 months ago1
Customizing the grader provider doesn't work
Closed 5 months ago2
A way to prevent vars array auto-expansion
Closed 5 months ago4
Response from Claude v2 (Bedrock) causes errors
Closed 5 months ago8
Bedrock anthropic Claude v2 - `llm-rubric produced malformed response:`
Closed 17 days ago1
Add bedrock:completion:anthropic.claude-v2.1
Closed 4 months ago6
python prompt display
Closed 5 months ago2
Invalid option: a
Closed 5 months ago1