There are 0 repository under ai-performance topic.
Code scanner to check for issues in prompts and LLM calls
Arbitrary Numbers
Building an AI team to play Codenames using top Large Language Models (LLMs), evaluating performance, and pitting them against each other. Explore their strategy and capabilities in this interactive competition!
KAI Data Center Builder
A streamlined and easy-to-use AI performance evaluation / summary template with modern UI in HTML, including correct percentage chart and comparison with other models, precision, recall, F1-score, and confusion matrix. Enables you to create the result chart within 3 minutes.
Open-source benchmark for real-world AI performance