There are 0 repository under evaluations topic.
Langtrace π is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. ππ»π
A framework to build scenario simulation projects where human and LLM based agents can participant in, with a user-friendly web UI to visualize simulation, support automatically evaluation on agent action level.
This library implements various metrics (including Kaggle Competition, Medicine) for evaluating ML, DL, AI models, and algorithms. πππππ
NLP tool for wide-range model reliability evaluations
The official repository for the paper "Paraphrase Detection: Human vs. Machine Content".
IELTS listening, speaking, reading and writing modules practice and evaluation with IELTS band calculation based on speech and text analysis and evaluation.
A functional chess game implemented in python, with pygame as a supporting graphics module.
Do a program evaluation that really counts! That will help other students and will put really make universities and colleges take students experiences to heart!
Moodle plugin for evaluations with Moodle. This is the evaluation activity plugin.
LLM evaluation framework
Evaluations for homeschoolers
Utilizing Apache Spark in Google Collab, Jupyter Notebook, Databricks
Program made for the first evaluation of object-oriented programming
Program made for the second evaluation of object-oriented programming
A filter that removes unconstructive comments from student course evaluations
Machine learning algorithms applied to explore Enron email dataset and figure out patterns about people involved in the scandal.