evaluations

There are 0 repository under evaluations topic.

Scale3-Labs / langtrace
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. 🚀💻📊
ai datasets evaluations gpt langchain llm llm-framework llmops observability open-source open-telemetry openai prompt-engineering tracing
Language:TypeScript 136
log10-io / log10
Python client library for improving your LLM app accuracy
agents ai artificial-intelligence autonomous-agents debugging llmops logging monitoring openai python rlhf evaluations feedback fine-tuning anthropic llms
Language:Python 77
boxbeam / Crunch
The fastest java expression compiler/evaluator
evaluating-mathematical-expressions evaluations
Language:Java 60
LLM-Evaluation-s-Always-Fatiguing / leaf-playground
A framework to build scenario simulation projects where human and LLM based agents can participant in, with a user-friendly web UI to visualize simulation, support automatically evaluation on agent action level.
llm-evaluation agent-based-simulation automation evaluations agent agents chatgpt
Language:Python 20
yisaienkov / evaluations
This library implements various metrics (including Kaggle Competition, Medicine) for evaluating ML, DL, AI models, and algorithms. 📐📊📈📉📏
evaluations python metrics metrics-library pypi python-library python3 kaggle kaggle-competition
Language:Python 14
reliability-checklist
Maitreyapatel / reliability-checklist
NLP tool for wide-range model reliability evaluations
evaluations language-model nlp-library reliability-benchmarking robustness
Language:Python 11
ComputerScienceHouse / conditional
CSH Evals, the modern way.
csh evaluations flask hacktoberfest
Language:Python 10
argrecsys / argael
ARGAEL is an open-source Java desktop application designed to maximize the experience and efficiency of the process of annotating and evaluating arguments in large text corpora.
arguments java annotations evaluations argument-tool argument-models
Language:Java 4
jonas-becker / pd-human-vs-machine-content
The official repository for the paper "Paraphrase Detection: Human vs. Machine Content".
datasets paraphrase-detection paraphrase-recognition paraphrased-data evaluations human-data machine-data natural-language-procressing nlp paraphrase-identification paraphrases
Language:HTML 3
ZainabZaman / IELTS_PracticeAndEvaluation
IELTS listening, speaking, reading and writing modules practice and evaluation with IELTS band calculation based on speech and text analysis and evaluation.
azure evaluations gpt-3 ielts ielts-exam ielts-learning ielts-listening ielts-speaking ielts-writing openai python speech-processing text-analysis
Language:Python 3
rJefferyXie / Chess-Program-with-Minimax-Visualizer
A functional chess game implemented in python, with pygame as a supporting graphics module.
minimax-algorithm chess player evaluations move-trees single-player alpha-beta-pruning
Language:Python 2
bhadresh-laiya / program-evaluation.com
Do a program evaluation that really counts! That will help other students and will put really make universities and colleges take students experiences to heart!
evaluation students-experiences colleges universities counts students program evaluations evaluation-data built using laravel6 laravel-framework blade-template
Language:PHP 1
HarryBleckert / moodle-mod_evaluation
Moodle plugin for evaluations with Moodle. This is the evaluation activity plugin.
evaluation evaluation-kit evaluations moodle moodle-plugin evaluations-with-moodle lehrveranstaltungsevaluationen moodle-activity
Language:PHP 1
brettdidonato / BSD_Evals
LLM evaluation framework
anthropic bigquery evaluation-framework evaluation-metrics evaluations gcp gemini-pro generative-ai google-cloud llms nl2sql openai text2sql
Language:Jupyter Notebook 0
CathyNickEvaluations / cathynickevaluations.github.io
Evaluations for homeschoolers
homeschooling homeschoolers cathy nick evaluation evaluations homeschool school test testing evaluate evaluator colorado westminster
Language:HTML 0
esleipness / fluiddataPySpark
Utilizing Apache Spark in Google Collab, Jupyter Notebook, Databricks
converting evaluations queries
Language:Jupyter Notebook 0
henrique-souza / evaluation_1_POO
Program made for the first evaluation of object-oriented programming
java evaluations java-library java-exercises object-oriented-programming college-project college-exercises
Language:Java 0
henrique-souza / evaluation_2_OOP
Program made for the second evaluation of object-oriented programming
java java-library java-exercises object-oriented-programming exceptions college-exercises evaluations
Language:Java 0
johngoeltz / course_evals
A filter that removes unconstructive comments from student course evaluations
evaluations comments constructive nlp
Language:Jupyter Notebook 0
moreirab / enron-scandal
Machine learning algorithms applied to explore Enron email dataset and figure out patterns about people involved in the scandal.
decision-tree enron-dataset enron-email-dataset enron-emails enron-fraud-case evaluations feature-selection k-means machine-learning pca python regression supervised-learning svm text-learning udacity udacity-android-nanodegree udacity-machine-learning-nanodegree unsupervised-learning validation
Language:DIGITAL Command Language