Christos Ziakas's repositories

redeval

Red-teaming LLM applications.

Language: Python | License: Apache-2.0 | 20 2 1

backbone-learn

A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.

Language: Python | License: MIT | 1100

deepeval

Evaluation and Unit Testing for LLMs

Language: Python | License: Apache-2.0 | 100

dt-distance

Calculate the structural distance between decision tree models

Language: Jupyter Notebook | License: MIT | 000

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License: Apache-2.0 | 000

This project evaluates OpenAI's GPT-3.5 model on a sample from the HumanEval dataset to assess its code generation capabilities. The implementation is designed so that new models and datasets can be integrated easily. Parameters such as the sample size and the pass@k metric are configurable.

Language: Python | 020
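
For readers unfamiliar with the pass@k metric mentioned in the description above, here is a minimal sketch of the unbiased estimator commonly used with HumanEval: for a problem with n generated samples of which c pass the unit tests, pass@k is estimated as 1 - C(n-c, k) / C(n, k). The function names `pass_at_k` and `mean_pass_at_k` are hypothetical and chosen for illustration only; this is not taken from the repository, whose actual implementation may differ.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem.

    n: number of samples generated for the problem
    c: number of those samples that passed the tests
    k: number of samples the metric is allowed to draw
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # comb(n - c, k) counts the size-k subsets containing no correct sample.
    # It is 0 when n - c < k, so the estimate is then exactly 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over problems, given (n, c) pairs per problem."""
    if not results:
        raise ValueError("results must not be empty")
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

As a usage example, `pass_at_k(10, 3, 1)` is approximately 0.3, the fraction of correct samples, while `pass_at_k(5, 1, 5)` is 1.0 because drawing all five samples must include the correct one.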