Christos Ziakas's repositories

redeval

Red-teaming LLM applications.

Language:PythonLicense:Apache-2.0Stargazers:20Issues:2Issues:1

backbone-learn

A Library for Scaling Mixed-Integer Optimization-Based Machine Learning.

Language:PythonLicense:MITStargazers:11Issues:0Issues:0

deepeval

Evaluation and Unit Testing for LLMs

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

dt-distance

Calculate the structural distance between decision tree models

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License:Apache-2.0Stargazers:0Issues:0Issues:0

humaneval_sample_eval

This project evaluates OpenAI's GPT-3.5 model on a sample from the HumanEval dataset to assess its code generation capabilities. The implementation is built in a way that can easily integrate new models and datasets. Parameters such as sample size and the pass@k metric are configurable.

Language:PythonStargazers:0Issues:2Issues:0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

node-embeddings-eval

Evaluation protocol for graph embedding methods on link prediction, node classification, and node clustering

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

multimodal-rag-agent

A retrieval-augmented generative agent with access to image and text memories.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

selfcheckgpt

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

License:MITStargazers:0Issues:0Issues:0