Not Diamond's repositories
human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
Language:PythonMIT000
openai-python
The official Python library for the OpenAI API
Language:PythonApache-2.0000
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:PythonApache-2.0000
ppdeep
Pure-Python library for computing fuzzy hashes (ssdeep)
Language:Python000
PromptBreeder
Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.
Language:Python000