Lorenzo Pacchiardi's repositories
LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
SM-ExpFam-LFI
Code for the paper: "Score Matched Conditional Exponential Families for Likelihood-Free Inference", https://jmlr.org/papers/v23/21-0061.html
GenBayes_LikelihoodFree_ScoringRules
Code for the paper "Generalized Bayesian Likelihood-Free Inference Using Scoring Rules Estimators"
SBI_gen_networks_SRs
Code for the paper: "Simulation-Based Inference with Generative Neural Networks via Scoring Rule Minimization"
ABC_model_choice
Simple python implementation of Approximate Bayesian Computation for Model Choice
abcpy
ABCpy package
Awesome-LLMs-Evaluation-Papers
The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
conformal_bayes
Conformal Bayes with importance sampling
deepspeed_llama
Finetuning LLaMA with DeepSpeed
emergent_analogies_LLM
Code for 'Emergent Analogical Reasoning in Large Language Models'
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
helm
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
KL-divergence-estimators
Testing methods for estimating KL-divergence from samples.
lorypack.github.io
My personal webpage
Mine_pytorch
MINE: Mutual Information Neural Estimation in pytorch
open-react-template
A free React / Next.js landing page template designed to showcase open source projects, SaaS products, online services, and more. Made by
prontoqa
Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.
py-irt
Bayesian IRT models in Python
pysteps
Python framework for short-term ensemble prediction systems.
recommend
recommendation system with python
sbibm
Simulation-based inference benchmark
tinyBenchmarks
Evaluating LLMs with fewer examples