Amazon Science's repositories
chronos-forecasting
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
RAGChecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
carbon-assessment-with-ml
CaML: Carbon Footprinting of Household Products with Zero-Shot Semantic Text Similarity
llm-code-preference
Training and Benchmarking LLMs for Code Preference.
factual-confidence-of-llms
Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"
synthesizrr
Synthesizing realistic and diverse text-datasets from augmented LLMs
llm-rank-pruning
LLM-Rank: A graph theoretical approach to structured pruning of large language models based on weighted Page Rank centrality as introduced by the related paper.
job-posting-structure
Extract structured information from job postings.
snakes_and_ladders_adapting_the_surface_code_to_defects
Tools to build STIM circuits for the rotated surface code in presence of non-operational qubits and gates.
LatticeAlgorithms.jl
Algorithms to solve lattice problems in Julia
entity-salience-short-documents
A dataset for evaluating entity salience prediction on extremely short documents
BeyondCorrelation
Implementation of the paper: Beyond Correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge
PIXELS
Official implementation for the AAAI2025 paper "PIXELS - Progressive Image Xemplar-based Editing with Latent Surgery"