David Sontag's starred repositories
llama_index
LlamaIndex is a data framework for your LLM applications
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
torchxrayvision
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
enformer-pytorch
Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch
cotrain-prompting
Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance
PheValuator
An R package for evaluating phenotype algorithms.
Twin_Causal_Nets
Estimating the probabilities of caution via deep monotonic twin networks
weaksup-subset-selection
Subset selection / data pruning for weak supervision
real-time-admissions
Code to accompany paper published in Nature Digital Medicine
parametric-robustness-evaluation
Code for paper "Evaluating Robustness to Dataset Shift via Parametric Robustness Sets"
large-scale-temporal-shift-study
Code for Large-Scale Study of Temporal Shift in Health Insurance Claims. Christina X Ji, Ahmed M Alaa, David Sontag. CHIL, 2023. https://arxiv.org/abs/2305.05087