David Sontag (dsontag)

Company:MIT

Location:Cambridge, MA

Home Page:http://people.csail.mit.edu/dsontag/


Organizations
clinicalml

David Sontag's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language:Python | License:MIT | Stargazers:84493 | Issues:651 | Issues:6701

llama

Inference code for LLaMA models

Language:Python | License:NOASSERTION | Stargazers:50895 | Issues:499 | Issues:872

llama_index

LlamaIndex is a data framework for your LLM applications

Language:Python | License:MIT | Stargazers:31501 | Issues:225 | Issues:3836

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:Python | License:Apache-2.0 | Stargazers:13994 | Issues:104 | Issues:891

allennlp

An open-source NLP research library, built on PyTorch.

Language:Python | License:Apache-2.0 | Stargazers:11699 | Issues:279 | Issues:2557

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:Python | License:MIT | Stargazers:11259 | Issues:113 | Issues:444

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:Python | License:Apache-2.0 | Stargazers:10170 | Issues:188 | Issues:2073

composer

Supercharge Your Model Training

Language:Python | License:Apache-2.0 | Stargazers:5014 | Issues:51 | Issues:521

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:Python | License:MIT | Stargazers:4336 | Issues:49 | Issues:282

alpa

Training and serving large-scale neural networks with auto parallelization.

Language:Python | License:Apache-2.0 | Stargazers:2990 | Issues:46 | Issues:295

Language:Python | License:Apache-2.0 | Stargazers:2510 | Issues:39 | Issues:134

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:Python | License:Apache-2.0 | Stargazers:2095 | Issues:26 | Issues:54

NeuroNER

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

Language:Python | License:MIT | Stargazers:1679 | Issues:79 | Issues:151

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:Python | License:MIT | Stargazers:1031 | Issues:20 | Issues:78

torchxrayvision

TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.

Language:Jupyter Notebook | License:Apache-2.0 | Stargazers:842 | Issues:18 | Issues:71

truss

The simplest way to serve AI/ML models in production

Language:Python | License:MIT | Stargazers:839 | Issues:11 | Issues:111

enformer-pytorch

Implementation of Enformer, DeepMind's attention network for predicting gene expression, in PyTorch

Language:Python | License:MIT | Stargazers:392 | Issues:17 | Issues:34

OMOP2OBO

OMOP2OBO: A Python Library for mapping OMOP standardized clinical terminologies to Open Biomedical Ontologies

Language:Jupyter Notebook | License:MIT | Stargazers:79 | Issues:10 | Issues:46

LiST

Lite Self-Training

Language:Python | License:MIT | Stargazers:29 | Issues:6 | Issues:3

primock57

Dataset of 57 mock medical primary care consultations: audio, consultation notes, human utterance-level transcripts.

Language:Python | License:NOASSERTION | Stargazers:28 | Issues:30 | Issues:0

ETAB

A Benchmark Suite for Visual Representation Learning in Echocardiography

Language:Jupyter Notebook | Stargazers:21 | Issues:3 | Issues:2

dataset

Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits

Language:Jupyter Notebook | Stargazers:17 | Issues:0 | Issues:1

PheValuator

An R package for evaluating phenotype algorithms.

ehr_ml

Code for doing machine learning with various EHRs

Language:C++ | License:MIT | Stargazers:17 | Issues:8 | Issues:22

Twin_Causal_Nets

Estimating the probabilities of causation via deep monotonic twin networks

Language:Python | License:MIT | Stargazers:17 | Issues:0 | Issues:0

cotrain-prompting

Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance

Language:Python | License:MIT | Stargazers:16 | Issues:17 | Issues:0

weaksup-subset-selection

Subset selection / data pruning for weak supervision

Language:Python | License:MIT | Stargazers:13 | Issues:0 | Issues:0

real-time-admissions

Code to accompany paper published in Nature Digital Medicine

Language:R | License:BSD-3-Clause | Stargazers:8 | Issues:0 | Issues:0

parametric-robustness-evaluation

Code for paper "Evaluating Robustness to Dataset Shift via Parametric Robustness Sets"

Language:Python | License:MIT | Stargazers:5 | Issues:8 | Issues:0

large-scale-temporal-shift-study

Code for Large-Scale Study of Temporal Shift in Health Insurance Claims. Christina X Ji, Ahmed M Alaa, David Sontag. CHIL, 2023. https://arxiv.org/abs/2305.05087