David Sontag (dsontag)

dsontag

Geek Repo

Company:MIT

Location:Cambridge, MA

Home Page:http://people.csail.mit.edu/dsontag/

Github PK Tool:Github PK Tool


Organizations
clinicalml

David Sontag's starred repositories

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:92510Issues:679Issues:7601

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55545Issues:518Issues:959

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:35525Issues:246Issues:5109

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:16901Issues:139Issues:722

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15891Issues:106Issues:1028

allennlp

An open-source NLP research library, built on PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:11735Issues:280Issues:2557

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11558Issues:202Issues:2235

composer

Supercharge Your Model Training

Language:PythonLicense:Apache-2.0Stargazers:5120Issues:49Issues:541

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4448Issues:49Issues:289

alpa

Training and serving large-scale neural networks with auto parallelization.

Language:PythonLicense:Apache-2.0Stargazers:3047Issues:45Issues:296
Language:PythonLicense:Apache-2.0Stargazers:2640Issues:36Issues:140

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2175Issues:25Issues:56

NeuroNER

Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.

Language:PythonLicense:MITStargazers:1690Issues:79Issues:151

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonLicense:MITStargazers:1183Issues:21Issues:87

torchxrayvision

TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:894Issues:18Issues:76

truss

The simplest way to serve AI/ML models in production

Language:PythonLicense:MITStargazers:882Issues:15Issues:121

enformer-pytorch

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

Language:PythonLicense:MITStargazers:415Issues:17Issues:36

OMOP2OBO

OMOP2OBO: A Python Library for mapping OMOP standardized clinical terminologies to Open Biomedical Ontologies

Language:Jupyter NotebookLicense:MITStargazers:83Issues:9Issues:46

primock57

Dataset of 57 mock medical primary care consultations: audio, consultation notes, human utterance-level transcripts.

Language:PythonLicense:NOASSERTIONStargazers:34Issues:30Issues:0

LiST

Lite Self-Training

Language:PythonLicense:MITStargazers:29Issues:6Issues:3

ETAB

[ NeurIPS 2022 ] Official Codebase for "ETAB: A Benchmark Suite for Visual Representation Learning in Echocardiography"

Language:Jupyter NotebookStargazers:25Issues:3Issues:2

ehr_ml

Code for doing machine learning with various EHRs

Language:C++License:MITStargazers:21Issues:8Issues:22

cotrain-prompting

Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance

Language:PythonLicense:MITStargazers:17Issues:18Issues:0

dataset

Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits

Language:Jupyter NotebookStargazers:17Issues:0Issues:1

PheValuator

An R package for evaluating phenotype algorithms.

Twin_Causal_Nets

Estimating the probabilities of caution via deep monotonic twin networks

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

weaksup-subset-selection

Subset selection / data pruning for weak supervision

Language:PythonLicense:MITStargazers:14Issues:1Issues:0

real-time-admissions

Code to accompany paper published in Nature Digital Medicine

Language:RLicense:BSD-3-ClauseStargazers:8Issues:0Issues:0

parametric-robustness-evaluation

Code for paper "Evaluating Robustness to Dataset Shift via Parametric Robustness Sets"

Language:PythonLicense:MITStargazers:7Issues:7Issues:0

large-scale-temporal-shift-study

Code for Large-Scale Study of Temporal Shift in Health Insurance Claims. Christina X Ji, Ahmed M Alaa, David Sontag. CHIL, 2023. https://arxiv.org/abs/2305.05087