mlfoundations

mlfoundations

Organization data from Github https://github.com/mlfoundations

Home Page:https://people.csail.mit.edu/ludwigs/

GitHub:@mlfoundations

mlfoundations's repositories

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:12607Issues:83Issues:539

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:4008Issues:48Issues:177

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:1274Issues:37Issues:71

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

datacomp

DataComp: In search of the next generation of multimodal datasets

Language:PythonLicense:NOASSERTIONStargazers:693Issues:17Issues:67

wise-ft

Robust fine-tuning of zero-shot models

Language:PythonLicense:NOASSERTIONStargazers:691Issues:6Issues:27

open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:492Issues:20Issues:67

task_vectors

Editing Models with Task Arithmetic

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Language:PythonLicense:MITStargazers:455Issues:10Issues:18

evalchemy

Automatic evals for LLMs

open-diffusion

Simple large-scale training of stable diffusion with multi-node support.

scaling

Language models scale reliably with over-training and on downstream tasks

Language:Jupyter NotebookLicense:MITStargazers:96Issues:8Issues:4

patching

Patching open-vocabulary models by interpolating weights

Language:PythonLicense:MITStargazers:91Issues:6Issues:4

tableshift

A benchmark for distribution shift in tabular data

Language:PythonLicense:MITStargazers:52Issues:7Issues:11

imagenet-captions

Release of ImageNet-Captions

rtfm

Research on Tabular Foundation Models

Language:PythonLicense:MITStargazers:44Issues:6Issues:15

tabliblib

A Python library for processing and filtering TabLib

Language:PythonLicense:MITStargazers:11Issues:6Issues:0
Language:Jupyter NotebookLicense:MITStargazers:7Issues:2Issues:1

webdataset-resharder

Efficiently process webdatasets

Language:CSSLicense:MITStargazers:3Issues:7Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1Issues:3Issues:0

llm-foundry

LLM training code for MosaicML foundation models

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0
Stargazers:0Issues:5Issues:0

MixEval

The official evaluation suite and dynamic data release for MixEval.

Language:PythonStargazers:0Issues:0Issues:0