BigScience Workshop's repositories
data_sourcing
This directory gathers the tools developed by the Data Sourcing Working Group
bigscience-workshop.github.io
Alternative to https://github.com/Dynalon/mdwiki-seed
amazon-sagemaker-mlflow-fargate
Managing your machine learning lifecycle with MLflow and Amazon SageMaker
scaling-laws-tokenization
scaling-laws-tokenization
codecarbon
Track emissions from Compute and recommend ways to reduce their impact on the environment.