bigscience-workshop / data_sourcing

This directory gathers the tools developed by the Data Sourcing Working Group


BigScience Data Sourcing Code


First Sourcing Sprint: October 2021

The code for the input form can be found in sourcing_sprint/streamlit_form.py

The code for the exploration tool can be found in sourcing_sprint/streamlit_explore.py

The resource entries can be found in sourcing_sprint/resources (one folder per language, one .jsonl file per resource)
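The layout described above (one folder per language, one .jsonl file per resource) can be read with a few lines of Python. The following is a minimal sketch, not code from this repository: the function name `load_resources` is hypothetical, and it assumes each .jsonl file follows the usual JSON Lines convention of one JSON value per line. It makes no assumption about the fields inside each entry.

```python
import json
from pathlib import Path


def load_resources(root):
    """Return {language: {resource_name: [records]}} for a resources directory.

    Hypothetical helper: assumes one sub-folder per language and one
    JSON Lines file (*.jsonl) per resource inside each folder.
    """
    root = Path(root)
    catalogue = {}
    # Only sub-directories are treated as languages; stray files are ignored.
    for lang_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        entries = {}
        for path in sorted(lang_dir.glob("*.jsonl")):
            with path.open(encoding="utf-8") as handle:
                # One JSON value per non-empty line.
                entries[path.stem] = [
                    json.loads(line) for line in handle if line.strip()
                ]
        catalogue[lang_dir.name] = entries
    return catalogue


if __name__ == "__main__":
    # Path taken from the README; skipped quietly if run from elsewhere.
    resources_root = Path("sourcing_sprint/resources")
    if resources_root.is_dir():
        for language, entries in load_resources(resources_root).items():
            print(f"{language}: {len(entries)} resource file(s)")
```

Run from the repository root, this prints the number of resource files found per language folder. Keying entries by file stem is a design choice made for this sketch only.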

About


License: Apache License 2.0


Languages

Language: Python 100.0%