A dataset scraped from Open Food Facts.
- off/ sources obtained from Open Food Facts
- data-fields.txt documentation of the fields for the product database
- categories.txt product taxonomy
- en.openfoodfacts.org.products.csv products (not versioned, available from OFF database)
- tsv/ cleaned data sets ready for use with d3.js
- categories.tsv list of all non-empty categories
- categories_taxonomy.tsv parent/child relations between categories
- products.tsv all 167300 products
- products_additives.tsv product/additive relations
- products_categories_full.tsv product/category relations (all categories, including parents, listed for all products)
- products_categories_min.tsv product/category relations (only most specific categories)
- products_countries.tsv product/country relations
- products_labels.tsv product/label relations
- viz/
- cookies.html sample visualisation of elements from the cookies category generated using d3.js
- vendor/ vendored libraries
- d3
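
As a rough illustration of how products_categories_min.tsv and products_categories_full.tsv relate, the sketch below expands a product's most specific categories into the full list by walking up the parent/child taxonomy. The column layout of categories_taxonomy.tsv is not documented here, so the code works on in-memory (parent, child) pairs; the function names and the category names are made up for the example.

```python
from collections import defaultdict


def build_parents(edges):
    """Map each child category to the set of its direct parents."""
    parents = defaultdict(set)
    for parent, child in edges:
        parents[child].add(parent)
    return parents


def expand_categories(specific, parents):
    """Return the given categories plus all of their ancestors."""
    seen = set()
    stack = list(specific)
    while stack:
        category = stack.pop()
        if category in seen:
            continue
        seen.add(category)
        # Walk up to every direct parent of this category.
        stack.extend(parents.get(category, ()))
    return seen


# Hypothetical taxonomy edges (parent, child), for illustration only.
edges = [
    ("snacks", "biscuits"),
    ("biscuits", "cookies"),
    ("breakfasts", "breakfast-cereals"),
    ("breakfast-cereals", "mueslis"),
]
parents = build_parents(edges)
full = expand_categories({"cookies"}, parents)
print(sorted(full))
```

In this toy taxonomy, a product listed only under "cookies" in the min file would appear under "cookies" and both of its ancestors in the full file.
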
The tsv folder also includes restrictions of the whole products.tsv dataset to specific categories:
file | # products |
---|---|
products_only_cookies.tsv | |
products_only_mueslis.tsv | |
products_only_breakfast-cereals.tsv | |
products_only_breads.tsv | |
products_only_fruits-based-foods.tsv | |
products.tsv | 167300 |
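
The product counts for the files listed above can be recomputed from the files themselves. This is a minimal sketch, assuming each TSV file starts with a single header row and holds one product per line after it (neither is stated here). `count_products` is a hypothetical helper, not part of this repository, and the demo column names are invented.

```python
import csv
import os
import tempfile


def count_products(path):
    """Count non-empty data rows in a TSV file, skipping the header row."""
    with open(path, newline="", encoding="utf-8") as handle:
        # QUOTE_NONE keeps stray quote characters in product names literal.
        reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        next(reader, None)  # assumed header row
        return sum(1 for row in reader if row)


# Demo on a tiny temporary file; the column names are invented.
with tempfile.TemporaryDirectory() as tmp:
    sample = os.path.join(tmp, "sample.tsv")
    with open(sample, "w", encoding="utf-8", newline="") as handle:
        handle.write('code\tproduct_name\n1\tCookie A\n2\tCookie "B"\n')
    demo_count = count_products(sample)

print(demo_count)
```

For the real data, the same function could be called on each file in the tsv/ directory, for example `count_products("tsv/products_only_cookies.tsv")`.
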
Files in the off/ directory come from the Open Food Facts project and are licenced under the Open Database License.
Files in the tsv/ directory are extracts of the OFF database and, as such, are made available here under the Open Database License.