bigscience-workshop / data_tooling

Tools for managing datasets for governance and training.

Create dataset human_instructions_in_indonesian_extracted_from_wikihow

albertvillanova opened this issue

  • uid: human_instructions_in_indonesian_extracted_from_wikihow
  • type: primary
  • description:
  • languages:
    • language_names:
      • Indonesian
    • language_comments:
    • language_locations:
    • validated: False
  • custodian:
  • availability:
    • procurement:
      • for_download: No - but the current owners/custodians have contact information for data queries
      • download_url:
      • download_email:
    • licensing:
      • has_licenses: Yes
      • license_text:
      • license_properties:
      • license_list:
    • pii:
      • has_pii: Yes
      • generic_pii_likely:
      • generic_pii_list:
      • numeric_pii_likely:
      • numeric_pii_list:
      • sensitive_pii_likely:
      • sensitive_pii_list:
      • no_pii_justification_class:
      • no_pii_justification_text:
    • validated: False
  • source_category:
    • category_type: website
    • category_web:
    • category_media:
    • validated: False
  • media:
    • category:
      • text
    • text_format:
      • other
      • .ttl
    • audiovisual_format:
    • image_format:
    • database_format:
    • text_is_transcribed: No
    • instance_type:
    • instance_count:
    • instance_size:
    • validated: False
  • fname: human_instructions_in_indonesian_extracted_from_wikihow.json
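The entry above leaves every `validated` flag set to False. As a minimal sketch of how such an entry could be checked, the snippet below parses a JSON excerpt and lists the sections still awaiting validation. The JSON layout is an assumption inferred from the bullet nesting (the real catalogue file may be structured differently), and `unvalidated_sections` is a hypothetical helper, not part of data_tooling.

```python
import json

# Hypothetical excerpt of the catalogue entry. The nesting mirrors the
# bullet indentation in the issue and is an assumption, not the real file.
entry_json = """
{
  "uid": "human_instructions_in_indonesian_extracted_from_wikihow",
  "type": "primary",
  "languages": {"language_names": ["Indonesian"], "validated": false},
  "availability": {
    "licensing": {"has_licenses": "Yes"},
    "pii": {"has_pii": "Yes"},
    "validated": false
  },
  "source_category": {"category_type": "website", "validated": false},
  "media": {
    "category": ["text"],
    "text_format": ["other", ".ttl"],
    "validated": false
  }
}
"""


def unvalidated_sections(entry):
    """Return the names of top-level sections whose 'validated' flag is False."""
    return [
        name
        for name, section in entry.items()
        if isinstance(section, dict) and section.get("validated") is False
    ]


entry = json.loads(entry_json)
print(unvalidated_sections(entry))
```

Scalar fields such as `uid` and `type` carry no `validated` flag, so the helper skips them and only reports nested sections.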