bigscience-workshop / data_tooling

Tools for managing datasets for governance and training.

Create dataset the_hill_newspaper_and_digital_media

albertvillanova opened this issue

  • uid: the_hill_newspaper_and_digital_media
  • type: primary
  • description:
    • name: The Hill Newspaper and Digital Media
    • description: The Hill is an American newspaper and digital media company based in Washington, D.C. that was founded in 1994. In 2020, it was the largest independent political news site in the United States.
      Focusing on politics, policy, business and international relations, The Hill's coverage includes the U.S. Congress, the presidency and executive branch, and election campaigns. The Hill describes its output as "nonpartisan reporting on the inner workings of Government and the nexus of politics and business". The company's primary outlet is TheHill.com. The Hill is additionally distributed in print for free around Washington, D.C. and distributed to all congressional offices.
    • homepage: thehill.com
    • validated: True
  • languages:
    • language_names:
      • English
    • language_comments:
    • language_locations:
      • Northern America
      • United States of America
    • validated: False
  • custodian:
  • availability:
    • procurement:
      • for_download: No - but the current owners/custodians have contact information for data queries
      • download_url:
      • download_email: contribute@changingamerica.com
    • licensing:
    • pii:
      • has_pii: Yes
      • generic_pii_likely: very likely
      • generic_pii_list:
        • names
        • email addresses
        • website account name or handle
      • numeric_pii_likely: somewhat likely
      • numeric_pii_list:
        • telephone numbers
      • sensitive_pii_likely: somewhat likely
      • sensitive_pii_list:
        • racial or ethnic origin
        • political opinions
        • religious or philosophical beliefs
        • health-related data
      • no_pii_justification_class:
      • no_pii_justification_text:
    • validated: False
  • source_category:
    • category_type: website
    • category_web: news or magazine website
    • category_media:
    • validated: False
  • media:
    • category:
      • text
      • image
    • text_format:
      • .HTML
      • .JS
    • audiovisual_format:
    • image_format:
      • .JPG
    • database_format:
    • text_is_transcribed: No
    • instance_type: article
    • instance_count: 100K<n<1M
    • instance_size: 100<n<10,000
    • validated: False
  • fname: the_hill_newspaper_and_digital_media.json
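
Several sections of this entry still carry `validated: False`. As a rough illustration of how such an entry might be checked once it is stored as JSON, the sketch below parses a shortened copy of the record and lists the sections awaiting validation. The JSON layout is an assumption that simply mirrors the bullet hierarchy above, and `unvalidated_sections` is a hypothetical helper, not part of data_tooling.

```python
import json

# Hypothetical excerpt of the entry; the real file layout is assumed to
# mirror the bullet hierarchy shown in the issue.
ENTRY_JSON = """
{
  "uid": "the_hill_newspaper_and_digital_media",
  "type": "primary",
  "description": {
    "name": "The Hill Newspaper and Digital Media",
    "homepage": "thehill.com",
    "validated": true
  },
  "languages": {"language_names": ["English"], "validated": false},
  "availability": {
    "pii": {
      "has_pii": "Yes",
      "generic_pii_list": ["names", "email addresses",
                           "website account name or handle"]
    },
    "validated": false
  },
  "source_category": {"category_type": "website", "validated": false},
  "media": {
    "category": ["text", "image"],
    "instance_count": "100K<n<1M",
    "validated": false
  }
}
"""


def unvalidated_sections(entry):
    """Return the names of top-level sections whose 'validated' flag is false."""
    return [
        name
        for name, section in entry.items()
        if isinstance(section, dict) and section.get("validated") is False
    ]


entry = json.loads(ENTRY_JSON)
pending = unvalidated_sections(entry)
print(pending)
```

Scalar fields such as `uid` and `type` are skipped because they are not dictionaries, and sections without a `validated` key are ignored rather than reported.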