bigscience-workshop / data_tooling

Tools for managing datasets for governance and training.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Create dataset apple_insider_blog

albertvillanova opened this issue · comments

  • uid: apple_insider_blog
  • type: primary
  • description:
    • name: Apple Insider blog
    • description: AppleInsider is a news and rumor website that includes a forum for discussion of news stories and other community news. It has tons of resources, auctions, and news for anyone who wants to know about Apple. AppleInsider, as the name implies, has the inside scope about lots of different Apple-related info.
      AppleInsider launched in 1997 and quickly grew to become one of the Internet's premier sources of information for all things Apple. Each month, AppleInsider caters to several million unique visitors including consumers, engineers, bankers, and CEOs of Fortune 500 companies.
    • homepage: appleinsider.com
    • validated: True
  • languages:
    • language_names:
      • English
    • language_comments:
    • language_locations:
      • World-Wide
      • United States of America
    • validated: False
  • custodian:
  • availability:
    • procurement:
      • for_download: No - but the current owners/custodians have contact information for data queries
      • download_url:
      • download_email: news@appleinsider.com
    • licensing:
    • pii:
      • has_pii: Yes
      • generic_pii_likely: very likely
      • generic_pii_list:
        • names
        • email addresses
      • numeric_pii_likely: somewhat likely
      • numeric_pii_list:
        • telephone numbers
      • sensitive_pii_likely: unlikely
      • sensitive_pii_list:
        • racial or ethnic origin
      • no_pii_justification_class:
      • no_pii_justification_text:
    • validated: False
  • source_category:
    • category_type: website
    • category_web: news or magazine website
    • category_media:
    • validated: False
  • media:
    • category:
      • text
      • image
    • text_format:
      • .HTML
    • audiovisual_format:
    • image_format:
      • .JPG
    • database_format:
    • text_is_transcribed: No
    • instance_type: article
    • instance_count: 10K<n<100K
    • instance_size: 100<n<10,000
    • validated: False
  • fname: apple_insider_blog.json