syndacks / WikipediaSentences

Sentences scraped from Wikipedia featured articles


WikipediaSentences

Sentences scraped from Wikipedia featured articles.

To pull data from different Wikipedia articles, edit the validWiki.txt file: delete the titles of any articles you do not want to pull data from, and add the titles of the articles you are interested in.

Run the code in gatherWikiData.js to produce a text file called allWikipediaSentences.txt containing all of the content from the articles you listed.
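As a rough illustration of the two steps above, here is a minimal sketch of how a title list and article text might be processed. It is not taken from gatherWikiData.js: the helper names parseTitles and splitSentences are hypothetical, the one-title-per-line layout of validWiki.txt is an assumption, and the sentence splitter is deliberately naive. It also assumes a Node.js runtime, where the repository script would presumably be started with something like `node gatherWikiData.js`.

```javascript
// Hypothetical helpers for illustration only; not the code in gatherWikiData.js.

// ASSUMPTION: validWiki.txt holds one article title per line.
// Blank lines and surrounding whitespace are ignored.
function parseTitles(fileContents) {
  return fileContents
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line.length > 0);
}

// Naive sentence splitter: breaks after ".", "!" or "?" when followed by whitespace.
// It will mis-split abbreviations such as "Dr. Smith".
function splitSentences(text) {
  return text
    .split(/(?<=[.!?])\s+/)
    .map((sentence) => sentence.trim())
    .filter((sentence) => sentence.length > 0);
}

// Sample inputs standing in for validWiki.txt and a fetched article body.
const titles = parseTitles("Alan Turing\n\nAda Lovelace\n");
const sentences = splitSentences(
  "Turing was born in 1912. He studied at Cambridge."
);

console.log(titles);
console.log(sentences);
```

In a real run, the string passed to parseTitles would come from reading validWiki.txt, and the sentences would be written out one per line to allWikipediaSentences.txt.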


License: GNU Affero General Public License v3.0


Languages

Language: JavaScript 100.0%