mihaibogdan10 / json-reuters-21578

Reuters 21578 dataset in json and sgm format, and the conversion script.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

json-reuters-21578

Reuters 21578 dataset in json and sgm format, and the conversion script.

Uses BeautifulSoup for XML parsing:

pip install BeautifulSoup

The entire original data can be found in other-files and sgm-data. You can find the original archive on archive.ics.uci.edu

About

Reuters 21578 dataset in json and sgm format, and the conversion script.

License:MIT License


Languages

Language:Python 100.0%