This repository contains:
- Data mining scripts that are used to categorize documents based on words they contain
- Web scraping scripts that are used to connect to external API and scrape the content of some webpages
- Data filtering scripts that are used to filter out words that are not needed for the project