Count IDs for each language
eklem opened this issue · comments
When crawling IDs, check the length of the stored array and console.log it. Then you'll somewhat know the quality of the stopword list if generating it on the basis of those IDs.
Crawler for NRK Sapmi news bulletins that will be the basis for Sami stopword lists and an example search engine for content in Sami.
eklem opened this issue · comments
When crawling IDs, check the length of the stored array and console.log it. Then you'll somewhat know the quality of the stopword list if generating it on the basis of those IDs.