eklem / nrk-sapmi-crawler

Crawler for NRK Sapmi news bulletins that will be the basis for Sami stopword lists and an example search engine for content in Sami.

Count IDs for each language

eklem opened this issue

When crawling IDs, check the length of the stored array for each language and console.log it. That count gives a rough idea of the quality of a stopword list generated from those IDs.
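A minimal sketch of what this could look like, assuming the crawled IDs are held in an object keyed by language code. The function name `countIdsPerLanguage`, the `idsByLanguage` shape, the language codes and the sample IDs are all hypothetical placeholders, not taken from the crawler itself.

```javascript
// Hypothetical storage shape: language code -> array of crawled bulletin IDs.
// The crawler's real data structure is not shown in this issue.
function countIdsPerLanguage(idsByLanguage) {
  const counts = {};
  for (const [language, ids] of Object.entries(idsByLanguage)) {
    // The length of the stored array is the number of IDs crawled so far.
    counts[language] = ids.length;
  }
  return counts;
}

// Made-up sample data, for illustration only.
const exampleIds = {
  sme: ['id-1', 'id-2', 'id-3'],
  sma: ['id-4'],
  smj: [],
};

const counts = countIdsPerLanguage(exampleIds);
for (const [language, count] of Object.entries(counts)) {
  console.log(`${language}: ${count} IDs`);
}
```

Returning the counts as an object, rather than only logging them, leaves room to compare languages or skip stopword generation when a count is very low.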