aradzie / keybr.com

The smartest way to learn touch typing and improve your typing speed.

Home Page: https://www.keybr.com/

word "näo" doesn't exist in Portuguese

LucasRTonin opened this issue · comments

The correct spelling is "não", which is in the app. The app also has "nao" without the accent, which doesn't exist in Portuguese either.

I took the list of Portuguese words from here, and it includes the word "não". Unfortunately, almost every such word list you can find on the Internet is of low quality. That is why I usually build word lists myself. However, I need help from native speakers to remove bad, profane words because I don't understand the language.

You can help me to improve the Portuguese language in keybr. I could send you a link to a spreadsheet to review and remove inappropriate words. Ping me if you are willing to do this.

Thanks!

Sure, I can do that.

So, as promised, here is a spreadsheet. It has about 10,000 words that I obtained by scanning the movie subtitles database.

I passed it through a list of bad words that I found on the Internet. However, I don't think that list is complete, so manual work is still required.
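For illustration, the blocklist pass described above could look roughly like the sketch below. This is not the actual keybr tooling; the names `normalize` and `filter_words` are hypothetical, and it assumes both lists are plain lists of strings. It applies Unicode NFC normalization so that an accented word such as "não" compares equal whether it is stored precomposed or as a base letter plus a combining mark.

```python
import unicodedata


def normalize(word: str) -> str:
    """Trim, lowercase, and NFC-normalize a word (hypothetical helper)."""
    return unicodedata.normalize("NFC", word.strip().lower())


def filter_words(words, blocklist):
    """Return normalized words in original order, without blocked words or duplicates."""
    blocked = {normalize(w) for w in blocklist}
    seen = set()
    kept = []
    for word in words:
        n = normalize(word)
        # Skip empty entries, blocked words, and words already kept.
        if n and n not in blocked and n not in seen:
            seen.add(n)
            kept.append(n)
    return kept


if __name__ == "__main__":
    # "na\u0303o" is "não" written with a combining tilde; it collapses into "não".
    print(filter_words(["Não", "na\u0303o", "casa", "ruim"], ["ruim"]))
```

A filter like this only removes exact matches, so an incomplete blocklist still leaves work for a human reviewer.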

Please review the first 3000 words, remove any bad words, or words like "rape", "murder", etc.

Then I'll update the web site with the final list.

Thanks!

@aradzie I reviewed up to line 3045. I mostly removed words that are used in Portugal but not in Brazil. I also noticed that a lot of very common words are near the end, so the method you used does not gauge usage accurately, e.g.:
turista (tourist) is at line 9579
saliva (saliva) is at line 9461

Also, the list seems to be heavily drawn from PT-PT sources, but that shouldn't be an issue. I found no bad words. Maybe a list of words taken straight from an actual dictionary would work better.
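As a rough way to check where a word lands in a frequency ranking, a sketch like the following counts word occurrences in a set of text lines and orders them from most to least frequent. It is only an assumption about how such a ranking might be built, not the method actually used for the spreadsheet; `rank_words` and `WORD_RE` are hypothetical names, and the input is assumed to be already NFC-normalized text.

```python
import re
from collections import Counter

# Runs of letters only; digits, underscores, and punctuation are excluded.
WORD_RE = re.compile(r"[^\W\d_]+")


def rank_words(lines):
    """Return distinct lowercase words, most frequent first."""
    counts = Counter()
    for line in lines:
        counts.update(w.lower() for w in WORD_RE.findall(line))
    # most_common() sorts by count, descending; ties keep first-seen order.
    return [word for word, _ in counts.most_common()]


if __name__ == "__main__":
    ranking = rank_words(["não sei", "Não, turista", "não"])
    # 1-based position of a word in the ranking.
    print(ranking.index("turista") + 1)
```

The ranking reflects only the corpus it was built from, so a corpus of one regional variant or one genre can push everyday words far down the list.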

@LucasRTonin Thanks! I have updated the website with the new word list.

You were right, the words came from European Portuguese. Please excuse my ignorance; I thought the difference between Brazilian and European Portuguese was something like the difference between American and British English, not big enough to care about.

I hope that now we have a list of words that are common to both variants, so it satisfies everyone.