chrisrzhou / react-wordcloud

☁️ Simple React + D3 wordcloud component with powerful features.

Home Page: https://react-wordcloud.netlify.app

Word cloud splits words on accent

AlexGibson12 opened this issue

If I include, for instance, the word "fettsäuren", the word cloud generates two words, "fetts" and "uren". I saw someone discussing a separate word cloud library who suggested this could have something to do with the regex used to split words?

@AlexGibson12 this library doesn't handle tokenization; it only renders the words you give it. To support Unicode letters in words, I use /[\p{L}']+/gu in my word-splitting code.
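
For anyone landing here, a minimal sketch of what that splitting step could look like, assuming you build the words array yourself before passing it to the component. The helper names `tokenize` and `toWords` are made up for illustration and are not part of react-wordcloud; the `{ text, value }` shape is what the `words` prop expects. The NFC normalization step is my own addition: `\p{L}` does not match combining marks, so an "ä" stored as "a" plus a combining diaeresis would still split the word unless it is composed first.

```javascript
// Hypothetical helper (not part of react-wordcloud): split text into words,
// treating any Unicode letter or apostrophe as part of a word.
function tokenize(text) {
  // Compose "a" + combining diaeresis into a single "ä" code point first,
  // because \p{L} matches letters but not combining marks.
  const normalized = text.normalize('NFC');
  // match() returns null when nothing matches, so fall back to an empty array.
  return normalized.match(/[\p{L}']+/gu) ?? [];
}

// Hypothetical helper: count lowercased tokens and shape them as
// { text, value } objects for the words prop.
function toWords(text) {
  const counts = new Map();
  for (const token of tokenize(text.toLowerCase())) {
    counts.set(token, (counts.get(token) ?? 0) + 1);
  }
  return Array.from(counts, ([text, value]) => ({ text, value }));
}

// For comparison: an ASCII-only pattern such as /\w+/g splits on the "ä",
// which reproduces the "fetts" / "uren" result from the report.
const asciiOnly = 'fettsäuren'.match(/\w+/g);
```

With this, `toWords('Fettsäuren sind Fettsäuren')` yields one entry for "fettsäuren" with a value of 2 and one for "sind" with a value of 1. Note that `\p{L}` and the `u` flag need an ES2018-capable runtime, and that this pattern drops digits and hyphens, so adjust the character class if your words contain them.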