Todo:
- Exclude punctuation from highlighting
- Include greek/latin prefix/root/suffix detection
- Handle whitespace and newlines correctly
- Improve text matching
- detect two-word pairs separated by space
- allow hyphenated terms
- detect word variants based on similar root