Unicode Text Segmentation in V
This vlang package implements Unicode Text Segmentation according to Unicode Standard Annex #29.
Status
- Grapheme cluster boundaries: supported and passed official break tests.
- Word boundaries: supported and passed official break tests.
- Sentence boundaries: supported and passed official break tests.
Installation
v install magic003.uniseg
Examples
Check out the examples
folder.
Documentation
Refer to http://magic003.github.io/uniseg for the documentation.
References
- Unicode Standard Annex #29.
- The design and implementation of this library is heavily influenced by uniseg in Go and unicode-segmentation in Rust.