isvilen / unicodedata

Access to the Unicode Character Database from Erlang

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Unicode support library

This Erlang library provides access to character properties defined in the Unicode Character Database and implements following algorithms specified in the Unicode 9.0.0 standard:

  • Default Case Algorithms specified in The Unicode Standard Section 3.13

    • Default Case Conversion
    • Default Case Folding
    • Default Caseless Matching
  • Unicode normalization specified in The Unicode Standard Annex #15

    • Normalization Form D
    • Normalization Form KD
    • Normalization Form C
    • Normalization Form KC
  • Unicode Text Segmentation specified in The Unicode Standard Annex #29

    • Grapheme Cluster Boundaries
    • Word Boundaries
    • Sentence Boundaries
  • Unicode Line Breaking Algorithm specified in The Unicode Standard Annex #14

  • Unicode Bidirectional Algorithm specified in The Unicode Standard Annex #9

Build

$ rebar3 compile

Documentation

Run rebar3 edoc and open generated doc/index.html file.

About

Access to the Unicode Character Database from Erlang

License:Other


Languages

Language:Erlang 100.0%