onuf / uni-nlp-project

tokenization tool

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Tokenization

In this repository, you can find a set of tokenizers and tests for them. The tokenizers segment a given text into tokens of different types, e.g., alphabetic, numeric, alphanumeric sequences, punctuation.

About

tokenization tool


Languages

Language:Python 100.0%