jeancsil / cli-lucene-analyzers

CLI for seeing how text is analyzed by diverse lucene analyzers

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CLI to analyze text with Lucene/Solr/Elasticsearch

Usage

make build
java -jar build/libs/lucene-analyzers-1.0-all.jar  "your text 日本"
Script    Bytes                                                       Text
Latin     [79 6f 75 72]                                               your
Latin     [74 65 78 74]                                               text
Jpan      [e6 97 a5 e6 9c ac]                                         日本

Ideas

This lib in alpha version, only one analyzer is supported but this will change soon. The current supported analyzer is ICUAnalyzer and it can be found in the package com.jeancsil.cli.analyzers

I want to add a new parameter where the user can select the analyzer it wants.

Contact me: me@jeancsil.com

About

CLI for seeing how text is analyzed by diverse lucene analyzers


Languages

Language:Java 98.7%Language:Makefile 1.3%