fooger / elasticsearch-analysis-morphology

Morphological Analysis Plugin for ElasticSearch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Morphological Analysis Plugin for ElasticSearch

The Morphological Analysis plugin integrates Russian and English morphology for java and lucene framework into elasticsearch. This plugin adds two new analyzers: “russian_morphology” and “english_morphology” and two token filters with the same names.

The demo.sh file shows a few examples of the analyzers behavior.

Switching to Hunspell

For Elasticsearch version 6.0 and above there is an officially supported hunspell token filter with russian dictionaries. But in my opinion it behaves much poorer than this plugin because of limited dictionary and no predefined behavior on unknown words.

Building
For building use latest gradle 6.2+ (https://gradle.org/install/#manually) and run

gradle build
.
Java SDK 13+ required.

Compatibility

Plugin is avaliable only for Elasticsearch 7.6.0.

Morphological Analysis Plugin Elasticsearch URL
7.6.0 7.6.0 build yourself: gradlew build

Installation

In order to install the plugin, simply run the following command in the elasticsearch home directory:


bin/elasticsearch-plugin install file:///C:\PROJECTS\elasticsearch-analysis-morphology\analysis-morphology-7.6.0.zip

where zip_file_path is full path to zip archive with builded plugin.
Also, if you not know how build the plugin, the compiled version is added (analysis-morphology-7.6.0.zip)

P.S. I am not java developer, and I do not know about any errors. I am not tested yet the plugin, only install.

About

Morphological Analysis Plugin for ElasticSearch

License:Apache License 2.0


Languages

Language:Java 62.1%Language:Shell 37.9%