NTBTI / elasticsearch-entity-resolution

Elasticsearch entity resolution plugin based on Duke

Home Page:http://yannbrrd.github.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Build Status

elasticsearch-entity-resolution

This project is an interactive entity resolution plugin for Elasticsearch based on Duke. Basically, it uses [Bayesian probabilities] (http://en.wikipedia.org/wiki/Bayesian_probability) to compute probability. You can pretty much use it as an interactive deduplication engine.

To understand basics, go to Duke project documentation.

A list of [available comparators] (https://github.com/larsga/Duke/wiki/Comparator) is available here.

Documentation

FAQ

How to install

Configuring the plugin

Caveats

Going to indus mode

Run examples

Contact

Credits

What else ?

Licence

This project is licended under LGPLv3

Copyright (c) 2014 Yann Barraud

About

Elasticsearch entity resolution plugin based on Duke

http://yannbrrd.github.io

License:GNU Lesser General Public License v3.0


Languages

Language:Java 84.5%Language:Shell 15.5%