robbi5 / nomenklatura

Create lists of reference data interactively.

Home Page: http://opennames.org

nomenklatura

Nomenklatura de-duplicates and integrates different names for entities - people, organisations or public bodies - to help you clean up messy data and to find links between different datasets.

The service will create references for all entities mentioned in a source dataset. It then helps you to define which of these entities are duplicates and what the canonical name for a given entity should be. This information is available in data cleaning tools like OpenRefine or in custom data processing scripts, so that you can automatically apply existing mappings in the future.
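As a rough illustration of what "applying existing mappings" can look like in a custom processing script, the sketch below resolves aliases to canonical names with a plain in-memory dictionary. It does not call the nomenklatura API; the `ALIASES` table, the sample names and the helper functions are all hypothetical and stand in for mappings you would export from your own dataset.

```python
# Hypothetical alias table; in practice this would be exported from a
# nomenklatura dataset rather than hard-coded.
ALIASES = {
    "acme corp.": "ACME Corporation",
    "acme corporation": "ACME Corporation",
    "a.c.m.e.": "ACME Corporation",
}


def normalize(name):
    """Collapse whitespace and lowercase a name so lookups are forgiving."""
    return " ".join(name.split()).lower()


def canonical_name(name, aliases=ALIASES):
    """Return the canonical name for a known alias, or None if unmapped."""
    return aliases.get(normalize(name))


def apply_mappings(rows, field, aliases=ALIASES):
    """Yield copies of rows with `field` replaced by its canonical name.

    Rows whose value has no mapping are yielded unchanged, so they can
    be reviewed by hand later.
    """
    for row in rows:
        canonical = canonical_name(row[field], aliases)
        yield {**row, field: canonical} if canonical else dict(row)
```

Leaving unmapped values untouched, rather than guessing, keeps the script from silently merging entities that have not been reviewed yet.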

The focus of nomenklatura is on data integration. It does not provide further functionality with regard to the people and organisations that it helps to keep track of.

About this fork

This is a fork of pudo/nomenklatura. OpenNames.org, a publicly hosted instance of nomenklatura, was recently shut down because the project has taken a different direction. This fork tries to maintain a compatible version of nomenklatura that is usable as a drop-in replacement.

A Docker image is available as robbi5/nomenklatura in the Docker index.

Usage

If you want to deploy your own Nomenklatura instance, use the robbi5/nomenklatura Docker image:

docker pull robbi5/nomenklatura
docker run \
  -e DATABASE_URL=postgres://username:password@databasehost/nomenklatura \
  -e SECRET_KEY=something_unique_and_secret_here \
  -p 8080:8080 \
  robbi5/nomenklatura

Required environment variables:

DATABASE_URL: postgres://username:password@databasehost/nomenklatura
SECRET_KEY: something_unique_and_secret_here # used for the session cookie
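Before starting the container, it can help to check that both variables are set and that the database URL parses as expected. The following is a minimal pre-flight sketch, not part of nomenklatura itself; the function name `check_environment` and the shape of its return value are assumptions made for illustration.

```python
import os
from urllib.parse import urlsplit

# The two variables this README lists as required.
REQUIRED = ("DATABASE_URL", "SECRET_KEY")


def check_environment(env=None):
    """Raise if a required variable is missing; return parsed DB details.

    `env` defaults to the process environment and can be replaced with
    a plain dict, which makes the check easy to try out.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    url = urlsplit(env["DATABASE_URL"])
    # The password is deliberately left out of the returned summary.
    return {
        "scheme": url.scheme,
        "user": url.username,
        "host": url.hostname,
        "database": url.path.lstrip("/"),
    }
```

Called with the example values above, the summary would contain the host `databasehost` and the database name `nomenklatura`.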

Authentication

Register a new OAuth application on GitHub with the following parameters:

Then add these environment variables:

GITHUB_CLIENT_ID: your_client_id_from_github
GITHUB_CLIENT_SECRET: your_client_secret_from_github

Reverse Proxy

It is recommended that you deploy a reverse proxy such as nginx in front of your Nomenklatura instance, especially in production.

You should set the FORWARDED_ALLOW_IPS environment variable so that Nomenklatura recognizes your reverse proxy and uses the X-Forwarded-For and X-Forwarded-Scheme headers it sends. This lets Nomenklatura work correctly when your reverse proxy accepts HTTPS connections on its behalf.
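To make the idea concrete, here is a small sketch of how a server might decide whether to honour forwarded headers. It is an illustration only, not nomenklatura's actual code. It assumes that FORWARDED_ALLOW_IPS holds a comma-separated list of proxy addresses and that `*` means "trust any peer", following the convention of Gunicorn's setting of the same name; the helper names are hypothetical.

```python
def parse_allow_ips(value):
    """Split a comma-separated FORWARDED_ALLOW_IPS value into a set."""
    return {item.strip() for item in value.split(",") if item.strip()}


def resolve_client(remote_addr, headers, allow_ips):
    """Return (client_ip, scheme) for a request.

    Forwarded headers are honoured only when the directly connected
    peer (`remote_addr`) is a trusted proxy; otherwise they are ignored
    so that arbitrary clients cannot spoof their address or scheme.
    """
    trusted = "*" in allow_ips or remote_addr in allow_ips
    if not trusted:
        return remote_addr, "http"
    forwarded_for = headers.get("X-Forwarded-For", "")
    # The left-most entry is the original client in a proxy chain.
    client_ip = forwarded_for.split(",")[0].strip() or remote_addr
    scheme = headers.get("X-Forwarded-Scheme", "http")
    return client_ip, scheme
```

With the proxy's address listed in the allow set, a request that reached the proxy over HTTPS is reported as `https`, which is what keeps generated links and redirects on the secure scheme.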

Contact, contributions etc.

nomenklatura is developed with generous support by Knight-Mozilla OpenNews and the Open Knowledge Foundation Labs. The codebase is licensed under the terms of an MIT license (see LICENSE.md).

We welcome contributions, bug fixes and feature suggestions. Please use the GitHub issue tracker for this repository.
