lazymike / BlackLab

A corpus retrieval engine based on Apache Lucene

Home Page:http://inl.github.io/BlackLab/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

What is BlackLab?

BlackLab is a corpus retrieval engine built on top of Apache Lucene. It allows fast, complex searches with accurate hit highlighting on large, tagged and annotated, bodies of text. It was developed at the Institute of Dutch Lexicology (INL) to provide a fast and feature-rich search interface on our historical and contemporary text corpora.

We're also working on BlackLab Server, a web service interface to BlackLab, so you can access it from any programming language. See the BETA version here: https://github.com/INL/BlackLab-server

BlackLab is licensed under the Apache License 2.0.

More information:

About

A corpus retrieval engine based on Apache Lucene

http://inl.github.io/BlackLab/


Languages

Language:Java 98.6%Language:C 1.0%Language:Shell 0.2%Language:HTML 0.2%Language:CSS 0.0%Language:Makefile 0.0%