Open Semantic Search's repositories
open-semantic-search
Open source research tool to search, browse, analyze and explore large document collections with a semantic search engine and an open source text mining & text analytics platform (integrates ETL for document processing, OCR for images & PDFs, named entity recognition for persons, organizations & locations, metadata management by thesauri & ontologies, and a search user interface & search apps for full-text search, faceted search & knowledge graph)
open-semantic-etl
Python-based open source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (entity extraction & named entity recognition) & data enrichment (annotation) pipelines, plus an ingestor to a Solr or Elasticsearch index & a linked data graph database
open-semantic-entity-search-api
Open source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of entities like persons, organizations and places for (semi)automatic semantic tagging & analysis of documents using a linked data knowledge graph such as a SKOS thesaurus, RDF ontology, database(s) or list(s) of names
open-semantic-search-apps
Python/Django-based web apps and web user interfaces for search, structure (metadata management like thesauri, ontologies, annotations and named entities) and data import (ETL like text extraction, OCR and crawling filesystems or websites)
open-semantic-visual-graph-explorer
Open Semantic Visual Linked Data Graph Explorer: open source tool (web app) and user interface (UI) for discovery, exploration and visualization of direct and indirect connections between named entities like persons, organizations, locations & concepts from a thesaurus or ontologies within your documents and knowledge graph
solr-ontology-tagger
Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri
solr-php-ui
Solr client and user interface for search
solr-relevance-ranking-analysis
Solr Relevance Ranking Analysis and Visualization Tool
open-semantic-search-appliance
Open Semantic Search Appliance (VM)
solr-synonames
Import synonames (multilingual variants of first names from Wikidata) to Solr managed synonyms graph
spacy-services.deb
Debian & Ubuntu package for REST microservices for spaCy natural language processing and machine learning framework for named entity recognition
tesseract-ocr-cache
Tesseract OCR wrapper for Apache Tika and/or Open Semantic ETL that caches the OCR results, so Tika-Server or Open Semantic ETL does not have to repeat slow and expensive OCR on the same images
tika-server.deb
Apache Tika Server as Debian GNU/Linux and Ubuntu Linux package
open-semantic-etl-filemonitoring-remote
Filesystem monitoring via inotify that immediately indexes new or changed files through a remote API on a remote search server
tika-python.deb
tika-python as Debian GNU/Linux and Ubuntu Linux package
spacy-services
💫 REST microservices for various spaCy-related tasks
aleph-elasticsearch
Custom Elasticsearch configuration for Aleph, from which we use the synonames-based synonym config (name part aliases extracted from Wikidata)