Norconex

Norconex

Geek Repo

Location:Canada

Home Page:http://www.norconex.com

Github PK Tool:Github PK Tool

Norconex's repositories

crawlers

Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.

Language:JavaLicense:Apache-2.0Stargazers:179Issues:33Issues:818

importer

Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.

Language:JavaLicense:Apache-2.0Stargazers:32Issues:17Issues:110

collector-filesystem

Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.

commons-lang

Generic library shared between several projects.

Language:JavaLicense:Apache-2.0Stargazers:12Issues:10Issues:12

committer-elasticsearch

Implementation of Norconex Committer for Elasticsearch.

Language:JavaLicense:Apache-2.0Stargazers:10Issues:11Issues:44

collector-core

Collector-related code shared between different collector implementations

Language:JavaLicense:Apache-2.0Stargazers:7Issues:11Issues:24

commons-wicket

Generic Wicket components and utilities.

committer-core

Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.

Language:JavaLicense:Apache-2.0Stargazers:4Issues:10Issues:19

committer-idol

Autonomy IDOL implementation of Norconex Committer.

Language:JavaLicense:Apache-2.0Stargazers:4Issues:12Issues:2

jef

Job Execution Framework.

Language:JavaLicense:Apache-2.0Stargazers:4Issues:11Issues:14

jef-monitor

Web-based application for monitoring jobs progress (created with JEF).

committer-solr

Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.

Language:JavaLicense:Apache-2.0Stargazers:3Issues:10Issues:24

committer-neo4j

Implementation of Norconex Committer for Neo4j.

Language:JavaLicense:Apache-2.0Stargazers:2Issues:3Issues:6

committer-azuresearch

Implementation of Norconex Committer for Microsoft Azure Search.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:2Issues:5

committer-sql

Implementation of Norconex Committer for SQL (JDBC) databases.

Language:JavaLicense:Apache-2.0Stargazers:1Issues:2Issues:12

language-detection

Fork of the Shuyo "language-detection" project hosted on Google Code. Original project web site: https://code.google.com/p/language-detection/

Language:JavaStargazers:1Issues:2Issues:0

committer-cloudsearch

Amazon CloudSearch implementation of Norconex Committer.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:8Issues:2

committer-gsa

Google Search Appliance implementation of Norconex Committer.

Language:JavaStargazers:0Issues:11Issues:0

commons-maven-parent

Maven parent POM for many Norconex Maven projects.

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:2Issues:1

language-detector

Detects languages out of any text (50+ languages supported).

Language:JavaStargazers:0Issues:2Issues:0

liresolr

Putting LIRE into Solr - an ongoing project

Language:JavaLicense:GPL-2.0Stargazers:0Issues:2Issues:0

tika

Mirror of Apache Tika

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0