Tim Allison (tballison)

Company: Rhapsode Consulting LLC

Home Page: https://mastodon.social/@tallison

Tim Allison's repositories

quaerite

Search relevance evaluation toolkit

Language: Java | License: NOASSERTION | Stargazers: 30 | Issues: 4 | Issues: 12

lucene-addons

Standalone versions of LUCENE_5205 and other patches: SpanQueryParser, Concordance and Co-occurrence stats

Language: Java | License: Apache-2.0 | Stargazers: 18 | Issues: 7 | Issues: 41

file-observatory

Single server/laptop grade file-observatory

Language: Java | License: Apache-2.0 | Stargazers: 9 | Issues: 6 | Issues: 8

rhapsode

Advanced desktop search/corpus exploration prototype

Language: Java | License: NOASSERTION | Stargazers: 7 | Issues: 2 | Issues: 0

tika-gui-v2

Unofficial user interface for Apache Tika

Language: HTML | License: Apache-2.0 | Stargazers: 6 | Issues: 3 | Issues: 71

SimpleCommonCrawlExtractor

Simple wrapper around IIPC Web Commons to take a literal warc.gz and extract standalone binaries

Language: Java | License: Apache-2.0 | Stargazers: 5 | Issues: 6 | Issues: 0

cord-19

Data munging for CORD-19

Language: Java | License: NOASSERTION | Stargazers: 3 | Issues: 2 | Issues: 0

mp4parser

A Java API to read, write and create MP4 files

Language: Java | License: Apache-2.0 | Stargazers: 2 | Issues: 2 | Issues: 0

share

Public share

chorus

Towards an open source stack for e-commerce search

Language: Ruby | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

hodgepodge

one off dev repo, very experimental

Language: HTML | Stargazers: 1 | Issues: 3 | Issues: 0
Language: Java | License: Apache-2.0 | Stargazers: 1 | Issues: 3 | Issues: 0

tika-addons

Addons not part of the official Tika release

AGPL

Repo of AGPL licensed code -- nothing in here is connected/related to anything outside of this repo

Stargazers: 0 | Issues: 0 | Issues: 0

droid

DROID (Digital Record and Object Identification)

Language: Java | License: BSD-3-Clause | Stargazers: 0 | Issues: 1 | Issues: 0

james-mime4j

Mirror of Apache James Mime4j

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

java-bplist

A Java library for reading Apple bplists, based on the work of

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

junrar

plain java unrar util (former sf project)

Language: Java | License: NOASSERTION | Stargazers: 0 | Issues: 3 | Issues: 0
Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 3 | Issues: 1

logging-log4j2

Apache Log4j 2 is an upgrade to Log4j that provides significant improvements over its predecessor, Log4j 1.x, and provides many of the improvements available in Logback while fixing some inherent problems in Logback's architecture.

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

lucene-solr

Mirror of Apache Lucene + Solr

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

metadata-extractor

Extracts Exif, IPTC, XMP, ICC and other metadata from image files

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 3 | Issues: 0

nanite

Nanite - a friendly swarm of format-identifying robots.

Language: Java | Stargazers: 0 | Issues: 2 | Issues: 0

opennlp

Mirror of Apache OpenNLP

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

parso

lightweight Java library designed to read SAS7BDAT datasets

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pdfbox

Mirror of Apache PDFBox

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

tabula-java

Extract tables from PDF files

Language: Java | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

tika-docker

Convenience Docker images for Apache Tika Server

Language: Shell | License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

xmpcore-shaded

Shaded version of Adobe's xmpcore to remove *.internal.* part of namespace

Language: Java | Stargazers: 0 | Issues: 2 | Issues: 0

yalder

Yet another language detector

Language: Java | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0