USC Information Retrieval & Data Science's repositories
dl4j-kerasimport-examples
This repository contains deeplearning4j examples for importing and making use of models trained in keras
hadoop-pot
A scalable Apache Hadoop-based implementation of the Pooled Time Series video similarity algorithm based on M. Ryoo et al paper CVPR 2015.
TextREST.jl
Language Detection REST Server using MIT Lincoln Lab’s Text.jl library
counterfeit-electronics-tesseract
Training Tesseract to better extract serial numbers from images of electronic items
nlp-datasets
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
nutch-analytics
Nutch Crawl Analysis - Spark based project
filetypeDetection
File Byte Histogram Machine learnig Classification
counterfeit-crawling
Focused Crawling and Evaluation of Counterfeit Electronics Sites
NN-fileTypeDetection
This repository contains files of generating tika neural network model using Theano. Ir provides a way for you to build Deep Neural Network and increase Tika's detection capability.
tika-dl4j-spark-imgrec
Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika
Tika-NER-Libraries
This is a d3 visualization benchmarking named entities extracted by NLTK and Standford Core NLP
TrojanFootball
Analyses athletes past performance and workload for a better training
Annotated-Semantic-Relationships-Datasets
Public and free annotated datasets of relationships between entities/nominals
loaded-language-linter
A small Node.JS library to detect loaded language.
parser-indexer
Metadata Parser and Solr Indexer. For Python equivalent, checkout https://github.com/USCDataScience/parser-indexer-py
PlanetaryIR
Information Retrieval for Planetary Science using DeepDive
ColumbiaImageSearch
Columbia Image Search tool for MEMEX
d3kit-timeline
A simple timeline component that labels do not overlap.
polar-domain-discovery
Domain Discovery on Polar Domain
scala-json-doclet
Scala Doclet that produces JSON output
tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.