Sujen Shah's repositories
nutch-rest-api-docs
Document of Apache Nutch Rest API
algorithms
Practice problems
document-relevancy
To retrieve a set of relevant documents from a corpus given a 'gold-standard' document.
goes-notify
A script that checks for a better Global Entry enrollment appointment time
grobid
A machine learning software for extracting information from scholarly documents
hysds-packer-templates
HySDS packer templates
icesat2_boreal
Biomass modeling and mapping of forest biomass in the boreal using NASA's ICESat-2
incubator-zeppelin
Mirror of Apache Zeppelin (Incubating)
maap-py
Python library for working with MAAP
memex-scripts
Scripts created to achieve tasks for the MEMEX program
nutchpy
For interacting with nutch via Python
oodt
Mirror of Apache OODT
reactive-facetsearch
A FacetView like application built using reactivesearch
ScoringModelGenerator
Contains the code to generate the model file used in the Nutch Similarity based scoring plugin
SIGIR-2016
Work for a paper submission at SIGIR - 2016
sparkler
Spark-Crawler : Evolving Apache Nutch to run on Spark.
tika
Mirror of Apache Tika