Jeroen Steggink's repositories
aop
AMQP on Pulsar protocol handler
collector-core
Collector-related code shared between different collector implementations
committer-core
Norconex Committer is a java library and command line application used to route content to local or remote target repositories, such as a search engine index.
committer-solr
Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.
CoreNLP
Stanford CoreNLP: A Java suite of core NLP tools.
ddth-queue
Library to interact with various queue implementations
elasticsearch-hadoop
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
hadoop-ceph
Implementation of Hadoop file system
lucene-solr
Mirror of Apache Lucene + Solr
pulsar-io-amqp-1-0
support sink/source for AMQP version 1.0.0
pulsar-spark
When Apache Pulsar meets Apache Spark
spark-corenlp
Stanford CoreNLP wrapper for Apache Spark
spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
spark-on-openshift
Spark operator deployment and usage on OpenShift
spark-solr
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
streaming-amqp
AMQP data source for dstream (Spark Streaming)
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.