Andrew Evans's repositories
pentaho-kettle
Pentaho Data Integration (ETL), a.k.a. Kettle
threadpools
Rust thread pools for small-batch workloads, built with the standard library, crossbeam, and Tokio. Rayon can be used for large batches. Tokio is there so I don't need to rewrite code.
apm-server
APM Server
bastion
Highly-available Distributed Fault-tolerant Runtime
cdrs-async
Async Cassandra driver made in Rust
CreoNLPNERPDIPlugin
A plugin for entity recognition in Pentaho using CoreNLP
datadruidanalytics
Analytics repository for the website, backing the blog articles on dadruid.com
datasketch-rs
Port of datasketch to (hopefully) pure Rust
DeduplicationUtils
Python Utilities for creating deduplication tasks
JPostalPDIPLugin
A plugin for Pentaho that outputs the results of JPostal
LibphoneNumberServer
A server for running libphonenumber on Windows and Linux.
LibpostalAddressExpander
Libpostal address parser
LibPostalServer
A server with libpostal running in Docker for Windows and Linux users.
machine_learning
Machine learning code
NLPServer
An NLP server for speeding up the loading of Pentaho jobs in Pan and Spoon. Attempting to find a no/low-code solution to ETL.
node-wkhtmltopdf
A wrapper for the wkhtmltopdf HTML to PDF converter using WebKit
nucypher
A decentralized threshold cryptography network offering interfaces and runtimes for secrets management and dynamic access control.
PDINLPServerIntegration
Integrates the NLP Server in this repository with PDI.
PDINominatimGeocoder
A Pentaho 8.2+ plugin for geocoding data using Nominatim
PDIPhotonGeocoder
Geocoder for Photon which works on Windows.
PDISentTokenizer
A sentence tokenizer for Pentaho 8.2+
PDIStateEncoder
Encodes states to abbreviations and vice versa for Kettle and Spoon.
PDIStringDeduplicator
Simple string deduplication for those really nasty sources where info is repeated a bunch
PDITextTopicSplitter
Text topic splitter for Pentaho using C99 or another algorithm
pika
Pure Python RabbitMQ/AMQP 0-9-1 client library
redis-async-rs
A Rust client for Redis, using Tokio
scrapy-selenium
Scrapy middleware to handle JavaScript pages using Selenium