Swoop's repositories
spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
fast_cache
Very fast in-process cache with least-recently-used (LRU) and time-to-live (TTL) expiration semantics.
composable_state_machine
Tiny state machine implementation with clean separation between transitions, transition logic & state management.
datascience
Data and code for our data science writing
kafka-graphite
Kafka Graphite Metrics Reporter
spark-infotheoretic-feature-selection
This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
activerecord-import
A library for bulk insertion of data into your database using ActiveRecord.
ActiveRecordExtended
Adds extra PostgreSQL functionality to an ActiveRecord / Rails application
ClickHouse
ClickHouse is a free analytic DBMS for big data.
databricks
Ruby gem wrapping the Databricks REST API
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
docker-zk-exhibitor
Docker definition for an Exhibitor-managed ZooKeeper instance
hoxy
Web-hacking proxy API for Node.js
langchain
⚡ Building applications with LLMs through composability ⚡
mapbox-gl-js
Interactive, thoroughly customizable maps in the browser, powered by vector tiles and WebGL
nodes
A library to implement asynchronous dependency graphs for services in Java
postgraphile-plugin-fulltext-filter
Full-text filtering in PostGraphile
statsd-client
StatsD client for Java
TopicModeling
Topic Modeling on Apache Spark