Swoop's repositories
spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
fast_cache
Very fast in-process cache with least-recently-used (LRU) and time-to-live (TTL) expiration semantics.
composable_state_machine
Tiny state machine implementation with clean separation between transitions, transition logic & state management.
datascience
Data and code for our data science writing
kafka-graphite
Kafka Graphite Metrics Reporter
spark-infotheoretic-feature-selection
This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
activerecord-import
A library for bulk insertion of data into your database using ActiveRecord.
ActiveRecordExtended
Adds extra PostgreSQL functionality to an ActiveRecord / Rails application
ClickHouse
ClickHouse is a free analytic DBMS for big data.
databricks
Ruby gem wrapping the Databricks REST API
docker-zk-exhibitor
Docker definition for an Exhibitor-managed ZooKeeper instance
langchain
⚡ Building applications with LLMs through composability ⚡
mapbox-gl-js
Interactive, thoroughly customizable maps in the browser, powered by vector tiles and WebGL
postgraphile-plugin-fulltext-filter
Full-text filtering in PostGraphile
statsd-client
StatsD client for Java
TopicModeling
Topic Modeling on Apache Spark