There are 6 repositories under the data-infrastructure topic.
Postgres operator creates and manages PostgreSQL clusters running in Kubernetes
Preswald is a WASM packager for Python-based interactive data apps: bundle complete, complex data workflows, particularly visualizations, into single files that run entirely in the browser using Pyodide, DuckDB, Pandas, Plotly, Matplotlib, and more. Build dashboards, reports, and notebooks that run offline, load fast, and share like a document.
Production PostgreSQL for Kubernetes, from high availability Postgres clusters to full-scale database-as-a-service.
Data transformation framework for AI. Ultra performant, with incremental processing.
TensorBase is a new big data warehouse built with modern efforts.
A distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues
A battle-tested, flexible & comprehensive monitoring solution for your PostgreSQL databases
Python function to stream unzip all the files in a ZIP archive on the fly
Python server to on-the-fly extract and serve vector tiles from an mbtiles file on S3
Python function to construct a ZIP archive on the fly
The Data Engineering Book - a data engineering book by Thai people, for Thai people
Build & Learn Data Engineering, Machine Learning over Kubernetes. No-shortcut approach.
Spawns JupyterHub single user servers in Docker containers running in AWS Fargate
An open source data analysis platform with features for users with a range of technical skills
Python utility function to ingest data into a SQLAlchemy-defined PostgreSQL table
A DNS proxy server that conditionally rewrites and filters A record requests
Kanadi is a Nakadi client for Scala
Python package to parse Companies House accounts data in a streaming way
Python function to extract rows from a SQLite file while iterating over its bytes
A generic data pipeline that maps Elasticsearch documents to BigQuery table rows
Python utility function to convert an iterable of bytes or str to a readable file-like object
Information relating to topics on Data Engineering, Data Infrastructure, Data Storing, Data Warehouses and Business Analysis. For those interested in both conceptual theory and use case examples for database design and development.
Service for sharing user consent to cookies across multiple domains
Stateless JWT authentication in front of PostgreSQL
Python PostgreSQL adapter to stream results of multi-statement queries without a server-side cursor
Python context manager to communicate with a subprocess using iterables: for when data is too big to fit in memory and has to be streamed
An extension for Google Chrome that crawls a website for cookies and fingerprinting behaviour
Collections of POC/dev data infrastructure. | #SE