orthoxerox's starred repositories
seaweedfs
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
docker-bench-security
The Docker Bench for Security is a script that checks for dozens of common best-practices around deploying Docker containers in production.
ConsoleAppFramework
Zero Dependency, Zero Overhead, Zero Reflection, Zero Allocation, AOT Safe CLI Framework powered by C# Source Generator.
NonBlocking
Implementation of a lock-free dictionary on .Net.
circus-train
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
NanoGptDotnet
A miniature large language model (LLM) that generates shakespeare like text written in C#. Project meant to help dotnet developers get introduced to torch and AI/LLM's. Code filled with comments to help you learn.
shunting-yard
Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.
datacooker-etl
Data transformation framework for ETL processing with SQL-like syntax and GIS extensions, based on Apache Spark
hadoop-sandbox-images
Docker image builds for Hadoop sandbox.