earle's starred repositories

uv

An extremely fast Python package and project manager, written in Rust.

Language:RustLicense:Apache-2.0Stargazers:23465Issues:50Issues:3560

zellij

A terminal workspace with batteries included

pandas_exercises

Practice your pandas skills!

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:10809Issues:312Issues:65

mage-ai

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Language:PythonLicense:Apache-2.0Stargazers:7844Issues:62Issues:840

1brc

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

Language:JavaLicense:Apache-2.0Stargazers:6185Issues:61Issues:67

quarto-cli

Open-source scientific and technical publishing system built on Pandoc.

Language:JavaScriptLicense:NOASSERTIONStargazers:3879Issues:31Issues:5002

arroyo

Distributed stream processing engine in Rust

Language:RustLicense:Apache-2.0Stargazers:3714Issues:42Issues:148

DataFrame

C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types and contiguous memory storage

Language:C++License:BSD-3-ClauseStargazers:2458Issues:70Issues:219

trustfall

A query engine for any combination of data sources. Query your files and APIs as if they were databases!

Language:RustLicense:Apache-2.0Stargazers:2397Issues:20Issues:75

quary

Open-source BI for engineers

Language:RustLicense:Apache-2.0Stargazers:2175Issues:12Issues:44

mono

Free and open-source monospaced font from Evil Martians

great-tables

Make awesome display tables using Python.

Language:PythonLicense:MITStargazers:1843Issues:13Issues:219

incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Language:ScalaLicense:Apache-2.0Stargazers:1177Issues:41Issues:2233

excalidraw-libraries

Collection of publicly available libraries

Language:JavaScriptLicense:MITStargazers:837Issues:16Issues:42

datafusion-comet

Apache DataFusion Comet Spark Accelerator

Language:RustLicense:Apache-2.0Stargazers:791Issues:58Issues:429

nimble

New file format for storage of large columnar datasets.

Language:C++License:Apache-2.0Stargazers:435Issues:21Issues:4

polars_ds_extension

Polars extension for general data science use cases

Language:RustLicense:MITStargazers:361Issues:8Issues:60

sqruff

Fast SQL formatter/linter

Language:RustLicense:Apache-2.0Stargazers:328Issues:4Issues:56

sqlframe

Turning PySpark Into a Universal DataFrame API

Language:PythonLicense:MITStargazers:302Issues:6Issues:44

h3-duckdb

Bindings for H3 to DuckDB

Language:C++License:Apache-2.0Stargazers:167Issues:7Issues:48

sqloxide

Python bindings for sqlparser-rs

Language:RustLicense:MITStargazers:159Issues:3Issues:15

Contoso-Data-Generator

Custom Contoso database generator and ready-to-use Contoso sample databases for SQL Server

Language:C#License:MITStargazers:144Issues:12Issues:12

fireducks

Create an issue on FireDucks

pipxu

Install and Run Python Applications in Isolated Environments using UV

Deneb-Vega-Templates

Data visualization templates for Deneb, a custom visual for Power BI. The templates are examples of Vega (not Vega-Lite) data visualizations that can be used in Deneb as is or as a starting point for developing more advanced custom data visualizations.

Deneb-Vega

This repository contains source code and data for data visualizations created by Andrzej Leszkiewicz using Vega visualization grammar. The visualizations presented here can be embedded into any Power BI report using the Deneb custom visual, as well as directly into any web page or app.

duckdb-power-query-connector

DuckDB Power Query Custom Connector by MotherDuck

deltaray

Delta reader for the Ray open-source toolkit for building ML applications

Language:PythonLicense:Apache-2.0Stargazers:42Issues:5Issues:8

spark-substrait-gateway

Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).

Language:PythonLicense:Apache-2.0Stargazers:17Issues:5Issues:10

chdb-cli

Simple CLI / REPL for chdb made in Python