David Roher's repositories

boxball

Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, ClickHouse, Drill, Parquet, and CSV.

Language: Python | License: Apache-2.0 | Stargazers: 110 | Issues: 14 | Issues: 32

etymology-db

An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship types.

Language: Python | License: Apache-2.0 | Stargazers: 66 | Issues: 3 | Issues: 2
Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 9 | Issues: 0 | Issues: 0

diachronic

Get daily historical snapshots of every article on any Wiki, formatted as Parquet files

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 1 | Issues: 0

aoc-2018-sql

Advent of Code 2018 in SQL

Stargazers: 0 | Issues: 2 | Issues: 0

arrow

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

arrow-datafusion

Apache Arrow DataFusion and Ballista query engines

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

arrow-rs

Official Rust implementation of Apache Arrow

Language: Rust | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

baseball.computer.rs

Rust parser for the baseball.computer database.

Language: Rust | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Svelte | Stargazers: 0 | Issues: 0 | Issues: 1

boxball-snippets

Queries run on the Boxball DB.

License: Apache-2.0 | Stargazers: 0 | Issues: 2 | Issues: 0

dbt-duckdb

dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

droher

Config files for my GitHub profile.

Stargazers: 0 | Issues: 2 | Issues: 0

duckdb-web

DuckDB-Web - Source code of duckdb.org

Language: JavaScript | Stargazers: 0 | Issues: 1 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

retrosheet

Enhanced version of Retrosheet (http://www.retrosheet.org) data.

Language: Grammatical Framework | Stargazers: 0 | Issues: 1 | Issues: 0

metricflow

MetricFlow allows you to define, build, and maintain metrics in code.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Processing | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HCL | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0