Paweł Tokaj's repositories
incubator-sedona
A cluster computing framework for processing large-scale geospatial data
airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Language: Java; License: NOASSERTION
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Language: Python; License: Apache-2.0
awesome-getindata-recommended-sources
A curated list of links to sources of the latest updates in data, ML, and AI
License: MIT
Language: Kotlin
Language: XSLT
License: Apache-2.0
dbt-airflow-factory
Library to convert DBT manifest metadata to Airflow tasks
License: Apache-2.0
Language: Scala; License: MIT
GeoSparkAndLivy
This repo shows how to combine GeoSpark and Livy
iceberg
Apache Iceberg
License: Apache-2.0
Language: Scala; License: Apache-2.0
Language: Scala
OpenLineage
An Open Standard for lineage metadata collection
Language: Java; License: Apache-2.0
Language: Jupyter Notebook
Language: Python
License: Apache-2.0
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Language: Java; License: Apache-2.0