Jacek Laskowski (jaceklaskowski)

jaceklaskowski

Geek Repo

Company:Freelance Data Engineer

Location:Warsaw, Poland

Home Page:https://books.japila.pl

Twitter:@jaceklaskowski

Github PK Tool:Github PK Tool


Organizations
japila-books

Jacek Laskowski's repositories

spark-workshop

Apache Spark™ and Scala Workshops

Language:HTMLLicense:Apache-2.0Stargazers:253Issues:31Issues:0

kafka-notebook

The Internals of Apache Kafka

License:Apache-2.0Stargazers:131Issues:11Issues:0

spark-kubernetes-book

The Internals of Spark on Kubernetes

kafka-workshop

Materials (slides and code) for Kafka and Kafka Streams Workshops

Language:JavaScriptLicense:Apache-2.0Stargazers:59Issues:4Issues:0

spark-delta-lake-workshop

Spark and Delta Lake Workshop

Language:PythonLicense:Apache-2.0Stargazers:21Issues:4Issues:0

learn-databricks

Notebooks to learn Databricks Lakehouse Platform

Language:PythonLicense:Apache-2.0Stargazers:12Issues:3Issues:0

scala-academy

Scala Academy

Language:ScalaLicense:Apache-2.0Stargazers:11Issues:12Issues:0

spark-meetups

Learning Spark on Kubernetes in a series of Warsaw Data Engineering meetups online!

Language:ScalaLicense:Apache-2.0Stargazers:9Issues:4Issues:0

spark-examples

Apache Spark Examples

Language:ScalaStargazers:4Issues:4Issues:0

spark

Mirror of Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:2Issues:3Issues:0

trino-meetups

Learning Trino in a series of Warsaw Data Engineering meetups online!

Language:ScalaLicense:Apache-2.0Stargazers:1Issues:3Issues:0

ccloud-gitpod-demo

demo ccloud + gitpod

Language:JavaStargazers:0Issues:2Issues:0

cloud-bigtable-examples

Examples of how to use Cloud Bigtable both with GCE map/reduce as well as stand alone applications.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:3Issues:0

couchbase-spark-connector

The Official Couchbase Spark Connector

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

dbt-spark

dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

docs

Linode guides and tutorials.

Language:HTMLStargazers:0Issues:2Issues:0

docusaurus-tutorial

https://docusaurus.io/docs/en/tutorial-setup

Stargazers:0Issues:4Issues:0

horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

hudi

Upserts, Deletes And Incremental Processing on Big Data.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

iceberg

Apache Iceberg

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

jaceklaskowski

My personal repository

Stargazers:0Issues:3Issues:0

java-docs-samples

Java and Kotlin Code samples used on cloud.google.com

Language:JavaLicense:Apache-2.0Stargazers:0Issues:2Issues:0

ksql

The database purpose-built for stream processing applications.

Language:JavaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

LightGBM

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Language:C++License:MITStargazers:0Issues:2Issues:0
Language:ScalaLicense:Apache-2.0Stargazers:0Issues:3Issues:1

spark-flowchart

Flowchart for debugging Spark aplications

Language:ShellStargazers:0Issues:2Issues:0

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

yurii-double-metrics

Spark app to demo multiple executions of flatMapGroupsWithState's stateUpdateFunc when used with DeltaTable.merge

Language:ScalaStargazers:0Issues:3Issues:0