Juan C. Alvarez's starred repositories
Big-Data-Cluster
The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is solely intended for usage in a development environment. Do not use it to run any production workloads.
polars-streaming
Stream Processing using Polars
learning-notes
Notes on books I read, talks I watch, articles I study, and papers I love
docker-elk
The Elastic stack (ELK) powered by Docker and Compose.
amazon-kinesis-analytics-beam-taxi-consumer
Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data stream, processes and aggregates them, and ingests the result to Amazon CloudWatch for visualization.
Python-MongoDB-Example
A live, working example application built with Python, Qt, PySide2, MongoDB, PyMongo, QTreeView, and QAbstractTableModel
redis-stream-with-python
A proof of concept (POC) that uses Redis Streams and Python to write and read data
Mastering-Concurrency-in-Python
Mastering Concurrency in Python, published by Packt
cookiecutter-data-science
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
beam_flex_demo
A supporting repo for my Flex Template blog
training-data-analyst
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
comet-busters
Comet Busters! 1994 remake using SDL
savagewheels
🏁 2D car crashing game armageddon
Data-Engineering-with-AWS
Data Engineering with AWS, published by Packt
curso-apache-spark-platzi
Repository used for the Apache Spark course on Platzi
Cuso_Introductorio_de_Spark
Introductory Spark course by Platzi 💚