Maciej Szymczyk's repositories
wiadro-danych-kafka-to-es-ztm
Public transport API -> Python -> Kafka -> Kafka Streams -> Kafka -> Logstash -> Elasticsearch
wiadro-danych-kafka-streams
Materiały do wpisu https://wiadrodanych.pl/big-data/apache-kafka/kafka-streams/kafka-streams-101/
wiadro-danych-elk-map-ztm
Materiały do wpisu https://wiadrodanych.pl/bazy-danych/elasticsearch/wizualizacja-map-w-elasticsearch-i-kibana-gps-komunikacji-miejskiej/
cybersecurity-ksqldb
Results of playing with ksqlDB. The main context is cybersecurity, but there will also be general Big Data topics.
elastic-stack-docker-boilerplate
Created for the Elastic Stack Course of mine: https://wiadrodanych.pl/elastic
top-10-mitre-data-sources-with-pandas
Getting TOP 10 MITRE ATT&CK data sources with pandas
wiadro-danych-simple-spark-etl
Simply ETL written in PySpark. MongoDB + MySQL => Apache Cassandra
logstash-deduplication
Deduplicating events in Logstash using ruby filter and Redis
4developers-2020
Serilog + Elasticsearch + Kibana from 4developers 2020 conference
bigdata-scripts
A collection of big data scripts.
CDMCS
Cyber Defence Monitoring Course Suite :: Suricata, Moloch and others
cleaning-data-with-pandas
Cleaning "Vehicles registered in Poland broken down by voivodeships" dataset with Pandas.
detection-rules
Rules for Elastic Security's detection engine
ironman_pandas_jupyter
Fun with csv ironman results
jvm-bloggers
JVM Bloggers - website and newsletter with JVM blogs from Poland
kafka-connect-slack
Kafka Connect Sink for posting to Slack
wgnet-serilog-elk
Code used in lecture at WG.NET Meetup https://www.meetup.com/WG-NET/events/267718778/
wiadro-danych-koalas-pandas-fun
Simple operations using pandas/koalas/pyspark
wiadro-danych-spark-elasticsearch
Materiały do wpisu https://wiadrodanych.pl/big-data/spark/elasticsearch-spark/