Hedi Bejaoui's repositories
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python.
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
spark
Apache Spark
spark-timeseries
A library for time series analysis on Apache Spark
livy-docker
Livy Docker image built on top of Hadoop+Hive+Spark
hadoop-hive-spark-docker
Base Docker image with just essentials: Hadoop, Hive and Spark.
mlflow-docker-compose
Deploy mlflow with docker-compose
docker-spark-livy
Spark Standalone & Livy
spark-nlp
State of the Art Natural Language Processing
DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
covid19-simulation
Toolkit for COVID-19 simulation.
coding-interview-university
A complete computer science study plan to become a software engineer.
HealthcareManagementSystem
Medical office management application