andreale28's repositories
Polars-Analysis
A pipeline to pull data from S3 and process using Polars, Delta-RS and DuckDB
awesome-courses
:books: List of awesome university courses for learning Computer Science!
awesome-hadoop
A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources
awesome-python
A curated list of awesome Python frameworks, libraries, software and resources
Big-Data-Engineering-Coursera-Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
brats17
Brain tumor segmentation for MICCAI 2017 BraTS challenge
Coursera-BigData-DistributedSystems-Specializations
A cache to store my Distributed System and Big Data related resources
data-science-ipython-notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
ShopeeChallenge
My approach to Shopee Kaggle Challenge
spark-playground
Batch Pipeline to pull data from S3 to Clickhouse as OALP database and use Starrocks for ad-hoc interactive query engine
deepschool.io
Deep Learning tutorials in jupyter notebooks.
ebookML_src
Source code in ebook Machine Learning
Introduction_to_Julia_tutorials
These are the jupyter notebooks used for intro tutorials to teach Julia
ipython-notebooks
A collection of IPython notebooks covering various topics.
machine-learning-for-software-engineers
A complete daily plan for studying to become a machine learning engineer.
nsgaiii
An implementation of NSGA-III in Python.
OptML_course
EPFL Course - Optimization for Machine Learning - CS-439
paper-tips-and-tricks
Best practice and tips & tricks to write scientific papers in LaTeX, with figures generated in Python or Matlab.
PolarsPipeline
Log ETL Pipeline with Polars, Delta-RS and more
pysmt
pySMT: A library for SMT formulae manipulation and solving
python-machine-learning-book-2nd-edition
The "Python Machine Learning (2nd edition)" book code repository and info resource
rsome
Robust Stochastic Optimization Made Easy
StreamingAnalytics
A streaming ETL pipeline with Postgresql, Debezium, RisingWave, Clickhouse
SubsetSelection.jl
Fast Subset Selection algorithm for Statistics/Machine Learning