Olivier Brunet's repositories
Hadoop_applications_smoke_tests
Basic apps / scripts that cover the most important functionality of each component of the Hadoop framework. Tested on serveral distributions such as MapR, Hortonworks but mainly on Cloudera CDP
Looker_Data_Analysis
Get prepared for the Looker certifications ! Theory & Practice : developement with LookML, Buisness / Data Analysis
Memory_systems_-_Anki_decks
All my memory techniques & training stuff with Anki decks (a program which makes remembering things easier with spaced repetition)
Awesome_Data_Science_Cheat_Sheets
Some of the most interesting Cheat Sheets i've found so far with my own ones // work always in progress :)
GCP_Data_Engineering
Notes aggregated from various sources in order to prepare the Google Cloud Platform Profesionnal Data Engineer Certification + Labs & Practice, tips, references...
obrunet.github.io
My Personnal Static Web Site https://obrunet.github.io/ with a new design
Data_Analytics_with_SPARK
Various projects with PySpark
My_Data_Science_Portfolio
An aggregation of all my Data Science challenges, studies, projects... from Kaggle / ENS Data Challenges / Zindi - various ML tasks : regression / classification / computer vision / N.L.P / recommendation engine
Spark_Computation_of_Connected_Component_in_Graphs
Implementation of the "CCF: Fast and Scalable Connected Component Computation in MapReduce" paper with Spark. Study of its scalability on several datasets using various clusters' sizes on Databricks and Google Cloud Platform (GCP)
Andrew-NG-Notes
This is Andrew NG Coursera Handwritten Notes.
Building_GHG_emissions
A Data Science project aimed at predicting the total greenhouse gas emissions of buildings in Seattle city.
current_private_ds_challs
all my current chall private repo WiP
Daily_coding_challenges
Exercices in various languages (Python, Scala, Shell, SQL, Go...) to improve your coding skills
Data_Science_Challenges_2020
An other year, an other Kaggle public kernels on all kind of topics, but also with Zindi & E.N.S challenges :)
data_to_viz
Leading to the dataviz you need
deepmath-exo7
Deepmath : Mathématiques des réseaux de neurones
Few_Data_Analysis
Analysis of various datasets with Python, statistics, data viz with Matplotlib & Seaborn
introduction_to_ml_with_python
Notebooks and code for the book "Introduction to Machine Learning with Python"
LeetCode_Solution
LeetCode Solution
ML-foundations
Machine Learning Foundations: Linear Algebra, Calculus, Statistics & Computer Science
ML-Notebooks
:fire: A series of code examples for all sorts of machine learning tasks and applications.
MLE-Flashcards
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
MLflow_End_to_End_Example
MLflow is Open source platform for the machine learning lifecycle so here you can learn MLflow End to End Example with Prediction.
notebooks
Jupyter notebooks for the Natural Language Processing with Transformers book
pandas_exercises
Practice your pandas skills!
Scala_exercices_beginner_to_advanced
Some simple exercices to get familiar with Scala
Spark_lessons_in_Scala
How to manipulate data with Spark in Scala
The-Python-Graph-Gallery
A website displaying hundreds of charts made with Python
Web_Scraping_Projects
Various projects in order to improve my scraping skills :)