AnnthomyGILLES / AnnthomyGILLES

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

πŸ‘‹ Hi, I'm Annthomy GILLES

🌎 Location: Montréal, Québec, Canada
πŸ”— LinkedIn: https://www.linkedin.com/in/annthomygilles
πŸ“– Fundamentals of Data Engineering (∼50%)

πŸ“š About Me

Senior Data Consultant and Data Scientist with extensive experience in Information Management and Data Analytics. Adept at designing and implementing innovative solutions across various industries, including government, automotive, and IT consulting. Proficient in Python, R, Big Data technologies, and Machine Learning. Passionate about leveraging data to drive actionable insights, improve processes, and support decision-making.

My Articles

ChatGPT: Reflet du bullshit en entreprise
Comprendre les mΓ©tiers de la Data le temps d’une pause cafΓ©.
Is your organization TRULY data-driven? 12 questions to find out!
Le Temps GuΓ©rit Tout. ExceptΓ© Le Mauvais Code.

Current side projects

Dashboard Project - Private πŸ“ŠπŸ’»

  • Building a dashboard connected to a database using Flask, mySQL, and web scraping.
  • Implemented automatic notifications sent to Discord and Telegram.

WhatsApp Integration - PrivateπŸ“±πŸ’¬

  • Building a Python pipeline connected to a database using Flask,
  • MongoDB, and Docker. Implemented API integration with WhatsApp for automated messaging.

Weather Data Aggregation with Kafka - Public ☁️🌑️

  • Building a project to scrape weather data from different APIs.
  • Experimenting with Kafka to aggregate the data.
  • Integrating Spark for data analysis and processing.
  • Project is focused on learning Kafka and expanding knowledge of big data technologies.

πŸ’Ό Experience

πŸ‡¨πŸ‡¦ KPMG Canada (Oct 2022 - Present): Senior Consultant, Information Management & Data Analytics

πŸ‡§πŸ‡ͺ Belgian Government (Mar 2021 - Oct 2022): Data Scientist

  • Worked on a graph-based modelling project for COVID-19 infection spread and management.
  • Gained experience with Neo4j, ElasticSearch, PostgreSQL, MongoDB, Prefect, Dask, Python, Apache Airflow, Unit testing, CI/CD, JIRA, Agile, API, Pandas, and scikit-learn.

πŸ‡§πŸ‡ͺ Toyota Motor Corporation (Feb 2020 - Nov 2020): Data Scientist Consultant

  • Worked on a DataOps project to clean and prepare data from car sensors for R&D use cases.
  • Gained experience with AWS services, Dask, Python, multi Unit testing, CI/CD, JIRA, Agile, API, Pandas, and scikit-learn.

πŸ‡§πŸ‡ͺ Positive Thinking Company (Oct 2019 - Oct 2022): Data Scientist

  • Developed an automated tool for resume classification and summarization using NLP techniques.
  • Gained experience with Python, R, Shiny, MongoDB, TFIDF, word2vec, doc2vec, Random Forest, XGboost, and Docker.

πŸ‡«πŸ‡· Devoteam (Feb 2019 - May 2019): Data Consultant

  • Built a comprehensive web app dashboard for employee management and tracking.
  • Gained experience with Google Cloud Platform (GCP), Docker, web development, Firebase, R, JavaScript, MongoDB, Git, HTML, and CSS.

πŸ‡«πŸ‡· bioMerieux ( Sept. 2016 - Sept. 2018): Data Scientist

  • Worked on a decision support system for improving doctors prescribing behavior during infectious disease.
  • Gained experience with Python, R, inferential statistics, machine learning, dimensionality reduction, business intelligence, metagenomics, differential abundance analysis, nanopore technology, and SQL.

πŸ‡©πŸ‡ͺ Max Planck Institute (Mar. 2016 - Aug. 2016): Computational Biologist

  • Developed a differential gene expression analysis workflow using Python, shell, and R languages.
  • Gained experience with Tuxedo suite, DeSEQ2, MEME suite, GATK, Picards-tools, Stringtie, Go enrichment, variant calling, and differential expression.

πŸ‡«πŸ‡· Merial, a Sanofi Company (Mar. 2015 - Sept. 2015): Biological Engineer

  • Characterized virulence factors and vaccine targets of a bacterial canine pathogen.
  • Gained experience with cell culture techniques, flow cytometry, genetic engineering, northern and western blotting, fluorescent and confocal microscopy, and PCR.

Skills

Category Skills
Programming 🐍 Python, R, πŸ’» Shell/Bash/Command line
Databases πŸƒ MongoDB, πŸ—ƒοΈ SQL, πŸ”— Neo4J
Statistics & Machine Learning πŸ”¬ Inferential Statistics, πŸ“ˆ Hypothesis testing, πŸ“Š Regression methods, πŸ”„ Correlation, πŸ“‰ Descriptive Statistics, 🚦 Markov model, 🌐 Dimensionality reduction, 🧩 Clustering, 🌳 Decision tree, 🧠 KNN, πŸŽ„ SVM, 🌱 Random forest
Tools 🧰 Git, πŸ“Š Matplotlib, πŸ”’ Numpy, 🐼 Pandas, πŸƒ Pymongo, πŸ”¬ Scipy, πŸ€– Scikit-learn, 🌊 Seaborn, πŸ”— SQLalchemy
Web Development 🌐 HTML5/CSS3, πŸ’» Javascript, Typescript, NestJs, Prisma, 🌢️ Flask
Environment πŸ’» High Performance Computing, 🐧 Linux
Data Science πŸ› οΈ Data Engineering, πŸ§‘β€πŸ’Ό Data Governance, πŸ“ˆπŸ“‰πŸ“Š Big Data, πŸ€– Machine Learning, πŸ“Š Data Analytics, πŸƒMongoDB, 🐳 Docker, πŸ—ƒοΈ PostgreSQL, ☁️ Amazon Web Services (AWS), πŸ“ˆ JIRA, 🌐 Web Development, πŸ§‘β€πŸ”¬ NLP

🏫 Education

University of Rouen Normandie

Master in Bioinformatics and Statistics (2015 - 2018)

  • Three-year Research & Professional Master's Degree in Bioinformatics, Statistics and Mathematics.
  • Curriculum covers management, processing, and analysis of sequences and massive data.
  • Data science: supervised learning (Regression, Decision Tree, Random Forests, Markov Chains, SVM, KNN, Neural Network) and unsupervised learning (KNN, K-means, CAH)

University of Poitiers

Master's Degree in Bioengineering and Biomedical Engineering (2013 - 2015)

  • Interdisciplinary education in biomedical research and engineering program from various backgrounds including bioengineering, cell and molecular biology, oncology, pharmacology, genetics, and microbiology.

University of the French West Indies and Guiana

Bachelor's Degree (Licence) in Biochemistry and Biology (2010 - 2013)

  • Curriculum covers biochemistry, cellular & molecular biology, immunology, physiology, biological statistics, organic chemistry.

About