π Hello, I'm Kenneth Leung
Thanks for popping by! As an avid learner, bold builder, curious explorer, and driven doer with a bias towards action, I enjoy seeking and solving meaningful problems with data and technology while having fun at the same time.
I welcome you to join me on a learning journey! Follow me on GitHub , Medium , and LinkedIn for a great dose of practical educational data science content.
You can find my data science portfolio below, where every project and article was born out of inspiration, curiosity, and motivation. Feel free to reach out for a chat on topics common to both of us!
π¨βπ§ Currently working on: (i) Applied Generative AI Use Cases, and (ii) Compilation of high-profile ML failures: Failed-ML . If you're keen to join me in contributing, let's connect!
How to reach me
Portfolio Contents
Computer Vision
Database Management
Data Extraction and Web Scraping
Data Science Certification Guides
Data Science Toolkit
Data Science in the Real World
Generative AI
Insights from Data Science Talks
Machine Learning
MLOps
Natural Language Processing
Networks and Graphs
Sports Analytics
Visualization
Web Development
Web3 and Metaverse
Writing for DataCamp
Writing Tips
Projects with β are my personal favourites, so do check them out!
Computer Vision ποΈ
Title
Article
Repo
Classifying Images of Alcoholic Beverages with fast.ai v2
π
π
Russian Car Plate Detection with OpenCV and TesseractOCR
π
π
Evaluate OCR Output Quality with Character Error Rate (CER) and Word Error Rate (WER)
π
π
Top Python libraries for Image Augmentation in Computer Vision
π
π
β PyTorch Ignite Tutorial - Classifying Tiny ImageNet with EfficientNet
π
π
Practical Guide to Transfer Learning in TensorFlow for Multiclass Image Classification
π
π
Database Management ποΈ
Title
Article
Repo
β Definitive Guide to Creating a SQL Database on Cloud with AWS and Python
π
π
PyMySQLβ-βConnecting Python and SQL for Data Science
π
π
Data Extraction and Web Scraping π§°
Title
Article
Repo
Using OneMap API to extract Singapore postal codes, coordinates and travel distance
-
π
A Detailed Web Scraping Walkthrough Using Python and Selenium
π
π
β How to Web Scrape Wikipedia using LangChain Agents and Tools with OpenAI's LLMs and Function Calling
π
π
Data Science Certification Guides π¨βπ
Title
Article
Repo
3 Steps to Get AWS Cloud Practitioner Certified in 2 Weeks
π
π
3 Steps to Get Tableau Desktop Certified in 2 Weeks
π
-
β No-Frills Guide to Passing the AWS Certified Machine Learning Specialty Exam
π
-
Data Science Toolkit π οΈ
Title
Article
Repo
Common Python codes for Data Wrangling
-
π
Enhance your Python codeβs readability with pycodestyle
π
-
Free Resources for Generating Realistic Fake Data
π
-
Most Starred and Forked GitHub Repos for Data Science and Python
π
-
Most Starred and Forked GitHub Repos for Data Science and R
π
-
Automatically Generate Machine Learning Code with Just a Few Clicks
π
-
Read and Modify Image Metadata with Python
π
π
Top Tips to Google Search Like a Seasoned Data Scientist
π
-
How to Swap Day and Month of Incorrectly Formatted Excel Dates
π
-
Data Science in the Real World π
Title
Article
Repo
Exploring Illegal Drugs in Singapore β A Data Perspective
π
π
Pharmacokinetic Modeling of Drug Concentration Trajectories using Ordinary Differential Equations (ODE) and Global Optimization with Differential Evolution
-
π
Healthcareβs AI Future β In Conversation with Andrew Ng and Fei-Fei Li
π
-
Real-World Data Science Use Cases in the Insurance Industry
π
-
β Failed-ML: Compilation of high-profile real-world examples of failed machine learning projects
π
π
Generative AI π€
Title
Article
Repo
Generative AI Pharmacist - Macy
π
π
β ChatPod - Q&A over your Podcasts with Whisper, FAISS, and LangChain
π
π
β Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
π
π
Domain LLMs - Compilation of Customized LLMs for Specific Domains and Industries
-
π
β Text-to-Audio Generation with Bark, Clearly Explained
π
π
Guide to ChatGPT's Advanced Settings β Top P, Frequency Penalties, Temperature, and More
π
-
Insights from Data Science Talks π¨βπ«
Title
Article
Repo
Bridging AIβs Proof-of-Concept to Production Gap β Insights from Andrew Ng
π
-
Machine Learning π°
Title
Article
Repo
Exploring Condominium Rental Prices with Web Scraping and Exploratory Data Analysis
π
π
Using Ensemble Regressors to Predict Condominium Rental Prices
π
π
The Dying ReLU Problem, Clearly Explained
π
-
Why Bootstrapping Actually Works
π
-
β Assumptions of Logistic Regression, Clearly Explained
π
π
Data-Centric AI Competition - Tips and Tricks of a Top 5% Finish
π
π
Credit Card Fraud Detection with AutoXGB
π
π
β Micro, Macro & Weighted Averages of F1 Score, Clearly Explained
π
-
Principal Component Regression - Clearly Explained and Implemented
π
π
β Feature Selection with Simulated Annealing in Python, Clearly Explained
π
π
Quick Primer on Types of Missing Data and Imputation Techniques
π
-
Imputation of Missing Data in Tables with DataWig
π
π
MLOps - Machine Learning Operations π¨βπ§
Title
Article
Repo
Key Learning Points from MLOps SpecializationβββCourse 1/4
π
π
Key Learning Points from MLOps SpecializationβββCourse 2/4
π
π
Key Learning Points from MLOps SpecializationβββCourse 3/4
π
π
Key Learning Points from MLOps SpecializationβββCourse 4/4
π
π
β End-to-End AutoML Pipeline with H2O AutoML, MLflow, FastAPI, and Streamlit for Insurance Cross-Sell
π
π
β How to Dockerize Machine Learning Applications Built with H2O, MLflow, FastAPI, and Streamlit
π
π
β Building and Managing an Isolation Forest Anomaly Detection Pipeline with Kedro
π
π
Natural Language Processing π
Title
Article
Repo
COVID-19 Vaccine β Whatβs the Public Sentiment?
π
π
Keyword Extraction and Analysis Pipeline with KeyBERT and Taipy
π
π
Networks and Graphs π
Title
Article
Repo
β Network Analysis and Visualization of Drug-Drug Interactions
π
π
How to Deploy Interactive Pyvis Network Graphs on Streamlit
π
π
A No-Code Approach to Building Knowledge Graphs
π
π
Sports Analytics β½
Title
Article
Repo
β Analyzing English Premier League VAR Football Decisions
π
π
Combining Python and R for FIFA Football World Ranking Analysis
π
π
Visualization π
Title
Article
Repo
Uniform Singapore Energy Price and Demand Forecast Dashboard (with Plotly Dash)
-
π
Visualizing Fortune 500 Companies in a Bar Chart Race
π
π
How to Easily Draw Neural Network Architecture Diagrams
π
π
Web Development π₯οΈ
Title
Article
Repo
β Post COVID-19 Vaccination Wait-Time Tracker (with Python Flask)
π
π
From HTTP to HTTPS β Easily Secure Flask Web Apps With Talisman
π
-
β Food King Directory (in collaboration with Night Owl Cinematics)
π
π
Web3 and Metaverse π¨βπ»
Title
Article
Repo
The Web3 / Metaverse Glossary β A Keyword Guide to the Tech Future
π
-
Writing for DataCamp βοΈ
Title
Article
Repo
β What Mature Data Infrastructure Looks Like
π
-
Democratizing Data in Government Agencies
π
-
A Survey Into Data Governance Tools
π
-
Scaling Data Science With Data Governance
π
-
3 Reasons Why All Teams Should Learn SQL
π
-
3 Reasons Why All Teams Should Learn R
π
-
How Tableau Helps Your Organization Achieve Greater Data Insights
π
-
How PowerBI Helps Your Organization Achieve Greater Data Insights
π
-
Writing Tips π
Title
Article
Repo
Create a Clickable Table of Contents for Your Medium Posts
π
-