Leong Kwok Hing's repositories
GSP345-Automating-Infrastructure-on-Google-Cloud-with-Terraform-Challenge-Lab
Steps to complete the Automating Infrastructure on Google Cloud with Terraform: Challenge Lab (GSP345)
YandexCatBoost-Python-Demo
Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data exploration, cleaning, preprocessing and model tuning are performed on the dataset
SentimentAnalysis-Python-Demo
Submission of an in-class NLP sentiment analysis competition held at Microsoft AI Singapore group. This submission entry explores the performance of both lexicon & machine-learning based models
Network-Analysis-on-MRT-Station
Demo on applying the concept of network analysis on a network of connected railway stations, attempting to identify the important stations (nodes) in this network. Web scraping techniques using rvest package is also briefly discussed upon.
AI-Planet-LLM-Bootcamp-Challenge
An LLM challenge to (i) fine-tune pre-trained HuggingFace transformer model to build a Code Generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
Exploratory-Data-Analysis-on-SMRT-Tweets
Demo on performing exploratory data analysis (EDA) on train service disruptions based on scrapped (user generated contents) tweets from the train operator's (SMRT) twitter account
Rfacebook-Demo
Demo on using the Rfacebook package to scrape/mine user generated contents (UGCs) from public FB pages. Simple sentiment, text, trend and posting frequency analysis are performed on these scraped UGCs
SHAP-XAI-Demo
Demo on performing Explainable AI using the SHAP Library
Udacity-Project-Dog-Classification
This project from the Udacity Deep Learning Nanodegree Programme deploys the use of Convolutional Neural Networks (CNN) in PyTorch to classify dog breeds.
Creating-a-multi-sports-event-dashboard-in-PowerBI
A demonstration on creating a simple multi-sports event dashboard using PowerBI and DAX
Demo-on-automated-fact-checking-using-S-BERT
In this demo, we illustrate the the possibility of using Semantic Search + Recognising Textual Entailment with Gradio to build an automated fact checking tool
Regression-with-a-Crab-Age-Dataset
A light-weight Kaggle challenge to predict crabs' age
TF2-Cifar10-CNN-Demo
Demo on performing multiclass image classification using Convolutional Neural Network (CNN) in Tensorflow 2. Techniques such as earlystopping, batchnormalizing and dropout are explored to prevent overfitting
Uplimit-Project-Podcast-Frontend
An LLM app to summarize a podcast episode, identifies podcast guests and attempts to retrieve the guest's public information from wikipedia, and identifies key highlights using OpenAI ChatGPT with prompting techniques
WiDS-Datathon-Patient-Survival
A challenge to create a model that uses data from the first 24 hours of intensive care to predict patient survival
Pencils-of-Promise
Data for A Cause - Pencils of Promise
The-Best-Things-In-Life-Are-Free
A Repository of Learning Resources & Massive Open Online Courses (MOOCs) for personal development
Udacity-Bertelsmann-Intro-to-ML-with-TensorFlow
Udacity x Bertelsmann Scholarship - Introduction to Machine Learning with TensorFlow
Visualizing-Datasets-with-Facets
Demo on using Facets: An Open Source Visualization Tool for Machine Learning Training Data developed by Google's PAIR Initiative