Computer Vision Nanodegree Projects

Overview

This repository is a collection of projects I completed for the Udacity Computer Vision Nanodegree, which I implemented with various computer vision tools from PyTorch to OpenCV.

Content

Project 1: Facial Keypoint Detection
Project 2: Image Captioning
Project 3: Landmark Detection (SLAM)

Projects

Project 1: Facial Keypoint Detection

For this project, I built a facial keypoint detection system using Haar Casacdes and Convolutional Neural Network (CNN). This facial keypoint detection system that takes in any image with faces, and predicts the location of 68 distinguishing keypoints on each face!

Demo: results produced by my model

Project 2: Image Captioning

For this project, I designed and created a Convolutional Neural Network (CNN), Recurrent Neural Network (RNN) and LSTM (Long Short Term Memory) architecture to automatically generate captions from images. The model is trained on Microsoft Common Objects in COntext (MS COCO) dataset.

Demo: results produced by my model

Project 3: Landmark Detection (SLAM)

For this project, I implemented SLAM (Simultaneous Localization and Mapping) for a 2 dimensional world by combining robot sensor measurements and movement to create a map of an environment.

Demo: results produced by my model

About

Projects from my Udacity Computer Vision Nanodegree that includes: Facial Keypoint Detection, Image Captioning and SLAM

Languages

Language:Jupyter Notebook 99.5%Language:Python 0.5%