jackyangzzh / Computer-Vision-Nanodegree-Projects

Projects from my Udacity Computer Vision Nanodegree that includes: Facial Keypoint Detection, Image Captioning and SLAM

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Computer Vision Nanodegree Projects

Overview

This repository is a collection of projects I completed for the Udacity Computer Vision Nanodegree, which I implemented with various computer vision tools from PyTorch to OpenCV.

Content

  1. Project 1: Facial Keypoint Detection
  2. Project 2: Image Captioning
  3. Project 3: Landmark Detection (SLAM)

Projects

Project 1: Facial Keypoint Detection

For this project, I built a facial keypoint detection system using Haar Casacdes and Convolutional Neural Network (CNN). This facial keypoint detection system that takes in any image with faces, and predicts the location of 68 distinguishing keypoints on each face!

Demo: results produced by my model
Obama Obama Obama


Project 2: Image Captioning

For this project, I designed and created a Convolutional Neural Network (CNN), Recurrent Neural Network (RNN) and LSTM (Long Short Term Memory) architecture to automatically generate captions from images. The model is trained on Microsoft Common Objects in COntext (MS COCO) dataset.

Demo: results produced by my model

captioning

captioning


Project 3: Landmark Detection (SLAM)

For this project, I implemented SLAM (Simultaneous Localization and Mapping) for a 2 dimensional world by combining robot sensor measurements and movement to create a map of an environment.

Demo: results produced by my model

SLAM

About

Projects from my Udacity Computer Vision Nanodegree that includes: Facial Keypoint Detection, Image Captioning and SLAM


Languages

Language:Jupyter Notebook 99.5%Language:Python 0.5%