bexcite / CarND-Capstone

Simple ROS-based system for a self-driving car ("Carla") with a traffic light detector built on the TensorFlow Object Detection API

Team Members

The following are the members of the selfdriven.tech team:

  1. Khalid Ashmawy (khalid.ashmawy@gmail.com)
  2. Pavlo Bashmakov (pavel.bashmakov@gmail.com)
  3. Mertin Curban-gazi (mertin23@yahoo.com)
  4. John Ryan (jtryan666@gmail.com)
  5. Brian Chan (499938834@qq.com)

Architecture

The following diagram shows the architecture of the project:

Architecture

Components

The following are the main components of the project:

  1. Perception: The perception component can be found under tl_detector and is responsible for classifying whether the vehicle has a red or green traffic light ahead of it.

The TL Detector component subscribes to the images captured by the camera mounted on the vehicle (/image_color); whenever a traffic light is within a certain distance, it starts passing the images to the classifier.

The classifier labels each image as a traffic light with red, yellow, or green lights. The check that the traffic light is within a certain distance is done against the YAML file which contains the positions of the traffic lights. The state of the traffic light is then published on the /traffic_waypoint topic for use by the other components. A minimal sketch of this wiring follows.
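
The following is a minimal rospy sketch of that wiring; the helper names (light_is_near, classify, stop_line_waypoint) and the detection distance are illustrative assumptions, not the project's exact code:

```python
#!/usr/bin/env python
# Minimal sketch of the TL Detector wiring (helper names and the
# detection distance are illustrative assumptions, not project code).
import rospy
from sensor_msgs.msg import Image
from std_msgs.msg import Int32

DETECTION_DISTANCE = 100.0  # metres; assumed threshold

class TLDetectorSketch(object):
    def __init__(self):
        rospy.init_node('tl_detector')
        rospy.Subscriber('/image_color', Image, self.image_cb)
        self.pub = rospy.Publisher('/traffic_waypoint', Int32, queue_size=1)

    def image_cb(self, msg):
        # Run the (expensive) classifier only when a light listed in the
        # YAML config is within DETECTION_DISTANCE of the vehicle.
        if not self.light_is_near():
            return
        state = self.classify(msg)  # 'RED', 'YELLOW' or 'GREEN'
        # Publish the stop-line waypoint index on red, -1 otherwise.
        index = self.stop_line_waypoint() if state == 'RED' else -1
        self.pub.publish(Int32(index))

    def light_is_near(self):
        return False  # placeholder: compare pose to YAML light positions

    def classify(self, image_msg):
        return 'GREEN'  # placeholder: run the traffic light classifier

    def stop_line_waypoint(self):
        return -1  # placeholder: nearest waypoint to the stop line

if __name__ == '__main__':
    TLDetectorSketch()
    rospy.spin()
```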

  2. Planning: The planning component can be found under waypoint_updater and is responsible for creating a list of waypoints, each with an associated target velocity, based on the output of the perception component.

The Waypoint Updater component subscribes to the following topics:

  1. /traffic_waypoint: The topic the TL Detector component publishes on whenever there is a traffic light ahead.
  2. /current_velocity: This topic is used to receive the vehicle velocity.
  3. /current_pose: The topic used to receive the vehicle position.

The Waypoint Updater detects from /traffic_waypoint whether there is a red traffic light ahead and, if so, triggers deceleration; otherwise it updates the waypoints ahead on the path with velocities equal to the maximum allowed velocity. The component finally publishes the result to /final_waypoints. A sketch of this velocity logic follows.
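
As a rough illustration of the velocity logic, here is a self-contained sketch; the function name, speed limit, and deceleration constant are all assumptions made for illustration:

```python
import math

MAX_SPEED = 11.1  # m/s (~40 km/h); illustrative value
MAX_DECEL = 0.5   # m/s^2; illustrative comfortable deceleration

def target_velocities(distances_to_stop, red_light_ahead):
    """Hypothetical helper: given each lookahead waypoint's path distance
    to the stop line, return a target velocity per waypoint. With no red
    light, drive at the speed limit; with one, taper speed to zero at the
    stop line using the kinematic relation v^2 = 2*a*d."""
    if not red_light_ahead:
        return [MAX_SPEED for _ in distances_to_stop]
    velocities = []
    for d in distances_to_stop:
        v = math.sqrt(2.0 * MAX_DECEL * max(d, 0.0))
        velocities.append(min(v, MAX_SPEED) if v > 1.0 else 0.0)
    return velocities

# Waypoints 40 m, 20 m, 5 m and 0 m before a red light's stop line:
print(target_velocities([40.0, 20.0, 5.0, 0.0], red_light_ahead=True))
# -> [6.32..., 4.47..., 2.23..., 0.0]
```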

  3. Control: The control component can be found under twist_controller and is responsible for throttle, brake, and steering based on the output of the planning component.

The DBW node is the component responsible for controlling the car. It subscribes to the following topics:

  1. /current_velocity: This topic is used to receive the vehicle velocity.
  2. /twist_cmd: This topic provides the proposed linear and angular velocities.
  3. /vehicle/dbw_enabled: This topic is used to receive whether manual or autonomous mode is enabled.
  4. /current_pose: This topic is used to receive the vehicle position.
  5. /final_waypoints: This topic is used to receive the waypoints from the planning component (Waypoint Updater).

The DBW node implements a PID controller that takes into account whether manual or autonomous mode is enabled. It takes input from /final_waypoints, /current_pose, and /current_velocity, and outputs throttle, steering, and brake messages on the /vehicle/throttle_cmd, /vehicle/steering_cmd, and /vehicle/brake_cmd topics. A minimal sketch of such a PID step follows.
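
The following sketch shows such a PID step with illustrative gains; the integral is reset while manual mode is engaged so that stale accumulated error does not jerk the car on re-engagement:

```python
class PIDSketch(object):
    """Minimal PID controller of the kind used for throttle control
    (the gains are illustrative, not the project's tuned values)."""

    def __init__(self, kp=0.3, ki=0.1, kd=0.02):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.last_error = 0.0

    def reset(self):
        self.integral = 0.0
        self.last_error = 0.0

    def step(self, error, dt):
        self.integral += error * dt
        derivative = (error - self.last_error) / dt
        self.last_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

pid = PIDSketch()
dbw_enabled = True  # would come from /vehicle/dbw_enabled in the real node
if dbw_enabled:
    # Error is target speed (from /twist_cmd) minus current speed.
    throttle = pid.step(error=1.5, dt=0.02)
    print(throttle)
else:
    pid.reset()  # drop accumulated error while the driver has control
```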

Classification

We tried multiple approaches for classification and finally settled on one based on the results.

The following are the approaches we tried:

  1. VGG16 with ImageNet weights, then fine-tuning

    We started with VGG16 with ImageNet weights, added one extra fully connected layer, and fine-tuned the result on both simulator and site images. This approach gave good predictions, with accuracy higher than 90%, but inference was very slow. That led us to try SqueezeNet, which has much faster inference; fast inference is critical here because the classifier must run in real time. A minimal sketch of this setup follows.
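
A minimal Keras sketch of that setup, assuming a four-class head (red/yellow/green/no light) and an illustrative 256-unit extra layer:

```python
# Sketch of approach 1: VGG16 with frozen ImageNet weights plus a small
# trainable head (the class count and layer size are assumptions).
from keras.applications import VGG16
from keras.layers import Dense, Flatten
from keras.models import Model

base = VGG16(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
for layer in base.layers:
    layer.trainable = False  # keep the ImageNet features, train only the head

x = Flatten()(base.output)
x = Dense(256, activation='relu')(x)     # the one extra fully connected layer
out = Dense(4, activation='softmax')(x)  # red / yellow / green / no light
model = Model(inputs=base.input, outputs=out)
model.compile(optimizer='adam', loss='categorical_crossentropy',
              metrics=['accuracy'])
# model.fit(...) on the mixed simulator + site training images
```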

  2. SqueezeNet trained on Nexar, then fine-tuning

    We started with a SqueezeNet trained on the Nexar dataset and fine-tuned it further on the simulator images. SqueezeNet was much faster than VGG16, had a very high accuracy rate (over 90%), and performed much better than the first approach. Being originally trained on the Nexar dataset rather than ImageNet must have contributed to that, given that Nexar consists mainly of traffic light images. Initially, however, the model performed well only on site images; after fine-tuning, it performed much better on simulator images as well.

  3. TensorFlow Object Detection API, then fine-tuning

    Finally, we used the TensorFlow Object Detection API with a model (faster_rcnn_resnet101_coco) fine-tuned separately on sim data (collected from the simulator) and site data (provided in the Udacity rosbag); a sketch of inference with such a frozen model appears at the end of this section.

    Video with recognitions on simulator images (new test set that wasn't used during the training):

    Inference speed is 0.04–0.09 s on an Nvidia GTX 1080 Ti GPU.

    Cropping is not used.

    Video of traffic light detector testing:

    Recognition Result

    We also publish the /image_test topic, which is useful for viewing the detections live during sim/site tests:

    Traffic Light Detector
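
For reference, inference with a frozen Object Detection API model typically looks like the following TF 1.x sketch; the image size and the 0.5 score threshold are illustrative, while the tensor names are the API's standard exported names:

```python
import numpy as np
import tensorflow as tf

# Load the exported frozen graph (frozen_inference_graph.pb is the
# Object Detection API's standard export name).
graph = tf.Graph()
with graph.as_default():
    graph_def = tf.GraphDef()
    with tf.gfile.GFile('frozen_inference_graph.pb', 'rb') as f:
        graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name='')

with tf.Session(graph=graph) as sess:
    image = np.zeros((1, 600, 800, 3), dtype=np.uint8)  # stand-in camera frame
    boxes, scores, classes = sess.run(
        ['detection_boxes:0', 'detection_scores:0', 'detection_classes:0'],
        feed_dict={'image_tensor:0': image})
    # Keep confident detections, then map class ids to light colors.
    keep = scores[0] > 0.5
    print(boxes[0][keep], classes[0][keep])
```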

Run on Carla

We've made all the necessary parameter changes inside the relevant launch files, so the whole sequence can be launched with just:

roslaunch launch/site.launch

Run Video

Installation

  • Be sure that your workstation is running Ubuntu 16.04 Xenial Xerus or Ubuntu 14.04 Trusty Tahr. Ubuntu downloads can be found here.

  • If using a Virtual Machine to install Ubuntu, use the following configuration as minimum:

    • 2 CPU
    • 2 GB system memory
    • 25 GB of free hard drive space

    The Udacity provided virtual machine has ROS and Dataspeed DBW already installed, so you can skip the next two steps if you are using this.

  • Follow these instructions to install ROS

  • Dataspeed DBW

  • Download the Udacity Simulator.

Usage

  1. Clone the project repository
git clone https://github.com/bexcite/CarND-Capstone.git
  2. Install Python dependencies
cd CarND-Capstone
pip install -r requirements.txt
  3. Make and run styx
cd ros
catkin_make
source devel/setup.sh
roslaunch launch/styx.launch
  4. Run the simulator

Real world testing

  1. Download the training bag that was recorded on the Udacity self-driving car (a bag demonstrating the correct predictions in autonomous mode can be found here)
  2. Unzip the file
unzip traffic_light_bag_files.zip
  3. Play the bag file
rosbag play -l traffic_light_bag_files/loop_with_traffic_light.bag
  4. Launch your project in site mode
cd CarND-Capstone/ros
roslaunch launch/site.launch
  5. Confirm that traffic light detection works on real-life images
