This is the implementation of our RA-L work 'Learning Affordance Segmentation for Real-world Robotic Manipulation via Synthetic Images'. The framework segments affordance maps by jointly detecting and localizing candidate regions within an image. Rather than requiring annotated real-world images, the framework learns from synthetic data and adapts to real-world data without supervision. The original arxiv paper can be found here.
If you find it helpful for your research, please consider citing:
@article{chu2019learning,
title = {Learning Affordance Segmentation for Real-world Robotic Manipulation via Synthetic Images},
author = {F. Chu and R. Xu and P. A. Vela},
journal = {IEEE Robotics and Automation Letters},
year = {2019},
volume = {4},
number = {2},
pages = {1140-1147},
DOI = {10.1109/LRA.2019.2894439},
ISSN = {2377-3766},
month = {April}
}
Caffe:
- Install Caffe by following the Caffe installation instructions.
- Caffe must be built with support for Python layers.
- You will need the modified Caffe layers in this repository, so please make sure you clone from here.
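Python layer support is controlled by a flag in Caffe's `Makefile.config`; a sketch of the relevant fragment is below (uncomment it before building; the rest of your `Makefile.config` depends on your setup):

```make
# In Makefile.config: uncomment this line so Caffe is compiled
# with Python layer support (required by this repo's layers).
WITH_PYTHON_LAYER := 1
```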
Specifications:
- CuDNN-5.1.10
- CUDA-8.0
- Clone the AffordanceNet_DA repository into your `$AffordanceNet_DA_ROOT` folder:

```shell
git clone https://github.com/ivalab/affordanceNet_DA.git
cd affordanceNet_DA
```
- Export pycaffe path
`export PYTHONPATH=$AffordanceNet_DA_ROOT/caffe-affordance-net/python:$PYTHONPATH`
- Build Cython modules
cd $AffordanceNet_DA_ROOT/lib
make clean
make
cd ..
Download pretrained models:
- Download the trained model for the demo from Dropbox and put it under `./pretrained/`.
Demo:

```shell
cd $AffordanceNet_DA_ROOT/tools
python demo_img.py
```
We train AffordanceNet_DA on GAZEBO synthetic data and UMD real data:
- You will need synthetic data and real data in Pascal VOC dataset format.
- For your convenience, we did this for you. Just download this file from Dropbox and extract it into your `$AffordanceNet_DA_ROOT` folder.
- The extracted folder should contain three sub-folders: `$AffordanceNet_DA_ROOT/data/cache`, `$AffordanceNet_DA_ROOT/data/imagenet_models`, and `$AffordanceNet_DA_ROOT/data/VOCdevkit2012`.
- You will need the VGG-16 weights pretrained on ImageNet. For your convenience, please find them here.
- You will need to continue training from the VGG-16 model finetuned on synthetic data. For your convenience, please find it here.
- Put the weights into `$AffordanceNet_DA_ROOT/imagenet_models`.
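After extracting, a quick sanity check can confirm the three sub-folders are in place. A minimal sketch (`check_data` is a hypothetical helper, not part of the repo):

```shell
# Hypothetical helper: verify the three expected data sub-folders
# exist under the given repo root (e.g. $AffordanceNet_DA_ROOT).
check_data() {
  root="$1"
  missing=0
  for d in data/cache data/imagenet_models data/VOCdevkit2012; do
    if [ -d "$root/$d" ]; then
      echo "found:   $d"
    else
      echo "missing: $d"
      missing=1
    fi
  done
  return "$missing"
}
```

Usage: `check_data "$AffordanceNet_DA_ROOT"` prints one line per folder and returns non-zero if any folder is missing.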
Train AffordanceNet_DA:

```shell
cd $AffordanceNet_DA_ROOT
./experiments/scripts/faster_rcnn_end2end.sh 0 VGG16 pascal_voc
```
MIT License
This repo borrows tons of code from
- affordanceNet by nqanh
- da-faster-rcnn by yuhuayc
If you have any questions, please contact me at fujenchu[at]gatech[dot]edu