It was very hard to find a good resource for using gensim's doc2vec. A lot of the work here is derived from this post with some tweaks given my own format of training data.