There are 2 repositories under the training-set-generator topic.
A synthetic data generator for text recognition
Generate physically realistic synthetic datasets of cluttered scenes using 3D CAD models to train CNN-based object detectors
Cross-Platform Browser-Based Tool to Manually Demarcate Regions in Images