Synthetic computer vision data

Hi,

I am currently looking for a computer vision package or plugin, similar to the one available for Unity, that lets me move a camera around a scene and save ground-truth data (bounding boxes, masks, pose information, etc.) alongside the rendered images. If no such package or plugin exists, is there an easy way to get started with Blueprints to build a simple camera that I can move around and that programmatically saves images together with their ground-truth bounding boxes?

Thanks,
Sungshik