I think the only way to get this working is to mask each and every image before feedeing it into RC.
Here is a method that works without much additional effort:
https://support.capturingreality.com/hc/en-us/community/posts/360043121532/comments/360005149732#community_comment_360005149732 (scroll down to the second screenshot)
The trick is to have different backgrouds for each take and not have anything obscure the scanned object.
Of course, you need a feature-rich object for this to work, which, as you suspect, may be a problem with the sole.