Almost everything has been said.
I think the right method is :
-align by small chunk - I like to keep my small components in the 300-1500 range, because as the time is directly proportional to the number of features squared, above 1500 this is exploding - of course, it depends on your subject, how “local” it is, the precision of your pictures, the computer you have, the memory you have…
-export and import your components in one final project. I confirm that you will need the big CLI license if the total pictures goes above 2500 - but if you are at the limit sometimes the small chunks prealignment helps you to kick out the useless pictures and if the total is under 2500, you are fine with the Promo license. You can also filter out some cameras.
Then, you don’t care about the components in different positions / scales / orientations because you need to realign to get one component, and this one will be oriented and positioned using ground control points, or manually.
As it was said before, you then need to reconstruct in one pass - so you need to first make sure you have one component first.
Of course, depending on your subject you can also try to reconstruct seperate objects and stitch them in a 3D software - it could work in some cases, but usually this is less precise…
So, about parallelism, you can’t on one computer (RC is already parallel in nature but only one instance can run at a time - which is enough in my opinion), so one way to deal with a kind of parallelism (that is what I am doing) is to use one computer for the small individual alignments (with the Promo license - you don’t even need a GPU) and one computer (this one needs some GPU power and RAM) with another license (possibly the CLI if you go above 2500 pictures).
And then, all the “game” is to plan the captures so you have these chunks/clusters to align and that they eventually align together at the end, and gives you the precision you need.