So you decided against implementing something along the lines of taking the first bounce of gi from one frame and injecting it into the next one to finally remove it from the second frame when the bounce has already been calculated , in order to achieve infinite bounces?
ps: what is the size of the voxels and how big is the voxel grid that you are using to obtain a cost of 10-12 ms ?