It has to load the entire scene because it’s difficult to tell what things would be effecting the lighting of another. And no, you can’t combine GPU memory.
You would need to reduce lightmap resolutions, texture resolutions, and polygon count to improve the amount of memory it uses.