Thats memory argument is inacurrate, both in cuda as in dx12/vulkan you can specify host visible memory buffers and of course stream data from ram. Also you could split the levels in chunks or perform progressive light baking, keeping only the necessary parts for a frame in memory. How do you think dynamic lighting is calculated?