Now, I know that sounds like an immensely hard ask, but we could do so much if data layers could load when a player fires up an instigated cinematic sequence device, so that the data layer only shows for that player.
This would let us manage performance and have hundreds of players on the same map, each viewing something relevant to their own client instead of loading everything near them into memory.