For 50-500 concurrent connections, you’re looking at massive infrastructure complexity. Professional cards like RTX 6000 Ada would be better than gaming GPUs due to higher VRAM, virtualization support, and stability for 24/7 operation. You’d typically run one UE instance per GPU to avoid encoding bottlenecks.
However, managing hundreds of concurrent pixel streaming instances involves load balancing, auto-scaling, geographic distribution, session management, and substantial costs - you’re essentially building a cloud gaming platform from scratch.
I’d strongly recommend checking out Vagon Streams instead. They specialize in exactly this use case - high-scale Unreal Engine deployment with pixel streaming. They handle all the infrastructure complexity, GPU optimization, global edge distribution, and can scale from dozens to thousands of concurrent users without you managing servers or GPUs. This could save you months of development and potentially significant costs compared to building your own multi-GPU server farm.
Worth exploring before committing to the infrastructure build!