NVidia GPUs are general-purpose highly parallelized computing devices at their core. NVidia CUDA is one of their primary technologies. For cross-platform (i.e cross-vendor) applications, we have OpenCL. Those computing technologies are used everywhere nowadays (physic simulations, non-polygonal rendering engines, neural networks, etc, etc, etc), saying “they’re not suited for other applications” is nonsense.
When the amount of data is massive, you modify algorithm to render it without decompressing it. Most of the point cloud will be highly similar data or empty space. There’s already DXT compression for 2d images, something similar should be done for volumetric data too. Voxels compress well with octrees.
Frankly, I’d expect you to know/understand that already.