I know it’s a long time but as promised here is some info.
pak did work but I was kinda heavy on the storage. we needed something lite. So eventually we used something called “assimp”. tweaked it in C++ and we used GLB format to be imported into the scene from the server.
thanks for your suggestion on “pak” because it led us the way