That’s the idea. Here is an example of using the TensorFlow C API to load a graph and run a session: https://github.com/Neargye/hello_tf_…ession_run.cpp. Wrapping that in a more Unreal-friendly API would be the next step to make things easier.
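To give a rough idea of the first step, here is a minimal sketch of reading a frozen .pb graph into memory using only standard C++. The helper name `ReadFileBytes` is my own, not something from the linked example. The TensorFlow C API calls named in the comments (`TF_NewBufferFromString`, `TF_GraphImportGraphDef`) are only where I'd expect the bytes to go next; they aren't called here, so this compiles without TensorFlow.

```cpp
#include <fstream>
#include <iterator>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper: read a whole file (e.g. a frozen .pb graph) into
// memory as raw bytes. Throws std::runtime_error if it cannot be opened.
std::vector<char> ReadFileBytes(const std::string& path) {
    // Binary mode matters: a .pb file is a serialized protobuf, not text.
    std::ifstream file(path, std::ios::binary);
    if (!file) {
        throw std::runtime_error("could not open " + path);
    }
    return std::vector<char>(std::istreambuf_iterator<char>(file),
                             std::istreambuf_iterator<char>());
}

// Assumed next steps with the TensorFlow C API (not compiled here):
//   TF_Buffer* buf = TF_NewBufferFromString(bytes.data(), bytes.size());
//   TF_GraphImportGraphDef(graph, buf, import_options, status);
// followed by creating a session and running it.
```

In a packaged Unreal build, the path you pass in would need to point at wherever the .pb ends up on disk.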
Packaging the .pb files isn’t a big issue: you can add them as runtime dependencies and they’ll get packaged along with everything else. See https://github.com/getnamo/tensorflow-ue4/blob/master/Source/TensorFlow/TensorFlow.Build.cs#L32 for how that’s done for the script files.
Keep in mind that another common approach to machine learning is to run the model as a cloud service: you send data to your server and get results back, so you don’t have to worry about compatibility with your platform of choice.
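If you go the server route, the client side mostly comes down to serializing inputs and sending them over HTTP. As a rough sketch, assuming a JSON body with a made-up `inputs` key (your server defines the real format), building the request could look like this. `BuildInferenceRequest` is a hypothetical name, and the HTTP call itself is left out.

```cpp
#include <cstddef>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper: serialize one input vector as a JSON request body.
// The {"inputs": [...]} shape is an assumption, not a fixed standard.
std::string BuildInferenceRequest(const std::vector<float>& inputs) {
    std::ostringstream body;
    body << "{\"inputs\": [";
    for (std::size_t i = 0; i < inputs.size(); ++i) {
        if (i > 0) {
            body << ", ";  // separator between elements, none before the first
        }
        body << inputs[i];  // default stream formatting, up to 6 significant digits
    }
    body << "]}";
    return body.str();
}
```

Note that NaN or infinite values would not produce valid JSON here, so filter those out before sending.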