I might be wrong, but I think the textures are streamed from their individual assets into a VT in memory as needed, rather than being stored in the VT assets themselves. The whole point of VT is to avoid having everything loaded at once: textures are broken into individually streamable blocks, so only a portion of a texture needs to be loaded, and different parts can be resident at different mipmap levels.
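
To make the idea concrete, here is a toy sketch in Python of how tile-level residency could work. This is only an illustration of the general concept, not how any particular engine implements it: `TileCache`, `TILE_SIZE`, `fake_load` and the texture names are all made up for the example, and the eviction policy (plain LRU) is an assumption.

```python
from collections import OrderedDict

TILE_SIZE = 128  # hypothetical tile edge length in texels


class TileCache:
    """Toy fixed-capacity pool of resident tiles with LRU eviction."""

    def __init__(self, capacity, load_tile):
        self.capacity = capacity
        self.load_tile = load_tile     # callback standing in for a disk read
        self.resident = OrderedDict()  # (texture, mip, tx, ty) -> tile data

    def request(self, texture, mip, u, v, texture_size):
        """Return the tile covering UV (u, v) at the given mip, loading it on a miss."""
        # Each mip halves the resolution; the smallest mips fit in a single tile.
        mip_size = max(texture_size >> mip, TILE_SIZE)
        tiles_per_side = mip_size // TILE_SIZE
        tx = min(int(u * tiles_per_side), tiles_per_side - 1)
        ty = min(int(v * tiles_per_side), tiles_per_side - 1)
        key = (texture, mip, tx, ty)
        if key in self.resident:
            self.resident.move_to_end(key)         # mark as most recently used
        else:
            if len(self.resident) >= self.capacity:
                self.resident.popitem(last=False)  # evict least recently used tile
            self.resident[key] = self.load_tile(key)
        return key, self.resident[key]


# Demo: record which tiles actually get "loaded from disk".
loads = []

def fake_load(key):
    loads.append(key)
    return f"texels for {key}"

cache = TileCache(capacity=2, load_tile=fake_load)
cache.request("rock", 0, 0.1, 0.1, 1024)   # close up: one tile of the full-resolution mip
cache.request("rock", 3, 0.9, 0.9, 1024)   # far away: a coarse mip of the same texture
cache.request("rock", 0, 0.1, 0.1, 1024)   # already resident, nothing is loaded
cache.request("grass", 0, 0.5, 0.5, 1024)  # pool is full, the coarse rock tile is evicted
```

In this sketch only the tiles that were actually requested ever occupy memory, and the same texture can have tiles from different mip levels resident at the same time, which is the behaviour described above.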