The instructions used to derive the count are abstract, they are not direct GPU hardware instructions (which are different for every GPU vendor). They are referred to as an “intermediate language” (IL). The IL is converted to GPU instructions by the graphics driver at a later stage. The final GPU instructions should be roughly the same whether from SM5 or SM6, but the IL for SM6 is completely different. It is documented here if you are curious.
3 Likes