The LAS 1.4 spec (link below) states that VLRs of a las file shall be encoded as UTF8. However, RealityCapture exports las files with VLRs (in particular the CRS) encoded as latin1. This results in errors when trying to manipulate these las files with standard libraries such as laspy, as they do not follow the official spec.
One question though, that Howard Butler mentions in this github issue is that the spec seems to now indicate the VLRs should be encoded as ascii (not sure about that part).
He found out, that the specification doesn’t define, if the strings should be 7bit ascii or utf8 or latin1.
The User ID field is ASCII character data that identifies the user
Description
Optional text description of the data. Any remaining unused characters must be null
UTF8 is mentioned only in case, if is used OGC, but we are using GeoTiff.
But there was a bug about latin1 info writing into file. It writes into file that there is more VLR blocks as in the reality. This bug was solved and should be implemented in one of the next releases.
Regarding the question, the 1.4 spec says that “string (a variable series of 1 byte characters, ASCII encoded , null terminated)” (page 4).‘’