For instance, the texts at asianclassics.org are encoded for the TibetanMachineWeb font. This font relies on an arcane, font-specific encoding to produce the proper consonant stacks and similar features. As a result, the texts offered by that site cannot be used as-is if any kind of sensible information processing is to be performed on [...]
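
To make the problem concrete, here is a minimal Python sketch of the usual remedy: translating font-encoded runs into Unicode through a lookup table. Everything named here is an assumption for illustration. The font names `ToyFont` and `ToyFont1`, the table `TOY_LEGACY_TO_UNICODE`, and its entries are hypothetical placeholders, not the real TibetanMachineWeb glyph assignments. The sketch also assumes that the meaning of a character depends on which font it is set in, so the input is a list of (font, text) runs rather than a plain string.

```python
import unicodedata

# HYPOTHETICAL mapping: (font name, legacy character) -> Unicode string.
# Placeholder entries only; these are NOT the real TibetanMachineWeb codes.
TOY_LEGACY_TO_UNICODE = {
    ("ToyFont", "!"): "\u0F40",         # TIBETAN LETTER KA
    ("ToyFont", "-"): "\u0F0B",         # TIBETAN MARK INTERSYLLABIC TSHEG
    ("ToyFont1", "#"): "\u0F66\u0F90",  # one stacked glyph: SA + SUBJOINED KA
}


def legacy_to_unicode(runs, table):
    """Convert (font, text) runs into a single Unicode string.

    Raises KeyError on an unmapped glyph instead of silently dropping it.
    """
    out = []
    for font, text in runs:
        for ch in text:
            key = (font, ch)
            if key not in table:
                raise KeyError(f"no mapping for {ch!r} in font {font!r}")
            out.append(table[key])
    return "".join(out)


def describe(text):
    """Return the Unicode character name of each code point in text."""
    return [unicodedata.name(c) for c in text]


# A stacked syllable, a tsheg, then a plain letter, as font-encoded runs.
runs = [("ToyFont1", "#"), ("ToyFont", "-!")]
converted = legacy_to_unicode(runs, TOY_LEGACY_TO_UNICODE)

for name in describe(converted):
    print(name)
```

Before conversion, the input is just `#`, `-` and `!`, which mean nothing without the font. After conversion, each code point carries its own identity, so the text can be searched, sorted, and segmented with ordinary tools. A real converter would need the complete, verified table for every font file involved.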