I wonder in how far the correct or wrong transcription system affects the observed entropy of the VM text, namely the observed “low information content”.
Obviously, there are two major ways in which the transcription can be wrong: Either ciphertext character strings are broken up or joined at the wrong position (Is qo really one letter or two? What about dain, daiin and daiiin?), or characters which are identical are treated as different, or vice versa. (C/e/cc/ch come to mind. How many different gallows are really there?)
What would the effect on entropy be? Perhaps I should look up the old statistics books and see what difference a larger/smaller word length and/or character set would make.