9 Matching Annotations
  1. Nov 2022
    1. Thus the replacement character is now only seen for encoding errors, such as invalid UTF-8.
    2. Consider a text file containing the German word für (meaning 'for') in the ISO-8859-1 encoding (0x66 0xFC 0x72). This file is now opened with a text editor that assumes the input is UTF-8. The first and last byte are valid UTF-8 encodings of ASCII, but the middle byte (0xFC) is not a valid byte in UTF-8. Therefore, a text editor could replace this byte with the replacement character symbol to produce a valid string of Unicode code points. The whole string now displays like this: "f�r".
  2. Aug 2018
    1. This text analysis that it contains words written in hebrew and deciphering of the first sentence of the text using hebrew translation seems to align with what this author is saying about the text being passed down through the family.

      She made recommendations to the priest, man of the house and me and people.

      [Source] (https://hyp.is/GB7sZKjvEeidoGeGo8L6jA/www.independent.co.uk/news/science/mysterious-manuscript-decoded-computer-scientists-ai-a8180951.html)

    1. Comments, questions, suggestions? Your feedback is welcome.

      Sukhwant Singh's analysis here seems to fit with a lot of other's partial analysis/observations such as multiple characters representing the same character, certain characters only appearing at the end of words etc. It seems quite compelling. The dates however, are a century too early although that does not necessarily dispel his theory that it is written in Landa Khojki.

    2. Many "words" differ by only one character and are found in each other's vicinity

      This might suggest the same thing as Tiltman's analysis in that a single character may take several forms.

    3. Tiltman treats f as a variant form of k and p as a variant form of t

      When learning that there were over 100 characters used in the manuscript my first thought was that perhaps variations of a character were used to represent the same character.

    4. Speaking generally, each character behaves as if it has its own place in an 'order of precedence' within words; some symbols such as o and y seem to be able to occupy two functionally different places.

      This is very interesting. It seems to suggest that each word may be scrambled based on the characters used.