Hypothesis

39 Matching Annotations

Mar 2024
chiselapp.com chiselapp.com

Intro a Pharo

1
1. lj19 14 Mar 2024
  
  in Public
  
  $@ charCode
  
  El carácter ($) se utiliza para denotar un carácter literal en Pharo, y (@) es el carácter específico que sigue al signo ($). El carácter charCode es un mensaje que se puede enviar a un carácter para obtener su valor numérico de código de carácter Unicode.
  
  Unicode
Visit annotations in context

Tags

Unicode

Annotators

lj19

URL

chiselapp.com/user/leidy-palma/repository/leidy-palma/doc/7b97cd39c4dd5a27a64615ec05d321a5e97f52e40ea5b284804c6dee231f816b/Wiki/es/intro-a-pharo--em9qm.md.html
Jan 2024
en.wikipedia.org en.wikipedia.org

List of Unicode characters - Wikipedia

1
1. pivic 31 Jan 2024
  
  in Public
  
  U+003E > 62 076 Greater-than sign
  
  \u003c
  
  unicode
Visit annotations in context

Tags

unicode

Annotators

pivic

URL

en.wikipedia.org/wiki/List_of_Unicode_characters
Dec 2023
cuis-smalltalk.github.io cuis-smalltalk.github.io

String -- a particular collection (The Cuis-Smalltalk Book)

1
1. EBirman 08 Dec 2023
  
  in Public
  
  In Cuis-Smalltalk, strings with characters not part of the ASCII table are usually instances of UnicodeString. In the same way, you may get an instance of UnicodeSymbol and not Symbol, or UnicodeCodePoint and not Character. You usually don’t need to care about this. The ASCII and Unicode classes provide the same services.
  
  See Unicode Support in Cuis Smalltalk and its presentation at Smalltalks 2022. See also the 2023-04-05 Cuis Smalltalk Meeting
  
  Unicode
Visit annotations in context

Tags

Unicode

Annotators

EBirman

URL

cuis-smalltalk.github.io/TheCuisBook/String-_002d_002d-a-particular-collection.html
Aug 2023
www.unicode.org www.unicode.org

UTS #51 Addendum: Unicode Emoji QID

1
1. kael 25 Aug 2023
  
  in Public
  
  unicode emoji qid
Visit annotations in context

Tags

emoji

qid

unicode

Annotators

kael

URL

unicode.org/review/pri408/pri408-tr51-QID.html
Apr 2023
symbl.cc symbl.cc

Musical Symbols, 𝄀 𝄁 𝄂 𝄃 𝄄, 256 symbols, Unicode Range: 1D100-1D1FF (◕‿◕) SYMBL

1
1. kael 29 Apr 2023
  
  in Public
  
  unicode html music solfege wikipedia:en=Musical_Symbols_(Unicode_block)
Visit annotations in context

Tags

html

unicode

wikipedia:en=Musical_Symbols_(Unicode_block)

music

solfege

Annotators

kael

URL

symbl.cc/en/unicode/blocks/musical-symbols/
www.unicode.org www.unicode.org

The Unicode Standard, Version 15.0

1
1. kael 29 Apr 2023
  
  in Public
  
  unicode html music solfege wikipedia:en=Musical_Symbols_(Unicode_block)
Visit annotations in context

Tags

html

unicode

wikipedia:en=Musical_Symbols_(Unicode_block)

music

solfege

Annotators

kael

URL

unicode.org/charts/PDF/U1D100.pdf
molekulo.medium.com molekulo.medium.com

Accidentals (♭ ♮ ♯) and Other Musical Symbols as HTML Entities

1
1. kael 29 Apr 2023
  
  in Public
  
  unicode html music solfege
Visit annotations in context

Tags

music

html

unicode

solfege

Annotators

kael

URL

molekulo.medium.com/accidentals-and-other-musical-symbols-as-html-entities-c3eac35e6a6b
Mar 2023
github.com github.com

Improve Password Length Validation for BCrypt Compatibility by guilleiguaran · Pull Request #47708 · rails/rails

1
1. TylerRick 30 Mar 2023
  
  in Public
  
  user = User.new(password: "あ" * 25) # 25 characters, 75 bytes
  
  characters vs. bytes
  
  Unicode characters character encoding distinction
Visit annotations in context

Tags

distinction

character encoding

Unicode characters

Annotators

TylerRick

URL

github.com/rails/rails/pull/47708/files
Dec 2022
www.zhihu.com www.zhihu.com

Unicode 和 UTF-8 有什么区别？ - 知乎

1
1. caocao485 12 Dec 2022
  
  in Public
  
  Unicode 和 UTF-8 有什么区别？
  
  utf-8 unicode 字符编码编码
Visit annotations in context

Tags

unicode

utf-8

编码

字符编码

Annotators

caocao485

URL

zhihu.com/question/23374078
Nov 2022
developer.mozilla.org developer.mozilla.org

btoa() - Web APIs | MDN

2
1. TylerRick 19 Nov 2022
  
  in Public
  
  The btoa() function takes a JavaScript string as a parameter. In JavaScript strings are represented using the UTF-16 character encoding: in this encoding, strings are represented as a sequence of 16-bit (2 byte) units. Every ASCII character fits into the first byte of one of these units, but many other characters don't. Base64, by design, expects binary data as its input. In terms of JavaScript strings, this means strings in which each character occupies only one byte. So if you pass a string into btoa() containing characters that occupy more than one byte, you will get an error, because this is not considered binary data:
  
  JavaScript Unicode characters UTF-16 character encoding complicated/intricate
2. TylerRick 19 Nov 2022
  
  in Public
  
  If you need to encode Unicode text as ASCII using btoa(), one option is to convert the string such that each 16-bit unit occupies only one byte.
  
  unfortunate workaround workaround character encoding Unicode characters JavaScript
Visit annotations in context

Tags

JavaScript

character encoding

complicated/intricate

Unicode characters

workaround

unfortunate workaround

UTF-16

Annotators

TylerRick

URL

developer.mozilla.org/en-US/docs/Web/API/btoa
en.wikipedia.org en.wikipedia.org

Specials (Unicode block) - Wikipedia

3
1. TylerRick 03 Nov 2022
  
  in Public
  
  Thus the replacement character is now only seen for encoding errors, such as invalid UTF-8.
  
  Unicode replacement character problem: incorrectly encoded character / invalid byte sequence
2. TylerRick 03 Nov 2022
  
  in Public
  
  At one time the replacement character was often used when there was no glyph available in a font for that character. However, most modern text rendering systems instead use a font's .notdef character, which in most cases is an empty box (or "?" or "X" in a box[5]), sometimes called a "tofu" (this browser displays 􏿾). There is no Unicode code point for this symbol.
  
  "glyph not found" glyph Unicode replacement character distinction
3. TylerRick 03 Nov 2022
  
  in Public
  
  The replacement character � (often displayed as a black rhombus with a white question mark) is a symbol found in the Unicode standard at code point U+FFFD in the Specials table. It is used to indicate problems when a system is unable to render a stream of data to a correct symbol.[4] It is usually seen when the data is invalid and does not match any character:
  
  Unicode replacement character
Visit annotations in context

Tags

Unicode replacement character

distinction

problem: incorrectly encoded character / invalid byte sequence

"glyph not found" glyph

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Specials_(Unicode_block)
stackoverflow.com stackoverflow.com

Is there a "glyph not found" character?

3
1. TylerRick 03 Nov 2022
  
  in Public
  
  By the way, I am not talking about � (replacement character). This one is displayed when a Unicode character could not be correctly decoded from a data stream. It does not necessarily produce the same glyph:
  
  distinction Unicode replacement character "glyph not found" glyph
2. TylerRick 03 Nov 2022
  
  in Public
  
  replacement glyph
  
  Unicode replacement character
3. TylerRick 03 Nov 2022
  
  in Public
  
  U+25A1 □ WHITE SQUARE may be used to represent a missing ideograph
  
  apparently distinct from: Unicode replacement character (U+FFFD)
  
  Unicode replacement character
Visit annotations in context

Tags

Unicode replacement character

distinction

"glyph not found" glyph

Annotators

TylerRick

URL

stackoverflow.com/questions/13730544/is-there-a-glyph-not-found-character
www.w3.org www.w3.org

Missing characters and glyphs

1
1. TylerRick 03 Nov 2022
  
  in Public
  
  The character exists in Unicode/ISO 10646, but not in the character encoding used for the document. In this case, use Numeric Character References (NCRs, example: 噸).
  
  problem: character missing in a given encoding Unicode replacement character web HTML
Visit annotations in context

Tags

Unicode replacement character

HTML

problem: character missing in a given encoding

web

Annotators

TylerRick

URL

w3.org/International/articles/missing-char-glyph/index.en
stackoverflow.com stackoverflow.com

Is there any unicode character whose glyph is missing in all fonts?

2
1. TylerRick 03 Nov 2022
  
  in Public
  
  However after doing a bit of testing I see that this character is not used to represent missing glyphs on either my Windows 7 computer or the Android phone I've tested with (Motorola Atrix).
  
  surprising Unicode replacement character cross-platform differences Windows platform: Android Android
2. TylerRick 03 Nov 2022
  
  in Public
  
  The Unicode replacement character sounds promising when reading about it on Wikipedia: It is used to indicate problems when a system is not able to render a stream of data to a correct symbol. It is most commonly seen when a font does not contain a character, but is also seen when the data is invalid and does not match any character
  
  Unicode replacement character
Visit annotations in context

Tags

cross-platform differences

Unicode replacement character

Windows

surprising

Android

platform: Android

Annotators

TylerRick

URL

stackoverflow.com/questions/22475157/is-there-any-unicode-character-whose-glyph-is-missing-in-all-fonts
apple.stackexchange.com apple.stackexchange.com

What typeface is used by macOS to render unicode glyphs?

1
1. TylerRick 03 Nov 2022
  
  in Public
  
  All glyphs are Unicode glyphs!
  
  Unicode glyph
Visit annotations in context

Tags

Unicode

glyph

Annotators

TylerRick

URL

apple.stackexchange.com/questions/381669/what-typeface-is-used-by-macos-to-render-unicode-glyphs
Oct 2022
www.unicode.org www.unicode.org

Unicode Mail List Archive: UTF-64 [warning: contains bits &

1
1. almereyda 31 Oct 2022
  
  in Public
  
  Of course, if super-intelligent Aliens will arrive on our planet, bearing a writing system with billions characters, I will withdraw this proposal and donate the name "UTF-64" to the Unicode Consortium.
  
  Unicode UTF UTF-64
Visit annotations in context

Tags

Unicode

UTF

UTF-64

Annotators

almereyda

URL

unicode.org/mail-arch/unicode-ml/y2001-m05/0425.html
Aug 2022
www.runoob.com www.runoob.com

字符集和字符编码（Charset & Encoding） | 菜鸟教程

1
1. caocao485 26 Aug 2022
  
  in Public
  
  Unicode 是基于通用字符集（Universal Character Set）的标准来发展，并且同时也以书本的形式[1]对外发表
  
  utf-8是unicode字符集的编码方式之一
  
  unicode 字符集
Visit annotations in context

Tags

字符集

unicode

Annotators

caocao485

URL

runoob.com/w3cnote/charset-encoding.html
Apr 2022
dev.to dev.to

Use Unicode characters for bullet points in CSS using ::marker

1
1. kael 10 Apr 2022
  
  in Public
  
  css ul { list-style-type: none; } ul li:before { content:"\2713"; }
  
  css unicode
Visit annotations in context

Tags

unicode

css

Annotators

kael

URL

dev.to/cassidoo/use-unicode-characters-for-bullet-points-in-css-using-marker-3bnj
Mar 2022
code.visualstudio.com code.visualstudio.com

Visual Studio Code February 2022

1
1. TylerRick 15 Mar 2022
  
  in Public
  
  ambiguous and invisible Unicode characters
  
  'е' != 'e'
  
  ambiguous Unicode characters good example see content below see image below
Visit annotations in context

Tags

Unicode characters

good example

see content below

see image below

ambiguous

Annotators

TylerRick

URL

code.visualstudio.com/updates/v1_65
Dec 2021

Here are the single characters which can be normalised down to a valid TLD. They're mostly country codes, but there are a few interesting exceptions:

㏕ - US Military
℡ - .tel registry
№ - Norway
㍳ - Australia
㍷ - Dominica
㎀ - Panama
㎁ - Namibia
㎃ - Morocco
㎊ - French Polynesia
㎋ - Norfolk Island
㎏ - Kyrgyzstan
㎖ - Mali
㎙ - Federated States of Micronesia
ﬁ - Finland
㎜ - Myanmar
㎝ - Cameroon
㎞ & ㏎ - Comoros
㎰ - Palestine
㎳ - Montserrat
㎷ & ㎹ - Republic of Maldives.
㎺ - Palau
㎽ & ㎿ - Malawi
㏄ - Cocos (Keeling) Islands
㏅ - Democratic Republic of Congo
㏉ - Guyana
㏗ - Philippines
㏘ - Saint Pierre and Miquelon
㏚ - Puerto Rico
㏛ - Suriname
㏜ - El Salvador
℠ - San Marino
™ - Turkmenistan
ﬆ & ﬅ - São Tomé and Príncipe
㎇ - Great Britain (Obsolete)
ß - South Sudan (Not available)
㏌ - India and Indiana (subdomain of .us)
Ⅵ & ⅵ - Virgin Islands and Virginia (subdomain of .us)
ﬂ - Florida (subdomain of .us)
㎚ - New Mexico (subdomain of .us)
㎵ - Nevada (subdomain of .us)
㍵ - As part of .ovh

url unicode

kael 16 Dec 2021

in Public
Nestling among the "Letterlike Symbols" are two curious entries. Both of these are single characters:
- Telephone symbol - ℡
- Numero Sign - №
What's interesting is both .tel and .no are Top-Level-Domains (TLD) on the Domain Name System (DNS).

So my contact site - https://edent.tel/ - can be written as - https://edent.℡/

And the Norwegian domain name registry NORID can be accessed at https://www.norid.№/

Copy and paste those links - they work in any browser!
url unicode

Visit annotations in context

Annotators

kael

URL

shkspr.mobi/blog/2018/11/domain-hacks-with-unusual-unicode-characters/

Jun 2021
en.wikipedia.org en.wikipedia.org

ISO 15924 - Wikipedia

1
1. TylerRick 04 Jun 2021
  
  in Public
  
  Through a linkpin called "Property Value Alias", Unicode has made a 1:1 connection between a script defined, and its ISO 15924 standard.
  
  correspondence ISO language codes Unicode
Visit annotations in context

Tags

Unicode

correspondence

ISO language codes

Annotators

TylerRick

URL

en.wikipedia.org/wiki/ISO_15924
Apr 2021
en.wikipedia.org en.wikipedia.org

List of XML and HTML character entity references - Wikipedia

1
1. TylerRick 16 Apr 2021
  
  in Public
  
  The use of U+212B 'Angstrom sign', which was encoded due to round-trip mapping compatibility with an East-Asian character encoding, is discouraged, and the preferred representation is U+00C5 'capital letter A with ring above', which has the same glyph.
  
  Is there a difference in semantic meaning between the two? And if so, what is it?
  
  recommended option/alternative character encoding Unicode subtle distinction semantic meaning easy to confuse (mix up) glyph preferred variant official preferred convention / way to do something round-tripping
Visit annotations in context

Tags

preferred variant

official preferred convention / way to do something

subtle distinction

Unicode

glyph

character encoding

round-tripping

semantic meaning

easy to confuse (mix up)

recommended option/alternative

Annotators

TylerRick

URL

en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references
en.wikipedia.org en.wikipedia.org

Miscellaneous Symbols and Arrows - Wikipedia

1
1. TylerRick 02 Apr 2021
  
  in Public
  
  symbols Unicode characters
Visit annotations in context

Tags

Unicode characters

symbols

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Miscellaneous_Symbols_and_Arrows
Feb 2021
copyheart.org copyheart.org

Copyheart.org

1
1. TylerRick 08 Feb 2021
  
  in Public
  
  But the circle on its own doesn’t seem to be available as a nonspacing diacritic in Unicode. Bugger.
  
  unfortunate Unicode
Visit annotations in context

Tags

Unicode

unfortunate

Annotators

TylerRick

URL

copyheart.org/
Sep 2020
developer.mozilla.org developer.mozilla.org

RegExp.prototype.dotAll

2
1. TylerRick 23 Sep 2020
  
  in Public
  
  The value of dotAll is a Boolean and true if the "s" flag was used; otherwise, false. The "s" flag indicates that the dot special character (".") should additionally match the following line terminator ("newline") characters in a string, which it would not match otherwise: U+000A LINE FEED (LF) ("\n") U+000D CARRIAGE RETURN (CR) ("\r") U+2028 LINE SEPARATOR U+2029 PARAGRAPH SEPARATOR This effectively means the dot will match any character on the Unicode Basic Multilingual Plane (BMP). To allow it to match astral characters, the "u" (unicode) flag should be used. Using both flags in conjunction allows the dot to match any Unicode character, without exceptions.
  
  Unicode exceptions to the rule
2. TylerRick 23 Sep 2020
  
  in Public
  
  regular expressions javascript: RegExp Unicode
Visit annotations in context

Tags

exceptions to the rule

regular expressions

Unicode

javascript: RegExp

Annotators

TylerRick

URL

developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/RegExp/dotAll
Jun 2020
codepoints.net codepoints.net

U+FFFC OBJECT REPLACEMENT CHARACTER

1
1. TylerRick 24 Jun 2020
  
  in Public
  
  UTF-8 EF BF BC UTF-16 FF FC
  
  Unicode
Visit annotations in context

Tags

Unicode

Annotators

TylerRick

URL

codepoints.net/U+FFFC
Feb 2020
www.w3.org www.w3.org

Case folding - Internationalization

1
1. TylerRick 04 Feb 2020
  
  in Public
  
  natural languages Unicode letter case
Visit annotations in context

Tags

Unicode

letter case

natural languages

Annotators

TylerRick

URL

w3.org/International/wiki/Case_folding
Oct 2018
ru.wikipedia.org ru.wikipedia.org

Проект:Внесение символов алфавитов народов России в Юникод — Википедия

1
1. ildar 30 Oct 2018
  
  in Public
  
  unicode
Visit annotations in context

Tags

unicode

Annotators

ildar

URL

ru.wikipedia.org/wiki/Проект:Внесение_символов_алфавитов_народов_России_в_Юникод
Sep 2018
www.discoversdk.com www.discoversdk.com

ES2018 - Unicode with Regex - DiscoverSDK Blog

1
1. kael 28 Sep 2018
  
  in Public
  
  tl;rl js regex unicode
Visit annotations in context

Tags

regex

unicode

js

tl;rl

Annotators

kael

URL

discoversdk.com/blog/es2018-unicode-with-regex
Sep 2015
www.unicode.org www.unicode.org

Full Emoji Data

1
1. regis 12 Sep 2015
  
  in Public
  
  GMail
  
  Gmail now uses the same set of emojis as other Google properties (Android, Hangouts) http://gmailblog.blogspot.com/2015/06/express-yourself-in-email-hundreds-more.html
  
  emoji gmail unicode
Visit annotations in context

Tags

emoji

unicode

gmail

Annotators

regis

URL

unicode.org/emoji/charts/full-emoji-list.html
Apr 2015
www.w3.org www.w3.org

Character Model for the World Wide Web: String Matching and Searching

1
1. judell 23 Apr 2015
  
  in Public
  
  This part of the Character Model for the World Wide Web covers string matching—the process by which a specification or implementation defines whether two string values are the same or different from one another.
  
  w3c unicode
Visit annotations in context

Tags

unicode

w3c

Annotators

judell

URL

w3.org/International/docs/charmod-norm/

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators