Hypothesis

18 Matching Annotations

Jan 2025
en.wikipedia.org en.wikipedia.org

Internationalized Resource Identifier - Wikipedia

1
1. TylerRick 30 Jan 2025
  
  in Public
  
  All non-ASCII code points in the IRI should next be encoded as UTF-8, and the resulting bytes percent-encoded, to produce a valid URI.
  
  IRI UTF-8
Visit annotations in context

Tags

UTF-8

IRI

Annotators

TylerRick

URL

en.wikipedia.org/wiki/Internationalized_Resource_Identifier
May 2023
datascience.codata.org datascience.codata.org

The Challenge of Ensuring Persistency of Identifier Systems in the World of Ever-Changing Technology

1
1. WHPrivate 08 May 2023
  
  in Public
  
  articulates requirements for readability sating that identifiers must be: Any printable characters from the Universal Character Set of ISO/IEC 10646 (ISO 2012):UTF-8 encoding is required; Case insensitive:Only ASCII case folding is allowed.
  
  {UTF-8} {ASCII Case Folding}
  
  UTF-8 Technical Independence Functionality ASCII Case Folding
Visit annotations in context

Tags

UTF-8

ASCII Case Folding

Technical

Independence

Functionality

Annotators

WHPrivate

URL

datascience.codata.org/articles/10.5334/dsj-2017-013
Dec 2022
www.zhihu.com www.zhihu.com

java中GBK编码格式转成UTF8，用一段方法实现怎么做？ - 知乎

1
1. caocao485 12 Dec 2022
  
  in Public
  
  java中GBK编码格式转成UTF8，用一段方法实现怎么做？
  
  utf-8 编码
Visit annotations in context

Tags

utf-8

编码

Annotators

caocao485

URL

zhihu.com/question/20361462
www.zhihu.com www.zhihu.com

Unicode 和 UTF-8 有什么区别？ - 知乎

1
1. caocao485 12 Dec 2022
  
  in Public
  
  Unicode 和 UTF-8 有什么区别？
  
  utf-8 unicode 字符编码编码
Visit annotations in context

Tags

unicode

utf-8

字符编码

编码

Annotators

caocao485

URL

zhihu.com/question/23374078
May 2022
stackoverflow.com stackoverflow.com

(grep) Regex to match non-ASCII characters?

1
1. radera 09 May 2022
  
  in Public
  
  [^[:print:]] will probably suffice for you.**
  
  FOR ME
  
  because [ascii] doesnt work in cygwin's grep
  
  #utf-8
Visit annotations in context

Tags

#utf-8

Annotators

radera

URL

stackoverflow.com/questions/2124010/grep-regex-to-match-non-ascii-characters
www.justinweiss.com www.justinweiss.com

How Rails sessions work

2
1. radera 09 May 2022
  
  in Public
  
  You can use a heuristic: only change strings that have one of the bad characters in them, like â. This works well if a character like â won’t ever appear in a valid string. The last time I fixed this kind of bug, though, I wanted to play it safe. I used another useful tool to help: my eyes. Whenever I found a badly encoded string, I printed it out, along with its replacement:
  
  no magic solutions!
  
  #utf-8
2. radera 09 May 2022
  
  in Public
  
  It seems like those three bytes should be read as UTF-8, where they’d represent a curly quote. Instead, each byte is showing up as a different character. So, which encoding would represent [226, 128, 153] as â€™? If you look at a few tables of popular encodings, you’ll see it’s Windows-1252.
  
  -In UTF8 are 3 bytes - In W1252 a byte= a char
  
  #utf-8
Visit annotations in context

Tags

#utf-8

Annotators

radera

URL

justinweiss.com/articles/how-to-get-from-theyre-to-theyre/
stackoverflow.com stackoverflow.com

"â€™" showing on page instead of " ' "

2
1. radera 09 May 2022
  
  in Public
  
  This only forces the client which encoding to use to interpret and display the characters. But the actual problem is that you're already sending â€™ (encoded in UTF-8) to the client instead of ’. The client is correctly displaying â€™ using the UTF-8 encoding. If the client was misinstructed to use, for example ISO-8859-1, you would likely have seen Ã¢â¬â¢ instead.
  
  HERE IT IS!
  
  #utf-8
2. radera 09 May 2022
  
  in Public
  
  This answer is not useful Show activity on this post. So what's the problem, It's a ’ (RIGHT SINGLE QUOTATION MARK - U+2019) character which is being decoded as CP-1252 instead of UTF-8. If you check the encodings table, then you see that this character is in UTF-8 composed of bytes 0xE2, 0x80 and 0x99. If you check the CP-1252 code page layout, then you'll see that each of those bytes stand for the individual characters â, € and ™.
  
  HERE IT IS!
  
  #utf-8
Visit annotations in context

Tags

#utf-8

Annotators

radera

URL

stackoverflow.com/questions/2477452/â€-showing-on-page-instead-of
utf8-chartable.de utf8-chartable.de

Unicode/UTF-8-character table

2
1. radera 08 May 2022
  
  in Public
  
  One Latin-1 char per byte
  
  activar para ver secuencia 2 bytes
  
  #utf-8
2. radera 08 May 2022
  
  in Public
  
  sequences 2 bytes for no-ascii
  
  #utf-8
Visit annotations in context

Tags

#utf-8

Annotators

radera

URL

utf8-chartable.de/unicode-utf8-table.pl
stackoverflow.com stackoverflow.com

How to simply list all files of a folder using dir to text with UTF-8 encoding?

2
1. radera 02 May 2022
  
  in Public
  
  This works for me: PowerShell -Command "TREE /F | Out-File output.txt -Encoding utf8"
  
  WITH POWERSHELL
  
  #utf-8
2. radera 02 May 2022
  
  in Public
  
  You should add this command chcp 65001 before dir command to change code page to UTF-8 @echo off CHCP 65001>nul dir>1.txt Further reading about CHCP command
  
  DIR NAMES IN UTF-8
  
  #utf-8
Visit annotations in context

Tags

#utf-8

Annotators

radera

URL

stackoverflow.com/questions/65640943/how-to-simply-list-all-files-of-a-folder-using-dir-to-text-with-utf-8-encoding
docs.actian.com docs.actian.com

DataConnect

1
1. radera 02 May 2022
  
  in Public
  
  hex: 93 y 94
  
  #utf-8
Visit annotations in context

Tags

#utf-8

Annotators

radera

URL

docs.actian.com/dataconnect/11.4/index.html
www.cl.cam.ac.uk www.cl.cam.ac.uk

ASCII and Unicode quotation marks

1
1. radera 02 May 2022
  
  in Public
  
  Most European keyboards have keycap labels for the apostrophe and both accents. These have always looked like in the ISO and Unicode standards. The photo below shows the relevant keys highlighted on a standard German PC keyboard, which has the acute/grave accent key left and the number-sign/apostrophe key below the backspace key:
  
  unicode!
  
  #utf-8
Visit annotations in context

Tags

#utf-8

Annotators

radera

URL

cl.cam.ac.uk/~mgk25/ucs/quotes.html
Sep 2021
s3.us-central-1.wasabisys.com s3.us-central-1.wasabisys.com

Scan Jan 13, 2021.pdf

3
1. ltraxel 13 Sep 2021
  
  in Public
  
  paradigmatic
  
  typical answer to something
  
  https://www.google.com/search?q=paradigmatic+definition&rlz=1C1CHZN_enUS966US966&oq=paradigmatic&aqs=chrome.1.69i59j0i512l9.4463j1j9&sourceid=chrome&ie=UTF-8
2. ltraxel 13 Sep 2021
  
  in Public
  
  elucidatory
  
  give a clarifying expression
  
  https://www.google.com/search?q=elucidative&rlz=1C1CHZN_enUS966US966&oq=elucidative&aqs=chrome..69i57j0i512l5j0i10i512j0i512j0i10i30l2.8526j0j4&sourceid=chrome&ie=UTF-8
3. ltraxel 13 Sep 2021
  
  in Public
  
  elucidation.
  
  another word for clarification
  
  https://www.google.com/search?q=elucidation&rlz=1C1CHZN_enUS966US966&oq=elucidation&aqs=chrome.0.69i59i433i512j0i131i433i512j0i433i512j0i512l5j0i10i512j0i512.2598j0j7&sourceid=chrome&ie=UTF-8
Visit annotations in context

Tags

https://www.google.com/search?q=elucidative&rlz=1C1CHZN_enUS966US966&oq=elucidative&aqs=chrome..69i57j0i512l5j0i10i512j0i512j0i10i30l2.8526j0j4&sourceid=chrome&ie=UTF-8

https://www.google.com/search?q=elucidation&rlz=1C1CHZN_enUS966US966&oq=elucidation&aqs=chrome.0.69i59i433i512j0i131i433i512j0i433i512j0i512l5j0i10i512j0i512.2598j0j7&sourceid=chrome&ie=UTF-8

https://www.google.com/search?q=paradigmatic+definition&rlz=1C1CHZN_enUS966US966&oq=paradigmatic&aqs=chrome.1.69i59j0i512l9.4463j1j9&sourceid=chrome&ie=UTF-8

Annotators

ltraxel

URL

s3.us-central-1.wasabisys.com/docdrop-annotations-prod/Definition-by-Strausberg-and-Gardnier-204eq_ocr-2--w3t5w.pdf

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL