39 Matching Annotations
  1. Mar 2024
    1. $@ charCode

      El carácter ($) se utiliza para denotar un carácter literal en Pharo, y (@) es el carácter específico que sigue al signo ($). El carácter charCode es un mensaje que se puede enviar a un carácter para obtener su valor numérico de código de carácter Unicode.

  2. Jan 2024
  3. Dec 2023
  4. Aug 2023
  5. Apr 2023
  6. Mar 2023
  7. Dec 2022
  8. Nov 2022
    1. The btoa() function takes a JavaScript string as a parameter. In JavaScript strings are represented using the UTF-16 character encoding: in this encoding, strings are represented as a sequence of 16-bit (2 byte) units. Every ASCII character fits into the first byte of one of these units, but many other characters don't. Base64, by design, expects binary data as its input. In terms of JavaScript strings, this means strings in which each character occupies only one byte. So if you pass a string into btoa() containing characters that occupy more than one byte, you will get an error, because this is not considered binary data:
    1. The replacement character � (often displayed as a black rhombus with a white question mark) is a symbol found in the Unicode standard at code point U+FFFD in the Specials table. It is used to indicate problems when a system is unable to render a stream of data to a correct symbol.[4] It is usually seen when the data is invalid and does not match any character:
    1. The Unicode replacement character sounds promising when reading about it on Wikipedia: It is used to indicate problems when a system is not able to render a stream of data to a correct symbol. It is most commonly seen when a font does not contain a character, but is also seen when the data is invalid and does not match any character
  9. Oct 2022
  10. Aug 2022
  11. Apr 2022
  12. Mar 2022
  13. Dec 2021
    1. Here are the single characters which can be normalised down to a valid TLD. They're mostly country codes, but there are a few interesting exceptions:

      ㏕ - US Military
      ℡ - .tel registry
      № - Norway
      ㍳ - Australia
      ㍷ - Dominica
      ㎀ - Panama
      ㎁ - Namibia
      ㎃ - Morocco
      ㎊ - French Polynesia
      ㎋ - Norfolk Island
      ㎏ - Kyrgyzstan
      ㎖ - Mali
      ㎙ - Federated States of Micronesia
      fi - Finland
      ㎜ - Myanmar
      ㎝ - Cameroon
      ㎞ & ㏎ - Comoros
      ㎰ - Palestine
      ㎳ - Montserrat
      ㎷ & ㎹ - Republic of Maldives.
      ㎺ - Palau
      ㎽ & ㎿ - Malawi
      ㏄ - Cocos (Keeling) Islands
      ㏅ - Democratic Republic of Congo
      ㏉ - Guyana
      ㏗ - Philippines
      ㏘ - Saint Pierre and Miquelon
      ㏚ - Puerto Rico
      ㏛ - Suriname
      ㏜ - El Salvador
      ℠ - San Marino
      ™ - Turkmenistan
      st & ſt - São Tomé and Príncipe
      ㎇ - Great Britain (Obsolete)
      ß - South Sudan (Not available)
      ㏌ - India and Indiana (subdomain of .us)
      Ⅵ & ⅵ - Virgin Islands and Virginia (subdomain of .us)
      fl - Florida (subdomain of .us)
      ㎚ - New Mexico (subdomain of .us)
      ㎵ - Nevada (subdomain of .us)
      ㍵ - As part of .ovh
      
    2. Nestling among the "Letterlike Symbols" are two curious entries. Both of these are single characters:

      • Telephone symbol - ℡
      • Numero Sign - №

      What's interesting is both .tel and .no are Top-Level-Domains (TLD) on the Domain Name System (DNS).

      So my contact site - https://edent.tel/ - can be written as - https://edent.℡/

      And the Norwegian domain name registry NORID can be accessed at https://www.norid.№/

      Copy and paste those links - they work in any browser!

  14. Jun 2021
  15. Apr 2021
  16. Feb 2021
  17. Sep 2020
    1. The value of dotAll is a Boolean and true if the "s" flag was used; otherwise, false. The "s" flag indicates that the dot special character (".") should additionally match the following line terminator ("newline") characters in a string, which it would not match otherwise: U+000A LINE FEED (LF) ("\n") U+000D CARRIAGE RETURN (CR) ("\r") U+2028 LINE SEPARATOR U+2029 PARAGRAPH SEPARATOR This effectively means the dot will match any character on the Unicode Basic Multilingual Plane (BMP). To allow it to match astral characters, the "u" (unicode) flag should be used. Using both flags in conjunction allows the dot to match any Unicode character, without exceptions.
  18. Jun 2020
  19. Feb 2020
  20. Oct 2018
  21. Sep 2018
  22. Sep 2015
  23. Apr 2015
    1. This part of the Character Model for the World Wide Web covers string matching—the process by which a specification or implementation defines whether two string values are the same or different from one another.