Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://mastodon.zunda.ninja/users/zundan/statuses/111098751591856826">zunda (zundan@mastodon.zunda.ninja)'s status on Thursday, 21-Sep-2023 02:53:42 JST</a><a href="https://mastodon.zunda.ninja/@zundan" title="zundan@mastodon.zunda.ninja"><img src="https://gnusocial.jp/avatar/285-48-20230130022456.gif" width="48" height="48" alt="zunda" style="position: absolute; left: 0; top: 0;">zunda</a><div><a href="https://best-friends.chat/@tadd/111098624854728379" rel="in-reply-to">in reply to</a><ul><li><li><a href="https://gnusocial.jp/user/4845" title="mstdnoyster@gnusocial.jp">かき@GNUsocialJP</a></li></ul></div></section><article><p><a href="https://best-friends.chat/@tadd">@tadd</a> たぶん横から失礼なんですけど、UTF-8では１バイト目にその文字のバイト数もエンコードされてるので文字数を数えるには残りのバイトを読む必要はありません。かっこいい！</p><p>UTF-8 - Wikipedia <a href="https://en.wikipedia.org/wiki/UTF-8#Encoding" rel="nofollow noreferrer">https://en.wikipedia.org/wiki/UTF-8#Encoding</a></p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/2082071#notice-4088301">In conversation</a><time datetime="2023-09-21T02:53:42+09:00" title="Thursday, 21-Sep-2023 02:53:42 JST">about a year ago</time> <span>from <span><a href="https://mastodon.zunda.ninja/@zundan/111098751591856826" rel="external" title="Sent from mastodon.zunda.ninja via ActivityPub">mastodon.zunda.ninja</a></span></span><a href="https://mastodon.zunda.ninja/@zundan/111098751591856826">permalink</a><h4>Attachments</h4><ol><li><article><header><div>Domain not in remote thumbnail source whitelist: upload.wikimedia.org</div><h5><a href="https://en.wikipedia.org/wiki/UTF-8#Encoding">UTF-8</a></h5><div></div></header><div>UTF-8 is a variable-length character encoding standard used for electronic communication. 
Defined by the Unicode Standard, the name is derived from Unicode (or Universal Coded Character Set) Transformation Format – 8-bit. UTF-8 is capable of encoding all 1,112,064 valid character code points in Unicode using one to four one-byte (8-bit) code units. Code points with lower numerical values, which tend to occur more frequently, are encoded using fewer bytes. It was designed for backward compatibility with ASCII: the first 128 characters of Unicode, which correspond one-to-one with ASCII, are encoded using a single byte with the same binary value as ASCII, so that valid ASCII text is valid UTF-8-encoded Unicode as well.
UTF-8 was designed as a superior alternative to UTF-1, a proposed variable-length encoding with partial ASCII compatibility which lacked some features including self-synchronization and fully ASCII-compatible handling of characters such as slashes. Ken Thompson and Rob Pike produced the first implementation for the Plan 9 operating system in September 1992. This led to its adoption by X/Open as its specification for FSS-UTF...</div><footer></footer></article></li></ol></footer></blockquote>

Corresponding Notice

zunda (zundan@mastodon.zunda.ninja)'s status on Thursday, 21-Sep-2023 02:53:42 JST
in reply to
- 斎藤ただし
- かき@GNUsocialJP
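@tadd Sorry to jump in from the side, but in UTF-8 the first byte of each character also encodes how many bytes that character occupies, so you don't need to read the remaining bytes to count the characters. Cool!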
UTF-8 - Wikipedia https://en.wikipedia.org/wiki/UTF-8#Encoding
In conversation about a year ago from mastodon.zunda.ninja (permalink)
Attachments
1. UTF-8
  UTF-8 is a variable-length character encoding standard used for electronic communication. Defined by the Unicode Standard, the name is derived from Unicode (or Universal Coded Character Set) Transformation Format – 8-bit. UTF-8 is capable of encoding all 1,112,064 valid character code points in Unicode using one to four one-byte (8-bit) code units. Code points with lower numerical values, which tend to occur more frequently, are encoded using fewer bytes. It was designed for backward compatibility with ASCII: the first 128 characters of Unicode, which correspond one-to-one with ASCII, are encoded using a single byte with the same binary value as ASCII, so that valid ASCII text is valid UTF-8-encoded Unicode as well. UTF-8 was designed as a superior alternative to UTF-1, a proposed variable-length encoding with partial ASCII compatibility which lacked some features including self-synchronization and fully ASCII-compatible handling of characters such as slashes. Ken Thompson and Rob Pike produced the first implementation for the Plan 9 operating system in September 1992. This led to its adoption by X/Open as its specification for FSS-UTF...
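
The point made in the reply above (the first byte of a UTF-8 sequence tells you how long the sequence is, so the remaining bytes can be skipped when counting) can be sketched roughly as follows. This is only an illustration, not code from the thread: the function names `utf8_sequence_length` and `count_code_points` are made up for this example, and the sketch assumes the input is already valid UTF-8. It counts code points, which is not always the same as user-perceived characters.

```python
def utf8_sequence_length(lead_byte: int) -> int:
    """Return the length in bytes of the UTF-8 sequence starting with lead_byte.

    Hypothetical helper for illustration; assumes well-formed UTF-8 input.
    """
    if lead_byte & 0x80 == 0x00:   # 0xxxxxxx: single-byte (ASCII)
        return 1
    if lead_byte & 0xE0 == 0xC0:   # 110xxxxx: two-byte sequence
        return 2
    if lead_byte & 0xF0 == 0xE0:   # 1110xxxx: three-byte sequence
        return 3
    if lead_byte & 0xF8 == 0xF0:   # 11110xxx: four-byte sequence
        return 4
    # 10xxxxxx is a continuation byte; anything else is not a valid lead byte.
    raise ValueError(f"not a UTF-8 lead byte: {lead_byte:#04x}")


def count_code_points(data: bytes) -> int:
    """Count code points by inspecting lead bytes only and skipping the rest."""
    count = 0
    i = 0
    while i < len(data):
        # Jump straight to the next lead byte without reading continuation bytes.
        i += utf8_sequence_length(data[i])
        count += 1
    return count


if __name__ == "__main__":
    sample = "かっこいい！".encode("utf-8")
    print(len(sample), count_code_points(sample))
```

In everyday Python, `len(data.decode("utf-8"))` gives the same count; the loop is written out only to show that the continuation bytes never need to be examined.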
