|Anonymous | Login||2020-07-06 08:02 UTC|
|Main | My View | View Issues | Change Log | Docs|
|Viewing Issue Simple Details|
|ID||Category||Severity||Type||Date Submitted||Last Update|
|0001356||[Issue 8 drafts] Base Definitions and Headers||Editorial||Enhancement Request||2020-06-29 12:47||2020-06-29 12:47|
|Status||New||Product Version||Draft 1|
|Final Accepted Text|
|Summary||0001356: Our definition of character disagrees with that of Unicode.|
In Draft 1, a character is being defined as:
> A sequence of one or more bytes representing
> a single graphic symbol or control code.
This definition falls apart when applied in the context of e.g. Arabic text.
The Unicode standard (version 13, page 15) says:
> The Unicode Standard draws a distinction
> between characters and glyphs. Characters are
> the abstract representations of the
> smallest components of written language
> that have semantic value.
Considering Unicode has a radically different goal than POSIX,
I propose the following new definition for consideration
(as the current wording may be intended, even not on purpose)
> A sequence of bytes that is considered an
> individual unit in text processing.
1) A sequence of bytes: remains the same as our original definition.
2) individual unit: graphic symbols in Arabic text are composed of parts that're sometimes stacked on top of each other. Older iOS that didn't take this into account caused the iPhone Arabic Glitch.
3) text processing: this can refer to terminal (and emulators) processing control sequences, and `wc -m` counting characters. Defining characters in terms of text processing lifts the burdon of relying on the concept of "code point" externally defined in the Unicode standard.
|Desired Action||Consider applying the the proposed new definition.|
|Tags||No tags attached.|
|There are no notes attached to this issue.|
|2020-06-29 12:47||dannyniu||New Issue|
|2020-06-29 12:47||dannyniu||Name||=> DannyNiu/NJF|
|2020-06-29 12:47||dannyniu||Section||=> 3.57 Character|
|2020-06-29 12:47||dannyniu||Page Number||=> 37|
|2020-06-29 12:47||dannyniu||Line Number||=> 1241|
|Mantis 1.1.6[^] Copyright © 2000 - 2008 Mantis Group|