View Issue Details

IDProjectCategoryView StatusLast Update
00017931003.1(2016/18)/Issue7+TC2Front Matterpublic2025-04-29 15:40
Reportersteffen Assigned Togeoffclare  
PrioritynormalSeverityEditorialTypeEnhancement Request
Status AppliedResolutionAccepted As Marked 
Namesteffen
Organization
User Reference
Sectionmany
Page Numbermany
Line Numbermany
Interp Status---
Final Accepted Text0001793:0006646
Summary0001793: Streamline US-ASCII character set name
DescriptionIt is a bit over the top maybe, but the standard uses the alias "ASCII" to denote ANSI_X3.4-1968.
This is of course ok as "ASCII" is an official alias.
There are occurrences of "US ASCII", however, which is not: the real alias IANA alias name is "US-ASCII".
Desired Action- change occurrences of "US ASCII" with either "ASCII" or "US-ASCII".
- i personally would create a roff string/macro and use that everywhere instead of ASCII or "US ASCII".
- maybe describe somewhere in the preamble where many ANSI things are clarified that "ASCII" really means ANSI_X3.4-1968 of the IANA character set database.
Tagstc1-2024

Activities

shware_systems

2024-01-04 16:05

reporter   bugnote:0006613

The standard references ASCII in its pre-ISO-2021 versions of the 1960s, as overridden by the C standard. Afaik US-ASCII is a synonym for the current version of ISO-646, which defers to ISO-2021, not POSIX or C, for how control codes are expected to function. As such references to US-ASCII should probably be shortened to just ASCII.

steffen

2024-01-04 22:02

reporter   bugnote:0006615

Ooooh, careful with that axe -- greetings to the big apple!!!
I quote art(at)ietf.org and John C. Klensin, he will excuse this.
(And point out that _i_ do not follow him.)

Starting with IANA and copying the charsets list, as Martin
suggests, is almost certainly a good step at this stage (I'm
copying IANS on this note to avoid possible confusion).
However, to fill in the historical blank: the omission was
almost certainly deliberate (I vaguely recall being part of the
discussion). While I found the idea of "US-ASCII" obnoxious
(and still do -- after all, the "A" in ASCII does not stand for
"Antarctic" or a variety of other options), it was very common
practice in the 1990s to use the term "ASCII" to refer to a
variety of character sets that used the high order bit of an
octet (including ISO 8859 -- the omission of "-1" is deliberate)
and even used "ASCII" to refer to assorted national language
variations and national variations on ISO 646. So "ASCII" was
avoided because it was seriously ambiguous. There is evidence
that the confusion continues: the definition of IBM CP 367
("IBM367" and "cp367" in the IANA registry) [1] as a synonym for
"US-ASCII" is incorrect or at least sketchy because it lists
iso-646.irv:1983 as another synonym and ISO 646 IRV did not
match ASCII until 1991 [2] because, prior to that, the
"international currency symbol" was used instead of ASCII's
dollar sign. That isn't proof but does suggest that the
confusion we say in the 1990s has not completely disappeared.

So, coming back to Steffen's suggestion, I strongly recommend
against adding "ASCII" to the alias list without an explanation
and warning about the ambiguity. And the current registry
arrangement does not allow for that sort of note although maybe
it should.

shware_systems

2024-01-22 17:44

reporter   bugnote:0006636

Last edited: 2024-01-22 17:45

Re: 6615

Agreed, there still might be confusion. An alternative is changing all those instances to "PSX-ASCII", or "POSIX-ASCII", and add to XBD3 a definition that specifies how C overloaded <NUL> and <LF> as string terminator and end-of-line, along with making <SI>, <SO>, and <ESC> as having no defined function. This could even be a new IANA registration, I suspect, outside the range reserved to ISO-IR registrations.

steffen

2024-01-23 00:12

reporter   bugnote:0006637

(I have no opinion on that. I am in strong favour of readding the ASCII alias that got lost to the IANA database. This issue is because later ones may stumble over the ASCII as IANA is the "official thing", and it has its opinion that i do not understand.)

geoffclare

2024-01-29 16:21

manager   bugnote:0006646

Change the three occurrences of "US ASCII" to "ASCII".

Add a definition to XBD chapter 3:
3.xxx ASCII
The character encoding specified by the International Reference Version (IRV) of the ISO/IEC 646: 1991 standard.

Issue History

Date Modified Username Field Change
2023-12-19 02:02 steffen New Issue
2023-12-19 02:02 steffen Name => steffen
2023-12-19 02:02 steffen Section => many
2023-12-19 02:02 steffen Page Number => many
2023-12-19 02:02 steffen Line Number => many
2024-01-04 16:05 shware_systems Note Added: 0006613
2024-01-04 22:02 steffen Note Added: 0006615
2024-01-22 17:44 shware_systems Note Added: 0006636
2024-01-22 17:45 shware_systems Note Edited: 0006636
2024-01-23 00:12 steffen Note Added: 0006637
2024-01-29 16:21 geoffclare Note Added: 0006646
2024-01-29 16:23 geoffclare Interp Status => ---
2024-01-29 16:23 geoffclare Final Accepted Text => 0001793:0006646
2024-01-29 16:23 geoffclare Status New => Resolved
2024-01-29 16:23 geoffclare Resolution Open => Accepted As Marked
2024-01-29 16:23 geoffclare Tag Attached: tc1-2024
2025-04-29 15:40 geoffclare Assigned To => geoffclare
2025-04-29 15:40 geoffclare Status Resolved => Applied