0001022: error indicator for encoding errors in fgetwc(3)

Notes
(0003027) geoffclare (manager) 2016-01-21 12:04	C99 says in 7.19.1 that the error indicator for a stream "records whether a read/write error has occurred". To me this implies that if only an encoding error (EILSEQ) has occurred, the error indicator must not be set otherwise it would wrongly be indicating that a read/write error has occurred. I believe this is a genuine conflict between C99 and POSIX and needs to be raised with the C committee.

(0003028) shware_systems (reporter) 2016-01-21 13:44	It appears some implementations did set ferror() if errno changed for any reason, so this was made part of POSIX at some point, even if technically "wrong" for the above reason. As the C standard doesn't preclude ferror() also being set, since it leaves implementation-defined what constitutes a "read/write error", this is less a conflict than a case where no conformance distinction can be made, but IMO it should be marked as CX since the text differs.

(0003029) shware_systems (reporter) 2016-01-22 01:06	I think there is another missing CX item that should be discussed. I feel it should be explicit as a portability matter: If the error indicator for the stream is already set when a call to the interface is made, it shall set errno to EINVAL and return before attempting any internal operation that might, as a side effect, set the error indicator if it was clear. While the C standard leaving it to applications to call clearerr() themselves may be robust enough for a single-threaded, single-process environment, in a multi-threaded environment one thread (or process) may get focus and attempt to call a wide I/O interface before the code that caused the indicator to be set in another thread has a chance to call clearerr(), if it uses the call at all. As a potential race condition I think this applies to many other interfaces too, not just the wide char ones.

(0003030) schwarze (reporter) 2016-01-22 02:02	Re: 0003027 Thanks for your feedback, geoffclare. To make really sure that i understand correctly what you are saying: Your intention is that POSIX should not be changed, that all the operating systems i listed should continue what they are already doing, and that the C standard should be amended to also require setting the error flag in case of an encoding error. Right? I would welcome that solution. Re: 0003029 I don't know how reasonable it is to use non-atomic I/O operations on the same FILE object in two threads if you want to recover from I/O errors. If you are forced to do that, i think you have to write your own locking code anyway: Acquire the dedicated lock you define, call the I/O function, check ferror() and/or feof(), call clearerr() if needed, release the lock. Otherwise, you get a race in the other direction, too: I/O call succeeds in thread A, I/O call fails in thread B, A calls ferror(), boom. I don't see how any change to the definition of the interfaces might cure those races in either direction. Anyway, that's not related to the original issue, i don't think it's a wise move to conflate separate topics in the same bugtracking ticket.
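
The locking sequence described in note 0003030 (acquire a dedicated lock, call the I/O function, check ferror() and/or feof(), call clearerr() if needed, release the lock) might look roughly like the sketch below. The names stream_lock and locked_fgetwc are hypothetical and chosen only for illustration, and a single process-wide pthread mutex stands in for whatever "dedicated lock" an application defines.

```c
#include <pthread.h>
#include <stdio.h>
#include <wchar.h>

/* Hypothetical application-defined lock guarding one shared FILE object. */
static pthread_mutex_t stream_lock = PTHREAD_MUTEX_INITIALIZER;

/*
 * Read one wide character while holding the lock, so that the indicator
 * checks and clearerr() cannot interleave with another thread's I/O call
 * made through this same wrapper.
 * On return, *had_error and *at_eof hold the indicator state observed
 * right after the fgetwc() call.
 */
static wint_t locked_fgetwc(FILE *fp, int *had_error, int *at_eof)
{
    pthread_mutex_lock(&stream_lock);

    wint_t wc = fgetwc(fp);
    *had_error = ferror(fp) != 0;
    *at_eof = feof(fp) != 0;

    /* Only clear on error; note that clearerr() resets both indicators. */
    if (*had_error)
        clearerr(fp);

    pthread_mutex_unlock(&stream_lock);
    return wc;
}
```

This only helps if every thread that touches the stream goes through the same wrapper; a direct call to fgetwc() elsewhere would reintroduce the race.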

(0004383) nick (manager) 2019-05-02 13:03 edited on: 2019-09-05 15:42	WG14 have added normative text to C17: Change 7.29.3.1p3 to require the error indicator to be set in this case: If a read error occurs, the error indicator for the stream is set and the fgetwc function returns WEOF. If an encoding error occurs (including too few bytes), the error indicator for the stream is set and the value of the macro EILSEQ is stored in errno and the fgetwc function returns WEOF.
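
As an illustration of the C17 wording quoted in note 0004383, the sketch below shows one way a caller might tell end-of-file, an encoding error, and a read error apart after fgetwc() returns WEOF. The enum wread_status and the function read_wide_char are hypothetical names, not part of any standard. The errno check is done first so that the classification does not depend on whether a given implementation sets the error indicator for encoding errors.

```c
#include <errno.h>
#include <stdio.h>
#include <wchar.h>

/* Hypothetical result codes for this sketch. */
enum wread_status {
    WREAD_OK,
    WREAD_EOF,
    WREAD_ENCODING_ERROR,
    WREAD_IO_ERROR
};

/* Read one wide character from fp and classify the outcome. */
static enum wread_status read_wide_char(FILE *fp, wint_t *out)
{
    errno = 0;                      /* so a stale EILSEQ is not misread */
    wint_t wc = fgetwc(fp);

    if (wc != WEOF) {
        *out = wc;
        return WREAD_OK;
    }
    if (errno == EILSEQ)            /* encoding error, per the quoted text */
        return WREAD_ENCODING_ERROR;
    if (ferror(fp))                 /* error indicator set, not EILSEQ */
        return WREAD_IO_ERROR;
    return WREAD_EOF;               /* WEOF with no error reported */
}
```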

(0004554) geoffclare (manager) 2019-09-05 15:58	In view of the addition in C17, we believe no change is needed in POSIX and this bug can be closed.

(0006236) eblake (manager) 2023-03-27 15:24	On the musl list, https://www.openwall.com/lists/musl/2023/03/20/8 points out that C17 addressed fgetwc(), but not fputwc(). As a followup, https://www.openwall.com/lists/musl/2023/03/20/10 requests that the Austin Group submit a ballot comment against C23 to ensure that C23 matches POSIX for both fgetwc() and fputwc().

Issue History
Date Modified	Username	Field	Change
2016-01-10 20:47	schwarze	New Issue
2016-01-10 20:47	schwarze	Name	=> Ingo Schwarze
2016-01-10 20:47	schwarze	Organization	=> OpenBSD
2016-01-10 20:47	schwarze	Section	=> fgetwc(3)
2016-01-10 20:47	schwarze	Page Number	=> 0
2016-01-10 20:47	schwarze	Line Number	=> 0
2016-01-21 12:00	geoffclare	Project	2008-TC2 => 1003.1(2013)/Issue7+TC1
2016-01-21 12:04	geoffclare	Note Added: 0003027
2016-01-21 13:44	shware_systems	Note Added: 0003028
2016-01-22 01:06	shware_systems	Note Added: 0003029
2016-01-22 02:02	schwarze	Note Added: 0003030
2017-11-21 16:23	geoffclare	Relationship added	has duplicate 0001170
2019-02-04 16:31	nick	Tag Attached: C11
2019-02-04 16:31	nick	Tag Attached: c99
2019-05-02 13:03	nick	Note Added: 0004383
2019-09-05 15:42	nick	Note Edited: 0004383
2019-09-05 15:58	geoffclare	Interp Status	=> ---
2019-09-05 15:58	geoffclare	Note Added: 0004554
2019-09-05 15:58	geoffclare	Status	New => Closed
2019-09-05 15:58	geoffclare	Resolution	Open => Rejected
2023-03-27 15:24	eblake	Note Added: 0006236
2023-07-24 09:25	geoffclare	Relationship added	related to 0001769
