[Bug localedata/3140] New: SHIFT_JISX0213 incorrectly marked as __stateful

classic Classic list List threaded Threaded
6 messages Options
Reply | Threaded
Open this post in threaded view
|

[Bug localedata/3140] New: SHIFT_JISX0213 incorrectly marked as __stateful

glaubitz at physik dot fu-berlin.de
SHIFT_JISX0213 is incorrectly marked as __stateful while the whole point of it
is that it involves no shift states. This causes mbtowc() to incorrectly report
that it in fact does involve shift states.

--
           Summary: SHIFT_JISX0213 incorrectly marked as __stateful
           Product: glibc
           Version: 2.4
            Status: NEW
          Severity: normal
          Priority: P2
         Component: localedata
        AssignedTo: libc-locales at sources dot redhat dot com
        ReportedBy: pasky at suse dot cz
                CC: glibc-bugs at sources dot redhat dot com


http://sourceware.org/bugzilla/show_bug.cgi?id=3140

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
Reply | Threaded
Open this post in threaded view
|

[Bug localedata/3140] SHIFT_JISX0213 incorrectly marked as __stateful

glaubitz at physik dot fu-berlin.de

------- Additional Comments From pasky at suse dot cz  2006-08-27 23:40 -------
Created an attachment (id=1256)
 --> (http://sourceware.org/bugzilla/attachment.cgi?id=1256&action=view)
Proposed patch to fix the issue


--


http://sourceware.org/bugzilla/show_bug.cgi?id=3140

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
Reply | Threaded
Open this post in threaded view
|

[Bug localedata/3140] SHIFT_JISX0213 incorrectly marked as __stateful

glaubitz at physik dot fu-berlin.de
In reply to this post by glaubitz at physik dot fu-berlin.de

------- Additional Comments From bruno at clisp dot org  2006-09-02 14:09 -------
The SHIFT_JISX0213 converter is stateful because the characters listed
in __jisx0213_to_ucs_combining are single characters in SHIFT_JISX0213
but pairs of Unicode characters in Unicode space. Since glibc has chosen
that its wide character representation is identical to Unicode, and since
mbrtowc() produces only 1 wide character at a time, we cannot accomodate
a stateless converter for SHIFT_JISX0213 in glibc.

A workaround would be to define private area codepoints for each of these
25 characters, but this would lead to hassles elsewhere in the applications
that use glibc.

By the way, the SHIFT_JISX0213 encoding is not US-ASCII compatible, therefore
a locale that would use this encoding would not be ISO C 99 compliant anyway.

For these two reasons, I'd say: forget about locales with SHIFT_JISX0213
encoding.

> the whole point of it is that it involves no shift states

Where do you got this understanding? References?



--


http://sourceware.org/bugzilla/show_bug.cgi?id=3140

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
Reply | Threaded
Open this post in threaded view
|

[Bug localedata/3140] SHIFT_JISX0213 incorrectly marked as __stateful

glaubitz at physik dot fu-berlin.de
In reply to this post by glaubitz at physik dot fu-berlin.de


--
           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |bruno at clisp dot org


http://sourceware.org/bugzilla/show_bug.cgi?id=3140

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
Reply | Threaded
Open this post in threaded view
|

[Bug localedata/3140] SHIFT_JISX0213 incorrectly marked as __stateful

glaubitz at physik dot fu-berlin.de
In reply to this post by glaubitz at physik dot fu-berlin.de

------- Additional Comments From drepper at redhat dot com  2006-09-09 16:29 -------
Wrong claim.  Bruno explained it correctly.  This encoding is just another
terrible development and should be ignored completely.

--
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |INVALID


http://sourceware.org/bugzilla/show_bug.cgi?id=3140

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
Reply | Threaded
Open this post in threaded view
|

[Bug localedata/3140] SHIFT_JISX0213 incorrectly marked as __stateful

glaubitz at physik dot fu-berlin.de
In reply to this post by glaubitz at physik dot fu-berlin.de

------- Additional Comments From pasky at suse dot cz  2006-09-11 03:38 -------
Sorry, you are right and my patch made no sense in this light. I was thinking of
shift states only in terms of the original encoding, not in terms of the
transformation function.

--


http://sourceware.org/bugzilla/show_bug.cgi?id=3140

------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.