[Bug regex/19376] New: regcomp.c needs to be upgraded to GNU Grep's one

macro@linux-mips.org
https://sourceware.org/bugzilla/show_bug.cgi?id=19376

            Bug ID: 19376
           Summary: regcomp.c needs to be upgraded to GNU Grep's one
           Product: glibc
           Version: 2.22
            Status: NEW
          Severity: normal
          Priority: P2
         Component: regex
          Assignee: unassigned at sourceware dot org
          Reporter: t.rus76 at ya dot ru
                CC: drepper.fsp at gmail dot com
  Target Milestone: ---

Symptom: GNU Grep does not handle Syriac characters (U+0700 – U+074F) correctly

$ echo 'ܫܠܡܐ' > peace
$ egrep '\<[ܐ-ܬ]' peace
grep: Invalid collation character
$ awk /'\<[ܐ-ܬ]'/ peace
ܫܠܡܐ

However, when grep is built with ./configure --with-included-regex,
it works just fine and there is no REG_ECOLLATE error:

$ echo ܫܠܡܐ | src/egrep [ܫ-ܬ]
ܫܠܡܐ
$ echo ܫܠܡܐ | src/egrep [ܒ-ܓ]
$

This is because GNU Grep contains an improved version of regcomp.
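
To check the C library's regcomp() directly, without going through grep, a small helper along the following lines may be useful. This is only a sketch: the name try_regcomp and its parameters are made up for illustration and are not part of glibc or grep. It sets the locale, compiles the pattern as an extended regex, and hands back regcomp()'s status together with the regerror() text.

```c
#include <locale.h>
#include <regex.h>
#include <stddef.h>

/* Hypothetical helper (not part of glibc or grep).
 * Compiles `pattern` as a POSIX extended regex under the locale named
 * by `locale_name` ("" means: take the locale from the environment).
 * Returns regcomp()'s status: 0 on success, otherwise an error code
 * such as REG_ECOLLATE. On failure the regerror() message is copied
 * into `msg`; on success `msg` is set to the empty string. */
int try_regcomp(const char *locale_name, const char *pattern,
                char *msg, size_t msg_len)
{
    regex_t re;
    int rc;

    /* If the requested locale is unavailable, setlocale() returns NULL
     * and the previously active locale stays in effect. */
    setlocale(LC_ALL, locale_name);

    rc = regcomp(&re, pattern, REG_EXTENDED | REG_NOSUB);
    if (rc != 0) {
        regerror(rc, &re, msg, msg_len);
    } else {
        if (msg_len > 0)
            msg[0] = '\0';
        regfree(&re);
    }
    return rc;
}
```

Called from a UTF-8 locale as try_regcomp("", "\\<[ܐ-ܬ]", buf, sizeof buf), the helper would be expected to return REG_ECOLLATE on an affected glibc, matching the grep output above; that expectation comes from this report and has not been checked against other versions.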

The bug was found here:
http://forum.rosalab.ru/viewtopic.php?f=53&t=6219&p=54747 (in Russian)

It has also been tested and confirmed on Gentoo (both glibc and grep are 2.22).


I expect there are other bugs that could be fixed with this upgrade.
