regex, multibyte locales, and word boundaries

Yuri Pankov yuripv at yuripv.net
Fri Nov 23 16:03:10 UTC 2018


Hi,

We have the following note in the BUGS section of regcomp(3):

----------------------------------------------------------------------
Word-boundary matching does not work properly in multibyte locales.
----------------------------------------------------------------------

It was added ages ago along with multibyte support in our regex
implementation, though I can't think of any positive test case to see
that the problem is real, and eventually fix it.

I'm wondering if anyone has real life examples showing the bug?

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 488 bytes
Desc: OpenPGP digital signature
URL: <http://lists.freebsd.org/pipermail/freebsd-hackers/attachments/20181123/710e6b8c/attachment.sig>


More information about the freebsd-hackers mailing list