[Bug 266001] uniq says it's affected by LC_COLLATE, must not be according to POSIX
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 266001] uniq says it's affected by LC_COLLATE, must not be according to POSIX"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 266001] uniq says it's affected by LC_COLLATE, must not be according to POSIX"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 266001] uniq says it's affected by LC_COLLATE, must not be according to POSIX"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 266001] uniq says it's affected by LC_COLLATE, must not be according to POSIX"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 266001] uniq says it's affected by LC_COLLATE, must not be according to POSIX"
- Go to: [ bottom of page ] [ top of archives ] [ this month ]
Date: Tue, 23 Aug 2022 11:51:46 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=266001 Bug ID: 266001 Summary: uniq says it's affected by LC_COLLATE, must not be according to POSIX Product: Base System Version: CURRENT Hardware: Any OS: Any Status: New Severity: Affects Many People Priority: --- Component: bin Assignee: bugs@FreeBSD.org Reporter: nabijaczleweli@nabijaczleweli.xyz The manual says: -- >8 -- ENVIRONMENT The LANG, LC_ALL, LC_COLLATE and LC_CTYPE environment variables affect the execution of uniq as described in environ(7). -- >8 -- This, presumably, means that uniq compares lines with strcoll(3) or equivalent. Compare Issue 7, uniq, DESCRIPTION (https://pubs.opengroup.org/onlinepubs/9699919799/utilities/uniq.html): > The second and succeeding copies of repeated adjacent input lines shall not be written. And APPLICATION USAGE: > To remove duplicate lines based on whether they collate equally instead of whether they are identical, applications should use: > sort -u Indeed, Issue 8 (Draft 2.1), following Bug 1070 (https://www.austingroupbugs.net/view.php?id=1070), explicitly clarifies this: > If the collating sequence of the current locale does not have a total ordering of all characters, the behavior of sort | uniq differs from sort -u, as uniq treats lines as duplicates only if they are identical, whereas sort -u treats lines as duplicates if they collate equally. -- You are receiving this mail because: You are the assignee for the bug.