From: Xin LI
Date: Mon, 17 Apr 2023 14:24:59 -0700
Subject: Re: find(1): I18N gone wild ?
To: Poul-Henning Kamp
Cc: current@freebsd.org
References: <202304172106.33HL6RUX051407@critter.freebsd.dk>
In-Reply-To: <202304172106.33HL6RUX051407@critter.freebsd.dk>
List-Id: Discussions about the use of FreeBSD-current
List-Archive: https://lists.freebsd.org/archives/freebsd-current

This is expected behavior (in en_US.UTF-8 the ordering is AaBb, not ABab).
You might want to set LC_COLLATE to C if C behavior is desirable.

On Mon, Apr 17, 2023 at 2:06 PM Poul-Henning Kamp <phk@phk.freebsd.dk> wrote:

> This surprised me:
>
>         # mkdir /tmp/P
>         # cd /tmp/P
>         # touch FOO
>         # touch bar
>         # env LANG=C.UTF-8 find . -name '[A-Z]*' -print
>         ./FOO
>         # env LANG=en_US.UTF-8 find . -name '[A-Z]*' -print
>         ./FOO
>         ./bar
>
> Really ?!
>
> --
> Poul-Henning Kamp       | UNIX since Zilog Zeus 3.20
> phk@FreeBSD.ORG         | TCP/IP since RFC 956
> FreeBSD committer       | BSD since 4.3-tahoe
> Never attribute to malice what can adequately be explained by incompetence.
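
For anyone who wants to try the LC_COLLATE suggestion from the reply, here is a minimal sketch. It assumes a POSIX sh and uses a throwaway mktemp directory instead of /tmp/P; the variable names are made up for illustration. LC_ALL is unset inside the subshell because, when it is set, it takes precedence over LC_COLLATE.

```shell
#!/bin/sh
# Sketch only: recreate the two files from the quoted message in a
# scratch directory (hypothetical, stands in for /tmp/P).
dir=$(mktemp -d) || exit 1
cd "$dir" || exit 1
touch FOO bar

# Run find with C collation for this one command only.
# LC_ALL, if set, overrides LC_COLLATE, so clear it in the subshell first.
matches=$(unset LC_ALL; LC_COLLATE=C find . -name '[A-Z]*' -print)

# With C collation the range [A-Z] covers only the ASCII capitals,
# so ./bar is no longer matched.
printf '%s\n' "$matches"

# Clean up the scratch directory.
cd / && rm -rf "$dir"
```

Setting LC_COLLATE only for the one command leaves the rest of the session in its usual locale. Exporting it globally would also change the sort order used by other tools such as ls and sort.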