From nobody Mon Nov 13 02:00:46 2023 X-Original-To: freebsd-ports@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4STCMR4SjTz51JDJ for ; Mon, 13 Nov 2023 02:01:07 +0000 (UTC) (envelope-from marklmi@yahoo.com) Received: from sonic311-25.consmr.mail.gq1.yahoo.com (sonic311-25.consmr.mail.gq1.yahoo.com [98.137.65.206]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4STCMQ43RCz3TYr for ; Mon, 13 Nov 2023 02:01:06 +0000 (UTC) (envelope-from marklmi@yahoo.com) Authentication-Results: mx1.freebsd.org; dkim=pass header.d=yahoo.com header.s=s2048 header.b=IaA5HJiD; spf=pass (mx1.freebsd.org: domain of marklmi@yahoo.com designates 98.137.65.206 as permitted sender) smtp.mailfrom=marklmi@yahoo.com; dmarc=pass (policy=reject) header.from=yahoo.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s2048; t=1699840864; bh=J9f8H2f6/L+rCrs/SksCnPHW3495mTTDXkMTH0qikZA=; h=From:Subject:Date:References:To:In-Reply-To:From:Subject:Reply-To; b=IaA5HJiDGzmDwbHjl0GKbrwUEkfFrz2exNTxf3JbtJfTbZsn0mdRU8/gRTFkjMzCuCC8UBBbJHnWVSyQuH0sCmH3wu2x2yKHjGL35fKEbQX4ASIpFpw8BXy9BUYgzXI8/sIQ284pw8Yy8xsd8nFLB4lYnlk7W9cr4pFy9cux7pDyDcxXedkIqBlhiV5uE/3NCSP2N1R0rmAIx+aX8/P7eQTnr4+HBGIe/yKS5ukBsIns0uqHpV5FXnmnFyRFcp+AQ/sM1i1IP8W3mfMLlZ1H56EaK4UXXPVuEo+TWksyqKEbJBNKpN8VtiqXsKsZWqV4rNh/EsnPXmVkpZU9gSVciA== X-SONIC-DKIM-SIGN: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s2048; t=1699840864; bh=aP/VHBXlxO0McxIpRZW5uGbhwAIyJ8zkycksbjxKeL5=; h=X-Sonic-MF:From:Subject:Date:To:From:Subject; b=mHYLujATMzsKGs6tJwBKzdaCXnZ/YMOZpu9WVNjllLWnaaD8mDmmCZW93rjQlkd0Iz7DSz/rCfzkYhTFuHRH7J4bxjznCPb/GrEsWihA1SjTqPcjM+Vjd87GiJVGAchZP4g5tBvZU0ZZw5KcJSYV6JzXS3uchGy7FlqRVaFz4papUe5E70KZwt0fmdsjX3u+ij4kPVo5MiSVTcM9wqlQ4j362C51T0HWnu1f1s/MR9H8Y4kryBu1W5CeK8oYCaOzeODpnn48waF00dK8CeA9Vjbt878trfoCJM9fzjUoSnZC1DZnlBjXxjzPnyukFryZ/GAY9A2v1Aj3yl7oVzFbdg== X-YMail-OSG: th2iUAEVM1muQaoA62qWnMcHvcKlKs7jdAzF3xZQO7LCzs8DypZYO.w6x0L.mBh Gi9uigqGQMmV7J2j5Wxh7sxer9g0jSIuAHLPlNUo19Gbj3qx.j9zLzxf7ckcb7xlRnby3i7PK0nd mjIF15kFpfiWeuQGZiG4SFSJhzEOXpGSNbpEry8Eym_I3eyAFC9ExUnj0zkohXuBk8sU82dC2dJt aHhrPJqwB1a.fzPDCmo6RSdfvfWHgKxwBsvbHdtyTvOTRiddIBBQdUsOYPWlaxydTh_FkNB0OVGy DHZ4JWd09hklW1BAZcuxPDJeSuyxuLYHFAGzB_wZSnDX0OnTWoHpEsPAOao8ESMz.85wXLLKB4nT h2vEGLw0IaIO4ebEEacGMqYWND5omDiwexrgHjSc8DK2IYhJR5X3y4XgW_M9deEpuBUOPg_bnbLy xHPm_sT2_cykyws4UxfleNcKenJR19rUQ4VH55OEVPkiOnoJVtzyt6JW4bQOBqmWhBtCvpXiB6Cx 3zRYa3SqThsSdg2G7sNLvrepttluB44CM6g2SzsKY3c2RtiEr7VVtcWm4ZQJA5tin74gIgG.3AET WwBB6sQ8mJ4K04nYMfFGFXFwzCaQuuJnXiVBQeXkzBsupskoNMLs3vb_iU3XYkgUYVfkYJepzMrE HFPE5wZt0kxy9n_PRicWbIXNtrQ8sZ.GCEdzM15bOc0EzLqzg.9pGoU9vYtuB7Tq0tKbgMZDAQFd tZv0nBBFXdZYXzaAmLUEUcw9WOOhz9uX3fDeVgbvVxWfWVgXm0ebBOWw1t9qHzhKaFKnBbszRo3E WaYg_29wacoL.lyoNVfn70ZetTKGsb62SpTwvr60MaEGgPSx8cA.71eyKzBeUZnqbBb3_afhvucL qTmdx8FpkG8fArstsAXO5RAWwpFUPixRifHdIxfVzR1bJtDeU_hoQQSYPmdJxN7Q6j.PEIGiYL_n TYGe7g_tlm8ke_LCdhAGYVyYt_vTgzSwUoIS3OCdNSvOdAQdP02wMBiP1XJn1ivGishG8AuGfUT. 3TRUYPd64xdLeiisve7Eccv7CN0wZNQHfXAxEyF.QCnkWYFxeUFehqlvFP95lWKrxpfy8B2MYZ1W M.HYxvR5fAiGF0XyMLu3haZA1JpbhXAWpSNY4n9GsqdRcGpgAlNOkAwhAyJjJEtL1WDPiI7mhtMQ 0aokO32uvSBYgECKxs8KpOrT3ZLFODzd7xSKt49lIpG0caBoKAvFoqVVyt.BQAnkL6Ew00nfVSMy _dVBLQGcisFPoJ1I9NFonwOoNXKIO8Qu8iSQR713h6DnopcIwkVIXOrY8d2ijIAtRrRfruQGUNiJ v58quyamkhAsYtEfzr6L20FC5Iig8C2pP6iGq4o48nTUmIrNHGXqpY42Y7f7Nrx7JmT7DwbHVbXA Dq49Fiei294Wem9JzMWJLnUzkqlKCC_n4VLOmy7NRn5IYqIxPkeFm6oyIprtt3jBMrNrVqwrYrlz iOOz_6Hwq5FRf2BSVu9W74Ydc9LHPP11KZsq6cv7XmpvSEnx8HD4LYVqA9tfvuCWM_ZO2_QMQNtL fySnkR_weP81pF3MJ0Y5uG_DaR.GIsbJIsOD61Lt9mKy.Tuen4NmKARBzUjWDiqy2GtYwDm5cu5n jf2OHbDfO_t406URE2TElOaF5urw2TEWLeiPmgkw1wSMLbL_Fu.H.LGG43EOkslXFrEsOxOmqzFp UnNLlvRRonx1exsMJcY6yPDaisigt_WGKUCf5eM.d6lSIgy7lbvBDiAxapvyFgYh_mBcPES6V6bx EIIH19cGSozKqb5bhsJAk1Fwfrpvba.9.hxmSnkGUZHnrOEb4.FKN2eD7dqYHdchEojxjBRPiWoK 3ilKij6oYdxojpstQfGEMNGnbTxpnaJD6D_c8c5nt77nphoHKvhQqQoaJkuUCVQZLEv.xTGOYkAD Sdt96.wYtrBM_A2RSCCHisfVvuG.JMtEUrg246v5Qb4FgJ3F329UxQoMetulGULS44SGB3bOxo9d QPAlkcseQnbXMiBYmtUIU9sRTOC8q7hAmCrWFXWwJN12j7.fNr5jg7rsZlAPLTfllN5gRjNm6781 __9tV__N9ZVEl4co4fTok3MkAGoqqooccmqOps6JQJheBAhp4S9.4c8385p2vyahIssiNwNwmh6L VXqsqj7wuYF9uWS7HI2iwiBYS2ApBLwEpU4IC_Y2VFiHrBxBrTu0XBRkae01O5qRhr5Xh73TqFNk ZLxVP2RCthUzrkftdTlvnb3sxQ3xqhi1lrirYSUJzqzGf2mRFe8zZ59JAOSvtqlXynuIE0PdNfqt saQ-- X-Sonic-MF: X-Sonic-ID: 44d3873f-a7ff-4a77-8ed2-acd705097797 Received: from sonic.gate.mail.ne1.yahoo.com by sonic311.consmr.mail.gq1.yahoo.com with HTTP; Mon, 13 Nov 2023 02:01:04 +0000 Received: by hermes--production-gq1-59b5df67b6-gs8xt (Yahoo Inc. Hermes SMTP Server) with ESMTPA ID 44d90cca8eef67edc156aadfbf32dbbe; Mon, 13 Nov 2023 02:01:00 +0000 (UTC) From: Mark Millard Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable List-Id: Porting software to FreeBSD List-Archive: https://lists.freebsd.org/archives/freebsd-ports List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-ports@freebsd.org X-BeenThere: freebsd-ports@freebsd.org Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3774.200.91.1.1\)) Subject: Re: Ryzen 9 7950X3D bulk -a times: adding an example with SMT disabled (so 16 hardware threads, not 32) Date: Sun, 12 Nov 2023 18:00:46 -0800 References: <88907269-7ECD-4539-AA3D-AD0A31B13CA7@yahoo.com> To: FreeBSD Hackers , FreeBSD Mailing List In-Reply-To: <88907269-7ECD-4539-AA3D-AD0A31B13CA7@yahoo.com> Message-Id: <4596CD14-82EF-4213-9CD8-D065A2F7E073@yahoo.com> X-Mailer: Apple Mail (2.3774.200.91.1.1) X-Spamd-Result: default: False [-3.47 / 15.00]; NEURAL_HAM_LONG(-1.00)[-1.000]; NEURAL_HAM_MEDIUM(-1.00)[-1.000]; NEURAL_HAM_SHORT(-0.97)[-0.966]; DMARC_POLICY_ALLOW(-0.50)[yahoo.com,reject]; MV_CASE(0.50)[]; R_SPF_ALLOW(-0.20)[+ptr:yahoo.com]; R_DKIM_ALLOW(-0.20)[yahoo.com:s=s2048]; MIME_GOOD(-0.10)[text/plain]; FROM_HAS_DN(0.00)[]; RCVD_VIA_SMTP_AUTH(0.00)[]; TO_MATCH_ENVRCPT_SOME(0.00)[]; ARC_NA(0.00)[]; MLMMJ_DEST(0.00)[freebsd-ports@freebsd.org]; RCVD_IN_DNSWL_NONE(0.00)[98.137.65.206:from]; DWL_DNSWL_NONE(0.00)[yahoo.com:dkim]; ASN(0.00)[asn:36647, ipnet:98.137.64.0/20, country:US]; RWL_MAILSPIKE_POSSIBLE(0.00)[98.137.65.206:from]; DKIM_TRACE(0.00)[yahoo.com:+]; TO_DN_ALL(0.00)[]; FREEMAIL_FROM(0.00)[yahoo.com]; MID_RHS_MATCH_FROM(0.00)[]; RCPT_COUNT_TWO(0.00)[2]; RCVD_TLS_LAST(0.00)[]; FROM_EQ_ENVFROM(0.00)[]; FREEMAIL_ENVFROM(0.00)[yahoo.com]; MIME_TRACE(0.00)[0:+]; RCVD_COUNT_TWO(0.00)[2] X-Rspamd-Queue-Id: 4STCMQ43RCz3TYr X-Spamd-Bar: --- On Nov 9, 2023, at 17:26, Mark Millard wrote: > Reading some benchmark results for compilation activity that showed = some > SMT vs. not examples and also using my C++ variant of the old HINT > benchmark, I ended up curious how a non-SMT from scratch bulk -a would > end up (ZFS context) compared my prior SMT based run. >=20 > I use a high load average style of bulk -a activity that has = USE_TMPFS=3Dall > involved. The system has 96 GiBytes of RAM (total across the 2 DIMMs). > The original under 1.5 day time definitely had significant swap space = use > (RAM+SWAP =3D 96 GiBYtes + 364 GiBytes =3D=3D 460 GiBytes =3D=3D = 471040 MiBytes). > The media was (and is) a PCIe based Optane 905P 1.5T. ZFS on a single > partition on the single drive, ZFS used just for bectl reasons, not = other > typical use-ZFS reasons. I've not controlled the ARC size-range = explicitly. >=20 > So less swap partition use is part of contribution to the results. >=20 > The original bulk -a spent a couple of hours at the end where it was > just fetching and building textproc/stardict-quick . I have not = cleared > out /usr/ports/distfiles or updated anything. >=20 > So fetch time is also a difference here. >=20 > SMT (32 hardware threads, original bulk -a): >=20 > [33:10:00] [32] [04:37:23] Finished emulators/libretro-mame | = libretro-mame-20220124_1: Success > [35:36:51] [23] [03:44:04] Finished textproc/stardict-quick | = stardict-quick-2.4.2_9: Success > . . . > [main-amd64-bulk_a-default] [2023-11-01_07h14m50s] [committing:] = Queued: 34683 Built: 33826 Failed: 179 Skipped: 358 Ignored: 320 = Fetched: 0 Tobuild: 0 Time: 35:37:55 >=20 > Swap-involved MaxObs (Max Observed) figures: > 173310Mi MaxObsUsed > 256332Mi MaxObs(Act+Lndry+SwapUsed) > 265551Mi MaxObs(Act+Wir+Lndry+SwapUsed) > (So 265551Mi of 471040Mi RAM+SWAP.) >=20 > Just-RAM MaxObs figures: > 81066Mi MaxObsActive > (Given the complications of getting usefully comparable wired figures = for ZFS (ARC): omit.) > 94493Mi MaxObs(Act+Wir+Lndry) >=20 > Note: MaxObs(A+B+C) <=3D MaxObs(A)+MaxObs(B)+MaxObs(C) >=20 > ALLOW_MAKE_JOBS=3Dyes was used. No explicit restriction on = PARALLEL_JOBS > or MAKE_JOBS_NUMBER (or analogous). So 32 builders allowed, each = allowed > 32 make jobs. This explains the high load averages of the bulk -a : >=20 > load averages . . . MaxObs: 360.70, 267.63, 210.84 > (Those need not be all from the same time frame during the bulk -a .) >=20 > As for the ports vintage: >=20 > # ~/fbsd-based-on-what-commit.sh -C /usr/ports/ > 6ec8e3450b29 (HEAD -> main, freebsd/main, freebsd/HEAD) devel/sdts++: = Mark DEPRECATED > Author: Muhammad Moinur Rahman > Commit: Muhammad Moinur Rahman > CommitDate: 2023-10-21 19:01:38 +0000 > branch: main > merge-base: 6ec8e3450b29462a590d09fb0b07ed214d456bd5 > merge-base: CommitDate: 2023-10-21 19:01:38 +0000 > n637598 (--first-parent --count for merge-base) >=20 > I do have a environment that avoids various LLVM builds taking > as long to build : >=20 > llvm1[3-7] : no MLIR, no FLANG > llvm1[4-7] : use BE_NATIVE > other llvm* : use defaults (so, no avoidance) >=20 > I also prevent the builds from using strip on most of the install > materials built (not just toolchain materials). >=20 >=20 > non-SMT (16 hardware threads): >=20 > Note one builder (math/fricas), the last still present, was > stuck and I had to kill processes to have it stop unless I > was willing to wiat for my large timeout figures. The last > builder normal-finish was: >=20 > [39:48:10] [09] [00:16:23] Finished devel/gcc-msp430-ti-toolchain | = gcc-msp430-ti-toolchain-9.3.1.2.20210722_1: Success >=20 > So, trying to place some bounds for comparing to SMT (32 hw threads) > and non-SMT (16 hw threads): >=20 > 33:10:00 SMT -> 39:48:10 non-SMT would be over 6.5 hrs longer for = non-SMT > 35:36:51 SMT -> 39:48:10 non-SMT would be over 4 hrs longer for = non-SMT >=20 > As for SMT vs. non-SMT Maximum Observed figures: >=20 > SMT load averages . . . MaxObs: 360.70, 267.63, 210.84 > non-SMT load averages . . . MaxObs: 152.89, 100.94, 76.28 >=20 > Swap-involved MaxObs figures for SMT (32 hw threads) vs not (16): > 173310Mi vs. 33003Mi MaxObsUsed > 256332Mi vs. 117221Mi MaxObs(Act+Lndry+SwapUsed) > 265551Mi vs. 124776Mi MaxObs(Act+Wir+Lndry+SwapUsed) >=20 > Just-RAM MaxObs figures for SMT (32 hw threads) vs not (16): > 81066Mi vs. 69763Mi MaxObsActive > (Given the complications of getting usefully comparable wired figures = for ZFS (ARC): omit.) > 94493Mi vs. 94303Mi MaxObs(Act+Wir+Lndry) >=20 I've added a section for a plot for the 7950X3D to the end of: = https://github.com/markmi/acpphint/blob/master/Some_acpphint_curves_with_n= otes.md It is from a C++ variant of the old HINT benchmark and includes showing RAM caching consequences for the benchmark. The about 32 MiByte and about 96 MiByte cache sizes for the 2 CCDs are observable. I'll also note that for the devices present (active and not), at fully active the 7950X3D seems to use 225 Watts .. 235 Watts at the power cable for FreeBSD. Idle FreeBSD: more like 96 Watts. (No video card. 2 forms of Optane 905P 1.5TB, one active. One Samsung 960 Pro 2TB, inactive. One Samsung 970 EVO Plus 2TB, inactive. 96 GiBytes of RAM total across 2 DIMMs. Fans and AIO cooling. Keyboard and mouse USB powered. USB3 Ethernet dongle. Monitor connection.) ThreadRipper 1950X "bulk -a" test in progress: I'm running a from-scratch USE_TMPFS=3Dall "bulk -a" on the ThreadRipper 1950X (128 GiBytes of RAM). =46rom what I've seen so far, it looks to likely take over 72 hr, so 2x+ as long as the 7950X3D. (Samgsung 960 Pro 1TB system media and Optane 900 480 GB swap space media in use, 447 GiByte I as I remember). The ZFS partition on the 960 Pro has ashift=3D14 .) It has a slightly modified copy of the ZFS from the 7950X3D as far as starting content goes. It does have openzfs-2.2 compatibility fully enabled for its pool, including block cloning, unlike any other ZFS I have around (openzfs-2.1-freebsd). =3D=3D=3D Mark Millard marklmi at yahoo.com