From nobody Fri Nov 11 01:48:23 2022 X-Original-To: dev-commits-src-all@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4N7hS80sYCz4f42g; Fri, 11 Nov 2022 01:48:24 +0000 (UTC) (envelope-from git@FreeBSD.org) Received: from mxrelay.nyi.freebsd.org (mxrelay.nyi.freebsd.org [IPv6:2610:1c1:1:606c::19:3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (4096 bits) client-digest SHA256) (Client CN "mxrelay.nyi.freebsd.org", Issuer "R3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4N7hS8022Hz3Kk1; Fri, 11 Nov 2022 01:48:24 +0000 (UTC) (envelope-from git@FreeBSD.org) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=freebsd.org; s=dkim; t=1668131304; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=eHBveyfxCdpy2Vm6dN+eRLhO4VRg9Ifq8XHapkRy71E=; b=XNsL2461ur9gxU0cT0PjEzf8L4AiAQ/JWQg3qjLOYfuex8+0qNeX8t+FgD9O7HfvIzgANb oPa/Whbpt2aBgSaYW6lzgh0f6y7oxqJCTwAASxRYCM7HNsbeFIxEZOvX/M697E1/YzpMuo UVXbyQ6h4fiGHEaT51lwmR3csStzcAJ6FeeX9hNiC1kLzfObHe/Rkehe2l/Y5Okds7Akkq 3/IGoYw29e6lVVbuGNNJHNEufSgERVQ8ay5fWQ8rnGHU6S/b1Z6k3VxQObcjbdtUWbuzEY ORdIwmqvbeBkDcp1+J5UJ7otaysUVtx7ii182jZGxscVHfSOc0FzZsKuzBTDgg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=freebsd.org; s=dkim; t=1668131304; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding; bh=eHBveyfxCdpy2Vm6dN+eRLhO4VRg9Ifq8XHapkRy71E=; b=Yg7YVTo6bL6kj1T2qEoueEPMXzBxUeWbBKxDqrDkj+XGBjDW4MY9tQ7zSKaP0JcDkwf6Dp gNUYmC/g9tfwdJO58qC0haaN8i2Qgbx49/UGp/yQgQWNJx3/9JhbPIy0THQgGRt4Y3qY9V au8ZbSYXRFUReC6Y15ktVeD8ONhSTbbyGbN/yYSt6wZ4855TQcZVELTYan/OtZBwCiY0/z InvKgA1FuHhbU2ac6i/p5u0btQEAA99SdK27cfM2fBmoMq+jO6NfhLgLTAJ8tUmlOgRTSj fPqLLquQI3d07JuN69q6QZ6NsuAsir0a9tcjKbm8aO9CIP9oXn4AbtduZQUA6g== ARC-Seal: i=1; s=dkim; d=freebsd.org; t=1668131304; a=rsa-sha256; cv=none; b=rOur7TwnnyYpsgIMQJUDUAXUzmy7yaEFm1IHHXooI9KGLe8LK7lP0qfPZtOm+yXAnMIRVN 5OogrvXQ7xi3x/pcGTIHY8B0iRaMKGXnsfeasCxTr3AzfyIcMYITiAwL1zoQQe+OkkNJVz mjfYpXC/UyJiWtMpGbt/9wEBTDHpPJICmSno6QdWckcXHIOgZBpUXqNyeS1ls8Fq9ZjJ7k nw081PqAPdzdCRu3vvEEJwTLEv7Hl8Qz5RFQpS09QqlGCeXypzngH1OOs32l81P1h8z4JV 84/RM0uI0yPao/gDAaaoV2tRhVSntPyBHXl2TvjikrTbyiI0N4i8gpAnvAnRKA== ARC-Authentication-Results: i=1; mx1.freebsd.org; none Received: from gitrepo.freebsd.org (gitrepo.freebsd.org [IPv6:2610:1c1:1:6068::e6a:5]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mxrelay.nyi.freebsd.org (Postfix) with ESMTPS id 4N7hS76DWtzPLF; Fri, 11 Nov 2022 01:48:23 +0000 (UTC) (envelope-from git@FreeBSD.org) Received: from gitrepo.freebsd.org ([127.0.1.44]) by gitrepo.freebsd.org (8.16.1/8.16.1) with ESMTP id 2AB1mNc2056551; Fri, 11 Nov 2022 01:48:23 GMT (envelope-from git@gitrepo.freebsd.org) Received: (from git@localhost) by gitrepo.freebsd.org (8.16.1/8.16.1/Submit) id 2AB1mN5E056550; Fri, 11 Nov 2022 01:48:23 GMT (envelope-from git) Date: Fri, 11 Nov 2022 01:48:23 GMT Message-Id: <202211110148.2AB1mN5E056550@gitrepo.freebsd.org> To: src-committers@FreeBSD.org, dev-commits-src-all@FreeBSD.org, dev-commits-src-branches@FreeBSD.org From: John Baldwin Subject: git: 2800e3a8ae1d - stable/13 - cxgbe: Compute timestamps via sbintime_t. List-Id: Commit messages for all branches of the src repository List-Archive: https://lists.freebsd.org/archives/dev-commits-src-all List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-dev-commits-src-all@freebsd.org X-BeenThere: dev-commits-src-all@freebsd.org MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Git-Committer: jhb X-Git-Repository: src X-Git-Refname: refs/heads/stable/13 X-Git-Reftype: branch X-Git-Commit: 2800e3a8ae1daead79759efb361729447c608634 Auto-Submitted: auto-generated X-ThisMailContainsUnwantedMimeParts: N The branch stable/13 has been updated by jhb: URL: https://cgit.FreeBSD.org/src/commit/?id=2800e3a8ae1daead79759efb361729447c608634 commit 2800e3a8ae1daead79759efb361729447c608634 Author: John Baldwin AuthorDate: 2022-09-26 21:57:26 +0000 Commit: John Baldwin CommitDate: 2022-11-11 01:26:27 +0000 cxgbe: Compute timestamps via sbintime_t. This uses fixed-point math already used elsewhere in the kernel for sub-second time values. To avoid overflows this does require updating the calibration once a second rather than once every 30 seconds. Note that the cxgbe driver already queries multiple registers once a second for the statistics timers. This version also uses fewer instructions with no branches (for the math portion) in the per-packet fast path. Reviewed by: np Sponsored by: Chelsio Communications Differential Revision: https://reviews.freebsd.org/D36663 (cherry picked from commit 2c74c9dac3a6e74a2c33d519cdcf1de145e7664c) --- sys/dev/cxgbe/adapter.h | 4 ++-- sys/dev/cxgbe/t4_main.c | 39 ++++++--------------------------------- sys/dev/cxgbe/t4_sge.c | 41 +++++------------------------------------ 3 files changed, 13 insertions(+), 71 deletions(-) diff --git a/sys/dev/cxgbe/adapter.h b/sys/dev/cxgbe/adapter.h index 1fb0d9ff61c2..4d246512b868 100644 --- a/sys/dev/cxgbe/adapter.h +++ b/sys/dev/cxgbe/adapter.h @@ -865,8 +865,8 @@ struct clip_entry; struct clock_sync { uint64_t hw_cur; uint64_t hw_prev; - uint64_t rt_cur; - uint64_t rt_prev; + sbintime_t sbt_cur; + sbintime_t sbt_prev; uint32_t gen; }; diff --git a/sys/dev/cxgbe/t4_main.c b/sys/dev/cxgbe/t4_main.c index 9ac790abfcf4..eff4db09aa12 100644 --- a/sys/dev/cxgbe/t4_main.c +++ b/sys/dev/cxgbe/t4_main.c @@ -319,18 +319,6 @@ static int t4_rsrv_noflowq = 0; SYSCTL_INT(_hw_cxgbe, OID_AUTO, rsrv_noflowq, CTLFLAG_RDTUN, &t4_rsrv_noflowq, 0, "Reserve TX queue 0 of each VI for non-flowid packets"); -static int t4_clocksync_fast = 1; -SYSCTL_INT(_hw_cxgbe, OID_AUTO, csfast, CTLFLAG_RW | CTLFLAG_MPSAFE, &t4_clocksync_fast, 0, - "During initial clock sync how fast do we update in seconds"); - -static int t4_clocksync_normal = 30; -SYSCTL_INT(_hw_cxgbe, OID_AUTO, csnormal, CTLFLAG_RW | CTLFLAG_MPSAFE, &t4_clocksync_normal, 0, - "During normal clock sync how fast do we update in seconds"); - -static int t4_fast_2_normal = 30; -SYSCTL_INT(_hw_cxgbe, OID_AUTO, cscount, CTLFLAG_RW | CTLFLAG_MPSAFE, &t4_fast_2_normal, 0, - "How many clock syncs do we need to do to transition to slow"); - #if defined(TCP_OFFLOAD) || defined(RATELIMIT) #define NOFLDTXQ 8 static int t4_nofldtxq = -NOFLDTXQ; @@ -1122,17 +1110,10 @@ t4_ifnet_unit(struct adapter *sc, struct port_info *pi) return (-1); } -static inline uint64_t -t4_get_ns_timestamp(struct timespec *ts) -{ - return ((ts->tv_sec * 1000000000) + ts->tv_nsec); -} - static void t4_calibration(void *arg) { struct adapter *sc; - struct timespec ts; struct clock_sync *cur, *nex; int next_up; @@ -1144,17 +1125,15 @@ t4_calibration(void *arg) if (__predict_false(sc->cal_count == 0)) { /* First time in, just get the values in */ cur->hw_cur = t4_read_reg64(sc, A_SGE_TIMESTAMP_LO); - nanouptime(&ts); - cur->rt_cur = t4_get_ns_timestamp(&ts); + cur->sbt_cur = sbinuptime(); sc->cal_count++; goto done; } nex->hw_prev = cur->hw_cur; - nex->rt_prev = cur->rt_cur; - KASSERT((hw_off_limits(sc) == 0), ("hw_off_limits at t4_calibtration")); + nex->sbt_prev = cur->sbt_cur; + KASSERT((hw_off_limits(sc) == 0), ("hw_off_limits at t4_calibration")); nex->hw_cur = t4_read_reg64(sc, A_SGE_TIMESTAMP_LO); - nanouptime(&ts); - nex->rt_cur = t4_get_ns_timestamp(&ts); + nex->sbt_cur = sbinuptime(); if ((nex->hw_cur - nex->hw_prev) == 0) { /* The clock is not advancing? */ sc->cal_count = 0; @@ -1165,17 +1144,11 @@ t4_calibration(void *arg) sc->cal_current = next_up; sc->cal_gen++; atomic_store_rel_int(&nex->gen, sc->cal_gen); - if (sc->cal_count < t4_fast_2_normal) - sc->cal_count++; done: - callout_reset_sbt_curcpu(&sc->cal_callout, - ((sc->cal_count < t4_fast_2_normal) ? - t4_clocksync_fast : t4_clocksync_normal) * SBT_1S, 0, - t4_calibration, sc, C_DIRECT_EXEC); + callout_reset_sbt_curcpu(&sc->cal_callout, SBT_1S, 0, t4_calibration, + sc, C_DIRECT_EXEC); } - - static void t4_calibration_start(struct adapter *sc) { diff --git a/sys/dev/cxgbe/t4_sge.c b/sys/dev/cxgbe/t4_sge.c index 48d87be4e0b6..fc8a55111a4c 100644 --- a/sys/dev/cxgbe/t4_sge.c +++ b/sys/dev/cxgbe/t4_sge.c @@ -1516,10 +1516,9 @@ static inline uint64_t t4_tstmp_to_ns(struct adapter *sc, uint64_t lf) { struct clock_sync *cur, dcur; - uint64_t tstmp_sec, tstmp_nsec; uint64_t hw_clocks; - uint64_t rt_cur_to_prev, res_s, res_n, res_s_modulo, res; - uint64_t hw_clk_div, cclk; + uint64_t hw_clk_div; + sbintime_t sbt_cur_to_prev, sbt; uint64_t hw_tstmp = lf & 0xfffffffffffffffULL; /* 60b, not 64b. */ uint32_t gen; @@ -1541,42 +1540,12 @@ t4_tstmp_to_ns(struct adapter *sc, uint64_t lf) * * With the constraints that we cannot use float and we * don't want to overflow the uint64_t numbers we are using. - * - * The plan is to take the clocking value of the hw timestamps - * and split them into seconds and nanosecond equivalent portions. - * Then we operate on the two portions seperately making sure to - * bring back the carry over from the seconds when we divide. - * - * First up lets get the two divided into separate entities - * i.e. the seconds. We use the clock frequency for this. - * Note that vpd.cclk is in khz, we need it in raw hz so - * convert to hz. */ - cclk = (uint64_t)sc->params.vpd.cclk * 1000; hw_clocks = hw_tstmp - dcur.hw_prev; - tstmp_sec = hw_clocks / cclk; - tstmp_nsec = hw_clocks % cclk; - /* Now work with them separately */ - rt_cur_to_prev = (dcur.rt_cur - dcur.rt_prev); - res_s = tstmp_sec * rt_cur_to_prev; - res_n = tstmp_nsec * rt_cur_to_prev; - /* Now lets get our divider */ + sbt_cur_to_prev = (dcur.sbt_cur - dcur.sbt_prev); hw_clk_div = dcur.hw_cur - dcur.hw_prev; - /* Make sure to save the remainder from the seconds divide */ - res_s_modulo = res_s % hw_clk_div; - res_s /= hw_clk_div; - /* scale the remainder to where it should be */ - res_s_modulo *= cclk; - /* Now add in the remainder */ - res_n += res_s_modulo; - /* Now do the divide */ - res_n /= hw_clk_div; - res_s *= cclk; - /* Recombine the two */ - res = res_s + res_n; - /* And now add in the base time to get to the real timestamp */ - res += dcur.rt_prev; - return (res); + sbt = hw_clocks * sbt_cur_to_prev / hw_clk_div + dcur.sbt_prev; + return (sbttons(sbt)); } static inline void