Synchronizing TSC
Konstantin Belousov
kostikbel at gmail.com
Thu Apr 18 16:12:02 UTC 2013
On Thu, Apr 18, 2013 at 05:16:22PM +0300, Alexander Motin wrote:
> On 17.04.2013 11:50, Konstantin Belousov wrote:
> > On Wed, Apr 17, 2013 at 09:46:15AM +0300, Alexander Motin wrote:
> >> On 17.04.2013 03:25, Jim Harris wrote:
> >>>
> >>> On Tue, Apr 16, 2013 at 3:04 PM, Alexander Motin <mav at freebsd.org
> >>> <mailto:mav at freebsd.org>> wrote:
> >>>
> >>> Hi.
> >>>
> >>> Recently I've got 6-core/12-thread system on Sandy Bridge-E Core
> >>> i7-3930K CPU and was unpleasantly surprised to see that TSCs are not
> >>> synchronized there. While all 11 APs were synchronized, BSP was far
> >>> behind them. Since it is single-socket system, I don't know any good
> >>> reason for such behavior except some BIOS bug. But I've recalled
> >>> that somewhere was some discussions about possible TSC
> >>> synchronization. I've implemented patch below that allows to adjust
> >>> TSC values of BSPs to AP's one on boot using CPU MSRs, hoping that
> >>> they should not diverge after that:
> >>> http://people.freebsd.org/~__mav/tsc_adj2.patch
> >>> <http://people.freebsd.org/~mav/tsc_adj2.patch>
> >>>
> >>> I don't know very much about all different TSC hardware to predict
> >>> when it is safe to enable the functionality, but at least on my
> >>> system being enabled via loader tunable it seems working well.
> >>>
> >>> Comments?
> >>>
> >>>
> >>> You may be remembering this thread on r238755 last year:
> >>>
> >>> http://lists.freebsd.org/pipermail/svn-src-head/2012-July/038992.html
> >>>
> >>> This was a bug fix in the TSC synchronization test code though, not
> >>> anything for trying to adjust out-of-sync TSCs.
> >>
> >> I remember that thread, but I think I've seen somebody told somewhere
> >> that it could be interesting to implement some MI mechanism. Never mind.
> >>
> >>> The Intel SDM (volume 3, section 17.13 of March 2013 revision) says
> >>> earlier models can only write to lower 32 bits of
> >>> IA32_TIME_STAMP_COUNTER, but these models also should not have invariant
> >>> TSC so they would never even get to your new routine. So your patch
> >>> seems OK for Intel CPUs, at least as a tunable that is disabled by default.
> >>
> >> Thanks.
> >>
> >>> My only concern would be why TSC on the BSP started out-of-sync on your
> >>> system. Theoretically, BIOS could adjust TSCs in SMM to try to hide SMI
> >>> code execution from the OS, which could then make them out-of-sync
> >>> again. Not sure if that's what's happening here, but might be worth a
> >>> test putting the TSC test code on a periodic timer to see if they ever
> >>> get out of sync again.
> >>
> >> I did one more interesting observation: on every reboot drift between
> >> BSP and APs is growing proportionally to the previous system power-on
> >> time. On first boot it is -3878361036 (just above one second), after
> >> reboot some minutes later it is -1123454492776 (about 6 minutes), after
> >> another reboot it is -1853033521804 (about 10 minutes).
> >>
> >> Unless my adjustment code would be active, I would guess that AP's TSC
> >> is running linearly while BSP's for some reason reset to zero on every
> >> reboot. But since I am synchronizing them on each boot, the only
> >> possibility for it I see is that there is some other timer(s) /
> >> counter(s) not affected by MSR writes that ticks linearly and reloading
> >> AP's TSC, but for some reason not reloading BSP's.
> >
> > For me it sounds as the BIOS bug, indeed. Could you verify the content
> > of IA32_TSC_ADJUST on all cores (I believe it is present on E5) ?
> > Also, using TSC_ADJUST to correct the skew seems to be preferrable,
> > according to the Intel docs.
>
> IA32_TSC_ADJUST register seems not present there. At least cpucontrol
> doesn't want to read it. In Intel docs I also see it mentioned only in
> context of future Haswell generation. And I don't see "Standard Extended
> Features" line in dmesg.
>
> > Why do you use cpuid in the assembly sequence ? As I understand, you
> > ensure that there is a serialization point, but why do you need it ?
>
> The idea was to minimize time distance between following MSR read and
> write. But may be it is not needed, I am not exactly sure about that magic.
Well, I do not believe that such trick is useful.
Patch with removed cpuid and with resync disabled by default (as it is
now, AFAIR) would be IMO fine for the commit.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 834 bytes
Desc: not available
URL: <http://lists.freebsd.org/pipermail/freebsd-hackers/attachments/20130418/8fad0303/attachment.sig>
More information about the freebsd-hackers
mailing list