Re: FreeBSD 13.2R and OpenZFS bug #15933

From: Kurt Hackenberg <kh_at_panix.com>
Date: Sat, 02 Mar 2024 00:08:08 UTC
On Fri, Mar 01, 2024 at 02:04:01PM -0800, David Christensen wrote:

>>Here is a write-up of the person that fixed the bug containing all the
>>gory details:
>>https://despairlabs.com/blog/posts/2023-12-25-openzfs-data-corruption-bug/
...
>I am also reminded of an operating systems course -- specifically: 
>shared resources, critical sections, and thread-safe programming 
>techniques.  Failing to solve the mutual exclusion problem correctly in 
>the design is going to result in race conditions in the implementation.  
>The article discusses concurrent operations, but only implies 
>concurrent design.  I suspect this is where the root cause of the 
>OpenZFS bugs is to be found.

That article sounds like there was no mutual exclusion around the dirty 
lists.  It says the bug they found was fixed, but doesn't say how.