Do you feel that mutex performance could be improved from the current 2-3 million lock/unlock operations per second on uncontested mutexes on ~2.4Ghz prescott? Which seems to be about 1000 cycles per lock/unlock. I have a fairly basic producer/consumer application to optimize and I'm trying to decide on the performance-optimal synchronization method. Pete