Nice blog post, though I personally prefer the ridiculousfish post [0] he links ...

Dylan16807 · on Oct 19, 2012

Well, to be specific about x86:

Reads always happen in the original order.

Writes always happen in the original order.

Reads can be moved in front of writes.

callan · on Oct 19, 2012

I work for Intel. This is not correct. A lot depends on your cache type. The two basic ones are uncacheable and write-back.

What you wrote is true for UC. For WB, reads can happen in any order (especially due to cache pre-fetchers). Writes always happen in program order, unless you are in other cache types such as write-combining. WC is mainly used for graphics memory-mapped pixmaps, where the order doesn't matter.

But don't let this scare you too much. From the viewpoint of a single CPU, everything is in order. It's only when you look at it from the memory bus point-of-view that things get confusing.

Unless we are dealing with a memory-mapped IO device that has read side-effects, in which case you need to carefully choose a cache type.

Dylan16807 · on Oct 19, 2012

Well yes as far as WB I meant from the view of the program. Modern CPUs do all kinds of terrifying rewriting and speculation internally.

dspeyer · on Oct 20, 2012

How do I control my cache type?

yuhong · on Oct 20, 2012

I don't think you can from a user mode program. If you are dealing with normal memory, it will be typically WB, and that is the only thing most user-mode programs will encounter.

dchichkov · on Oct 20, 2012

__builtin_prefetch

ComputerGuru · on Oct 19, 2012

Yeah, it's that last one: "can" but usually won't.