@regehr @dalias (macro-fusing bswap + cmp would generally not be feasible, because the CPU cannot know that the result of the bswap isn't used elsewhere. The usual rule of thumb for macro fusion is that the second instruction of the pair has to overwrite the register that holds the intermediate result. Since a compare only sets flags and doesn't write a register, the bswap result stays live after it, which makes cmp a bad candidate for the second half of a fused pair.)
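
To make the rule of thumb concrete, here is a rough toy model in Python. It is only a sketch of the register-overwrite check described above and not how any real decoder decides on fusion. The `Insn` type, the `can_fuse` helper, and the example instructions are all made up for illustration. In particular, the `add` case only shows that the rule would be satisfied, not that any CPU fuses that pair.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Insn:
    """Hypothetical, heavily simplified instruction description."""
    name: str
    dst: Optional[str]       # register written, or None if only flags are written
    srcs: Tuple[str, ...]    # registers read


def can_fuse(first: Insn, second: Insn) -> bool:
    """Toy version of the rule of thumb: the second instruction must consume
    the intermediate result AND overwrite the register that holds it, so the
    intermediate value can never be observed by later instructions."""
    if first.dst is None:
        return False
    consumes = first.dst in second.srcs
    overwrites = second.dst == first.dst
    return consumes and overwrites


bswap = Insn("bswap eax", dst="eax", srcs=("eax",))
cmp_ = Insn("cmp eax, ebx", dst=None, srcs=("eax", "ebx"))   # writes flags only
add_ = Insn("add eax, ebx", dst="eax", srcs=("eax", "ebx"))  # overwrites eax

# cmp leaves the byte-swapped eax live, so the pair fails the check.
print(can_fuse(bswap, cmp_))
# add overwrites eax, so the swapped value is dead afterwards and the check passes.
print(can_fuse(bswap, add_))
```

The only thing the model tracks is whether the intermediate register is dead after the pair. That is exactly what the hardware would need to know and cannot know for bswap + cmp without looking arbitrarily far ahead in the instruction stream.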