@steve @cr1901 @pkhuong and even with that, compilers will often sever the dependency explicitly by doing an xor or movsx/xx to the destination at the start of an operation chain
the other thing that makes 16 bit operations slow on some uarches iirc is a penalty for the operand size prefix byte