@b0rk another reason for using power of two sizes is that clock dividers that work based on halving are trivial to design (a flip flop whose input is driven by the inverse of its output will produce a square wave at half the clock frequency you drive it at), so if you think about designing a circuit where some data is sent serially (bit by bit) down a single wire and you want to trigger some event at the end of each byte, you can use three /2 clock dividers in series to get /8.