Screenshot from the PDF.
It appears that #E4M3 (the one without multiple NaNs etc.) has less range, and less precision near 0, but smaller steps between numbers on average.
E4M3 also has NaN, just one type of NaN (nice).
I'm not thinking about #deepLearning here in particular, but I think I prefer E4M3 personally (for what it's worth).
Interesting development. Never thought float8's could have any use, and here we are in 2025 with the potential use of them.