@lanodan yep, this looked like some interesting problems to determine the capabilities of a language, gauge how fit each is as a general purpose tool, identify possible shortcomings
But actual applications in the wild don’t necessarily implement these edge cases often (or ever), and that’s perhaps what that warning was about
If you’re going to benchmark, at least share your versions and settings so that they are readily available to readers. And if you write an article covering the research, do the same?
That said, I’m no researcher