@inthehands Fair points, but you seem to be comparing an LLM-based system against one with perfect efficiency, instead of the existing human-based system (which I’m certain has its own set of failings).
While it’s useful to know how an LLM system deviates from the ideal, I’d be far more interested in how it compares against the existing system. Personally, I don’t need a system to be perfect - I just need it to be better