Plausible output is plausible because it is similar to the input text, to distinguish this case you have to be able to reason about what the words actually mean. Just pattern matching is not good enough because you can write two sentences that mean exactly the opposite thing, which are still strongly correlated with each other except for the sense of a conjunction or a negation. I have observed exactly this behavior.