I don’t see how that affects my point.
- Today’s AI detectors can’t tell the output of today’s LLMs apart from human writing.
- Future AI detectors WILL be able to identify the output of today’s LLMs.
- Of course, future AI detectors won’t be able to identify the output of future LLMs.
So at any point in time, only recent text could be “contaminated”. The claim that “all text after 2023 is forever contaminated” just isn’t true. Researchers would simply have to be a bit more careful about including recent text.
If you give me several paragraphs instead of a single sentence, do you still think it’s impossible to tell?