• sleepundertheleaves
    3 days ago

    Ironically, actual human contributions to Wikipedia play a significant role in training LLMs. And since you can’t keep training LLMs on LLM-generated text without the output degrading (so-called model collapse), the more Wikipedia is contaminated by AI text, the less useful it becomes for training purposes.

    I’m torn between thinking “keep poisoning your own fucking well, assholes” and “look at all this free labor Wikipedia editors are doing to help AI developers”.