They allege that OpenAI’s ChatGPT and Meta’s LLaMA chatbots were trained on datasets that included their copyrighted books, without their permission. The datasets in question were allegedly obtained from “shadow library” websites like Bibliotik, Library Genesis, and Z-Library. These websites are known for distributing pirated content.

  • harry_h0udini911@lemmy.fmhy.ml
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 year ago

    Ahhh. When I ask the GPT about piracy, there is silence. But GPT itself is running on the data that was scraped illegally. Hypocrisy = 1000