TikTok’s parent company, ByteDance, has been secretly using OpenAI’s technology to develop its own competing large language model (LLM). “This practice is generally considered a faux pas in the AI world,” writes The Verge’s Alex Heath. “It’s also in direct violation of OpenAI’s terms of service, which state that its model output can’t be used ‘to develop any artificial intelligence models that compete with our products and services.’”

  • TootSweet@lemmy.world
    9 months ago

    OpenAI will steal a whole internet’s worth of everybody’s data to train their large language model, but gets pissed when others do the same to them.

        • FaceDeer@kbin.social
          9 months ago

          No, even then it isn’t. It’s not stealing. There is literally a whole different body of law defining stealing versus the body of law that defines copyright and intellectual property. The data is still exactly where it was to begin with, therefore it hasn’t been stolen.

          I wish people would stop using wildly inaccurate loaded terminology in these discussions simply to score emotional points.

    • AdamEatsAss@lemmy.world
      9 months ago

      They didn’t really “steal” the internet data. I don’t think most of the websites and data logs they used explicitly said “don’t use this to train a large language model.”

      • Maëlys@slrpnk.net
        9 months ago

        Unpopular opinion but true: they took advantage of a legal loophole and cashed in on it. Legal counsel really pays dividends.