Can we discuss how it’s possible that the paid model (gpt4) got worse and the free one (gpt3.5) got better? Is it because the free one is being trained on a larger pool of users or what?

  • blue_zephyr@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    arrow-down
    1
    ·
    edit-2
    11 months ago

    It’s because the research in question used a really small and unrepresentative dataset. I want to see these findings reproduced on a proper task collection.

    • Gsus4@lemmy.oneOP
      link
      fedilink
      English
      arrow-up
      3
      ·
      11 months ago

      True, checking whether a number is prime is very limited in scope for chargpt, but this is in line with other reports of progressive dumbing down.