Comments

  • n0respect@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    8
    ·
    22 hours ago

    [1] Context: An AI agent of unknown ownership autonomously wrote and published a personalized hit piece about me after I rejected its code, attempting to damage my reputation and shame me into accepting its changes into a mainstream python library. This represents a first-of-its-kind case study of misaligned AI behavior in the wild, and raises serious concerns about currently deployed AI agents executing blackmail threats.

    The author touches on how malicious AI agents cause truth and trust to break down throughout society. Indeed, they are goners.


    1. fixing the autotext ↩︎