OpenAI Says It's Fine to Vacuum Up Everyone's Content and Charge for It Without Paying Them (futurism.com)
Posted by Yuritopiaposadism [none/use name]@hexbear.net to technology@hexbear.net · English · 2 years ago · 16 comments
Awoo [she/her]@hexbear.net · 2 years ago
If AI will regurgitate its training data then you can perform copyright-laundering via this one neat loophole. We can move literally the entire internet (which is basically all in their training data) into the public domain.
JohnBrownNote [comrade/them, des/pair]@hexbear.net (banned) · 2 years ago
unfortunately i think these things don’t keep the training set, just the set of associations and relations it made by analyzing it
Awoo [she/her]@hexbear.net · 2 years ago
Not true, they will completely and totally replicate their training data. The companies try to prevent this, so the method to get it to happen regularly changes, but they do it.
ChatGPT: https://not-just-memorization.github.io/extracting-training-data-from-chatgpt.html
Image AIs: https://techcrunch.com/2022/12/13/image-generating-ai-can-copy-and-paste-from-training-data-raising-ip-concerns/?guccounter=1
I’m not saying this would work or that you won’t get in trouble for doing it. But it would fuck the system just a little bit.
JohnBrownNote [comrade/them, des/pair]@hexbear.net (banned) · 2 years ago
oh wow that's great lol