The tool, called Nightshade, messes up training data in ways that could cause serious damage to image-generating AI models. It is intended as a way to fight back against AI companies that use artists’ work to train their models without the creators’ permission.

ARTICLE - Technology Review
ARTICLE - Mashable
ARTICLE - Gizmodo

The researchers tested the attack on Stable Diffusion’s latest models and on an AI model they trained themselves from scratch. When they fed Stable Diffusion just 50 poisoned images of dogs and then prompted it to create images of dogs itself, the output started looking weird—creatures with too many limbs and cartoonish faces. With 300 poisoned samples, an attacker can manipulate Stable Diffusion into generating images of dogs that look like cats.
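
The core idea behind this kind of prompt-specific poisoning is to perturb a “dog” image within a small, barely visible budget so that a model’s internal features read it as “cat”, then publish it with its honest “dog” caption. A minimal sketch of that feature-matching idea follows; it is not Nightshade’s actual optimization (see the paper linked in the comment below), and the `encode` function here is a made-up random-projection stand-in for the generator’s real image encoder.

```python
import torch

def encode(images: torch.Tensor) -> torch.Tensor:
    # Placeholder "feature extractor": a fixed random projection standing in
    # for the text-to-image model's image encoder (an assumption for the sketch).
    g = torch.Generator().manual_seed(0)
    proj = torch.randn(images[0].numel(), 128, generator=g)
    return images.flatten(1) @ proj

def poison(dog_img: torch.Tensor, cat_img: torch.Tensor,
           eps: float = 0.05, steps: int = 100, lr: float = 0.01) -> torch.Tensor:
    """Perturb a 'dog' image (within an eps budget, so it still looks like a dog)
    so its features move toward a 'cat' anchor image. Paired with its original
    'dog' caption, the sample nudges a model trained on it to draw cat-like dogs."""
    delta = torch.zeros_like(dog_img, requires_grad=True)
    target = encode(cat_img.unsqueeze(0)).detach()
    opt = torch.optim.Adam([delta], lr=lr)
    for _ in range(steps):
        feat = encode((dog_img + delta).clamp(0, 1).unsqueeze(0))
        loss = torch.nn.functional.mse_loss(feat, target)
        opt.zero_grad()
        loss.backward()
        opt.step()
        with torch.no_grad():
            delta.clamp_(-eps, eps)  # keep the perturbation visually small
    return (dog_img + delta).detach().clamp(0, 1)

# Toy usage with random "images":
dog, cat = torch.rand(3, 64, 64), torch.rand(3, 64, 64)
poisoned_dog = poison(dog, cat)  # publish this alongside the original "dog" caption
```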

  • possibly a cat@lemmy.ml · 1 year ago

    lain?!

    I see data poisoning as a continuing trend. However, this is a tricky battle.

    If you have a very distinctive digital art style, it has not been published on the internet yet, and you poison every single piece of work you create, then you might have some success foiling copycats. If you are a physical artist, you would also have to prevent pictures of your works from ever being shared.

    As the Tech Review article and other commenters mention, the images first have to be included in training sets. And those training sets are huge. (I don’t think many people recognize small/bespoke models as a major threat at this point in time.) You might get some short-lived success when a new fad first comes out and the training sets are minimal.
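
    As a very rough illustration of that scale problem (every number below is an assumption made up for the sketch, not a figure from the paper or from any dataset’s documentation): because the attack is prompt-specific, the denominator that matters is the number of images whose captions mention the targeted concept, but even that count is large for a popular concept in a web-scale set.

    ```python
    # Back-of-envelope only; both inputs are assumptions for illustration.
    total_images = 5_000_000_000   # assumed size of a web-scale training set
    dog_fraction = 0.001           # assumed share of captions mentioning "dog"
    poisoned = 300                 # samples the article says shifted "dog" toward "cat"

    dog_images = total_images * dog_fraction
    print(f"~{dog_images:,.0f} dog-captioned images (assumed)")
    print(f"poisoned share of the whole set:  {poisoned / total_images:.8%}")
    print(f"poisoned share of dog images:     {poisoned / dog_images:.4%}")
    ```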

    What would really be needed is the automatic inclusion of data poisoning in widely popular data/media sharing services, and then time for them to poison the data pool.
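
    A minimal sketch of where such a hook could sit in a sharing service’s upload path; every name here (`Image`, `store`, `poison`, `handle_upload`) is a hypothetical stand-in, not any real service’s API:

    ```python
    from dataclasses import dataclass

    @dataclass
    class Image:
        pixels: bytes
        caption: str

    def store(img: Image) -> None:
        pass  # placeholder for the service's storage backend

    def poison(img: Image) -> Image:
        return img  # placeholder for an actual perturbation step (see sketch above)

    def handle_upload(img: Image, user_opted_in: bool) -> None:
        # Poison-on-upload: the copy that becomes publicly scrapeable is the
        # perturbed one; the uploader can keep an unmodified original privately.
        store(poison(img) if user_opted_in else img)
    ```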

    However, even that is fallible. I expect that, given the way private property works, if data poisoning ever took off at scale then courts would rule it illegal (or at least civilly liable for damages). Also, AI researchers are already looking into ways to overcome data poisoning, and their countermeasures can be applied retroactively to already-collected data.
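
    For example, one generic countermeasure is simply filtering image/caption pairs that disagree before training. A hedged sketch of that idea follows; the `alignment` scorer is a placeholder (e.g. something CLIP-like), and this is not a method taken from the paper:

    ```python
    from typing import Callable, List, Tuple

    Pair = Tuple[bytes, str]  # (image bytes, caption)

    def filter_training_set(pairs: List[Pair],
                            alignment: Callable[[bytes, str], float],
                            threshold: float = 0.2) -> List[Pair]:
        """Drop pairs whose image and caption agree poorly under some image-text
        alignment score, on the theory that a poisoned 'dog' caption attached to
        cat-like features scores low. A strong enough attack can evade this, and
        honest but oddly-captioned images get discarded too."""
        return [(img, cap) for img, cap in pairs if alignment(img, cap) >= threshold]
    ```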

    Nonetheless, I find data poisoning to be an important and intriguing research field. There are valid use cases even if it never successfully extends artists’ control over their works.

    Edit: I don’t know if I was being dense, but it took me a while to find the study, so here’s the link - https://arxiv.org/abs/2310.13828v1