Google just announced it’s giving website publishers a way to opt out of having their data used to train the company’s AI models while remaining accessible through Google Search.
many sites have moved to block the web crawler that OpenAI uses to scrape data and train ChatGPT. However, there have been concerns over how to block out Google. After all, websites can’t close off Google’s crawlers completely, or else they won’t get indexed in search.
Nice of these AI companies to add opt out capability after they’ve trained their models on all available data.
This is by design. Helps to stop any competition as they will now struggle to get the same amount of training data.
This overall bullshit of doing opt-out fucking sucks. Opt-IN! Pissing off people and the. going “oh well there’s an opt-out” is so stupid. New experimental features that don’t directly benefit the user should be that they must have to opt-in manually. Off by default.
When a company gives you a bunch of free services that work pretty well and everybody uses them, they get a little cocky
This doesn’t change anything. They’re still violating copyright law if they scrape any data.
For search results, nobody really cared, because it was useful to everyone. Now they’re reprocessing it into output and competing with the original sites. That’s a big no-no.