Humans share the web equally with bots, report warns amid fears of ‘dead internet’

Espiritdescali@futurology.today · 6 months ago

Humans share the web equally with bots, report warns amid fears of ‘dead internet’

9point6@lemmy.world · 6 months ago

I wonder what percentage of these bots actually add content to the internet though

I can believe 50% of traffic is bots, I can’t believe any more than 5-10% of that is not just running exploit scripts, scrapers or very simple engagement farming (e.g. load page, press like).

I might have the wrong impression, but “Bot” in average Joe’s vocabulary seems to imply this kind of astroturfing (often not actually a bot) or spambot type of bot, not any kind of non-human request like how Imperva are (correctly) using it.

lemmyng@lemmy.ca · 6 months ago

When you consider how much traffic goes towards the larger sites, it’s actually believable. Even before the great migration Reddit was infested with reposter bots whose sole purpose was to farm karma in order to later sell the accounts. Those bots have gotten more sophisticated now, replicating not only original posts but entire comment threads. That’s not new content, but it’s content nevertheless, especially in the context of the dead Internet theory. Yes, it’s engagement farming, but that engagement is getting more sophisticated, both to trick the user (to drive engagement) as well as to trick the server (to prevent getting blocked).

This is a very insidious problem, because it means that such bots can and will be abused by threat actors (both internal and external) to drive popular sentiment in certain directions. We know how susceptible a generation that only watched cable news became, imagine what such campaigns can do to internet generations - if you can generate content that supports your rhetoric faster than humans but without appearing fake, then you can drown out dissident speech. Brigading is bad already, and it will get worse.

9point6@lemmy.world · 6 months ago

When you consider how much traffic goes towards the larger sites

I think what I said still applies tbh, though I’m absolutely not disagreeing with you that the ~10% creating content isn’t getting much more sophisticated at a potentially alarming rate.

But as someone who has experience working as an engineer on some of the biggest sites on the internet—the sheer volume of basic scraper and exploit scanner traffic that sites get is truly staggering in some cases.

lemmyng@lemmy.ca · 6 months ago

the sheer volume of basic scraper and exploit scanner traffic that sites get is truly staggering in some cases.

Oh yes, absolutely. I’ve seen sites with millions of legitimate active users where we just dropped 98% of traffic because it’s all malicious, either exploit scanners or just plain DDoS attempts. Going back to your earlier comment,

I might have the wrong impression, but “Bot” in average Joe’s vocabulary seems to imply this kind of astroturfing (often not actually a bot) or spambot type of bot, not any kind of non-human request like how Imperva are (correctly) using it.

On paper, any kind of automated traffic, be it DDoS, scanners, or automated content generation is bot activity. What is happening now though is that while consumptive bot activity is steady (because the field is already saturated), generative bot activity is skyrocketing. What it means for humans is that it turns media consumption from walking through an orchard and ignoring the rotten fruit to wading through a lake of shit and finding half-edible scraps. And I harbor no illusion that it wasn’t bad before LLMs - even years ago I remember resetting the filters on my Reddit client and the feed getting inundated with ragebait, porn, and all sorts of low quality content. But when I had my filters they were effective, and that is becoming less so these days.

givesomefucks@lemmy.world · 6 months ago

It’s way past “like bots” but it wasn’t always nefarious.

The nefarious ones were good and hard to pick out. The majority were very shitty and obvious bots that individuals ran just to see how well it would work.

The thing is, some of those bots were set up with no end date, and the maker just kind of forgets about them. So we get a large percentage of them.

If Lemmy every gets big enough, we’ll have the same problem here.

efstajas@lemmy.world · 6 months ago

Yeah this headline is incredibly misleading. “Humans share the Internet equally with bots” at least heavily implies that 50% of content is created by bots, which is obviously not (yet?) the case.

TachyonTele@lemm.ee · 6 months ago

At least read the copy/pasted text in the post body.

efstajas@lemmy.world · edit-2 6 months ago

I did - the headline is still misleading. Headlines aren’t supposed to be misleading, the article itself being clear doesn’t change that.

TachyonTele@lemm.ee · 6 months ago

Oh no.

Anyways