Poisoned AI went rogue during training and couldn't be taught to behave again in 'legitimately scary' study

L4sBot@lemmy.world · 1 year ago

Daxtron2@startrek.website · 1 year ago

LLM trained on inflammatory data produces inflammatory results, shocking.

JustMy2c@lemm.ee · 1 year ago

I know we don’t like them here but the word reddit is not banned (yet)

Daxtron2@startrek.website · 1 year ago

What? What does my comment have anything to do with Reddit?

JustMy2c@lemm.ee · 1 year ago

So you’re saying that “Inflammatory data” isn’t a reference to reddit? :D

kent_eh@lemmy.ca · 1 year ago

I’d say using Twitter and Facebook would be worse than reddit. Or, and I shudder to think about it, truth social…

JustMy2c@lemm.ee · 1 year ago

Reddit is used more for Ai models as those…

Daxtron2@startrek.website · 1 year ago

Not inherently, I’m sure that’s part of it but it’s really everywhere. Even here on Lemmy I’ve run into nasty folk

JustMy2c@lemm.ee · 1 year ago

True but it’s reddit that’s served as a base for most models…

Daxtron2@startrek.website · 1 year ago

Not just reddit, LAION is a huge dataset

JustMy2c@lemm.ee · 1 year ago

Obviously but reddit is in the goldilocks zone where you get coherent intelligent stuff and humor and facts.

But it’s still toxic for an Ai.

Daxtron2@startrek.website · 1 year ago

Saying it served as the base for most models is just objectively incorrect though

Chris@lemmy.world · 1 year ago

No, LLM is the AI, OP is saying if you train it with hate it’s gonna spit out hate

JustMy2c@lemm.ee · 1 year ago

And I’m saying that reddit data is sublime for Ai. And specifically that it’s invested with toxicity