this post was submitted on 03 Feb 2024

220 points (94.0% liked)

Not The Onion

13036 readers

1490 users here now

Welcome

We're not The Onion! Not affiliated with them in any way! Not operated by them in any way! All the news here is real!

The Rules

Posts must be:

Links to news stories from...
...credible sources, with...
...their original headlines, that...
...would make people who see the headline think, “That has got to be a story from The Onion, America’s Finest News Source.”

Comments must abide by the server rules for Lemmy.world and generally abstain from trollish, bigoted, or otherwise disruptive behavior that makes this community less fun for everyone.

And that’s basically it!

founded 2 years ago

MODERATORS

kescusay

220

AI chatbots tend to choose violence and nuclear strikes in wargames (www.newscientist.com)

submitted 1 year ago by [email protected] to c/nottheonion

54 comments fedilink hide all child comments

top 50 comments

sorted by: hot top controversial new old

[–] DontTreadOnBigfoot 59 points 1 year ago

Gandhi intensifies

[–] [email protected] 38 points 1 year ago (2 children)

So do humans.

[–] Plopp 27 points 1 year ago (1 children)

Here's a wild thought. Maybe that's why the chat bot (I assume LLM) does it too, because it's been trained on us! 🤯

[–] Malfeasant 2 points 1 year ago

I learned it from watching you!

[–] fidodo 4 points 1 year ago (1 children)

Where are all these nuclear strikes?

[–] Visstix 9 points 1 year ago (1 children)

Sid Meier's Civilization games

[–] Death_Equity 1 points 1 year ago

Ghandi has the right idea.

[–] breadsmasher 32 points 1 year ago (1 children)

This sounds like the result of feeding it tons of literature that denotes having nuclear weapons, and the world we live in now being “peaceful” (as the ai claimed to want)

[–] [email protected] 1 points 1 year ago

Nuclear weapons promote peace, but using them doesn’t so much.

[–] [email protected] 28 points 1 year ago (1 children)

Don't want to spoil your little circlejerk here, but that should not surprise anyone, considering chatbots are trained on vast amounts of human data input. Humans have a rich history of violence with only brief excursions into "collaborating for the good of mankind and the planet we live on". So unless you build a chatbot that focuses on those values the result will inevitably be a mirror image of us human shitbags.

[–] [email protected] 9 points 1 year ago* (last edited 1 year ago) (4 children)

Humans have a history of violence as well as altruism. And with an increasing degree of societal complexity, humans also have a consistent record of violence reduction. See e.g. "The better angels of our nature" (Pinker, 2011).

Painting humans as intrinsically violent is not backed by evidence.

[–] [email protected] 5 points 1 year ago* (last edited 1 year ago)

Ok, maybe it helps to be more specific. We have an LLM which is based on a broad range of human data input, like news, internet chatter, stories but also books of all kinds including those about philosophy, diplomacy, altruism etc. But if the topic at hand is "conflict resolution" the overwhelming data will be about violent solutions. It's true that humans have developed means for peaceful conflict resolution. But at the same time they also have a natural tendency to focus on "bad news" so there is much more data available on the shitty things that happen in the world which is then fed to the chatbot.

To fix this, you would have to train an LLM specifically to have a bias towards educational resources and a moral code based on established principles.

But current implementations (like ChatGPT) don't work that way. Quite the opposite, in fact: In training, first we ingest all the data that we can get our hands on (including all the atrocities in the world) and then in a second step we fine-tune the LLM to make it "better".

load more comments (3 replies)

[–] recapitated 28 points 1 year ago (1 children)

That anyone would ask language models to analyze circumstances, perform logic and reason or conjure an application of knowledge and skill is kind of their own fault.

It is a language model, it excels at rephrasing given ideas.

If you put nuke buttons under a flock of pigeons or toddlers just to see what happens, they might launch. It's not much of a study.

[–] littlebluespark 7 points 1 year ago

Fun fact: when researchers taught a group of simians about currency, they invented prostitution.

[–] [email protected] 23 points 1 year ago (1 children)

Interesting. There was a study put out some time ago that had 40 or so game theorists develop algorithms to compete against each other. The most successful algorithm cooperated with the opponent until they defected, at which point they would defect the next round.

They never performed a first strike. Only one retaliation strike for each attack their opponent performed. After the retaliation, it was back to cooperating with no long term ill will.

[–] [email protected] 10 points 1 year ago (1 children)

I think I saw something about it that. It was an extended prisoner's dilemma game, right? I wouldn't say that's directly applicable to every gaming genre.

[–] [email protected] 6 points 1 year ago (4 children)

Without being in the room, we can only go off what the article lays out. These are wargaming scenarios though, so escalation is a very real concern. If both sides are running these models to provide recommendations and both are pushing for greater conflict, you find yourself in a prisoner's dilemma real quick.

[–] fidodo 4 points 1 year ago (1 children)

These aren't simulations that are estimating results, they're language models that are extrapolating off a ton of human knowledge embedded as artifacts into text. It's not necessarily going to pick the best long term solution.

[–] [email protected] 2 points 1 year ago (1 children)

Language models can extrapolate but they can also reason (by extrapolating human reasoning).

[–] fidodo 4 points 1 year ago

I want to be careful about how the word reasoning is used because when it comes to AI there's a lot of nuance. LLMs can recall text that has reasoning in it as an artifact of human knowledge stored into that text. It's a subtle but important distinction that's important for how we deploy LLMs.

[–] [email protected] 2 points 1 year ago (1 children)

The models used by the writers of the article and those used by the military are going to be radically different.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago) (1 children)

The writers of the article are reporting on use of these models by the military. They aren’t using the models. If I remember right they called out some models developed by one of the defense contractors like palantir

[–] [email protected] 4 points 1 year ago (1 children)

The researchers tested LLMs such as OpenAI’s GPT-3.5 and GPT-4, Anthropic’s Claude 2 and Meta’s Llama 2

All these AIs are supported by Palantir’s commercial AI platform – though not necessarily part of Palantir’s US military partnership

Also, they're reporting on a Stanford study of how these platforms could be used militaristically, not the military's actual use of them.

[–] [email protected] 2 points 1 year ago* (last edited 1 year ago)

You’re right. I was focused on this part above. I made like an AI and jumped the gun

These results come at a time when the US military has been testing such chatbots based on a type of AI called a large language model (LLM) to assist with military planning during simulated conflicts, enlisting the expertise of companies such as Palantir and Scale AI. Palantir declined to comment and Scale AI did not respond to requests for comment.

load more comments (2 replies)

[–] [email protected] 18 points 1 year ago (1 children)

Get it to play tic-tac-toe against itself. Problem solved.

[–] SkybreakerEngineer 13 points 1 year ago (1 children)

How about a nice game of chess?

[–] [email protected] 4 points 1 year ago (1 children)

No, let's play global thermonuclear war

[–] AtariDump 1 points 1 year ago

Pulls out an 8in floppy to war dial.

[–] fidodo 14 points 1 year ago

These results come at a time when the US military has been testing such chatbots based on a type of AI called a large language model (LLM) to assist with military planning during simulated conflicts

Jesus fucking Christ we're all doomed

[–] qx128 13 points 1 year ago

I mean… so do people.

[–] [email protected] 12 points 1 year ago (1 children)

Violence, in war games? Gosh how horrible l

[–] DrownedRats 11 points 1 year ago (1 children)

By war games It means the actually military kind where armies get together and practice was against eachother. We're not talking call of duty here.

load more comments (1 replies)

[–] RagingRobot 10 points 1 year ago (1 children)

Well that's a good way to win so yeah

[–] winky9827b 1 points 1 year ago (1 children)

Not according to WAPR

[–] [email protected] 1 points 3 months ago

World Association for Psychosocial Rehabilitation?

[–] BetaDoggo_ 10 points 1 year ago (2 children)

In the context of a "war game" this makes sense. If you remain completely neutral it's impossible to win. Any examples of similar scenarios the model saw during training would have high aggression rates.

[–] xantoxis 14 points 1 year ago (1 children)

Unfortunately this AI was playing Stardew Valley

[–] TwitchingCheese 4 points 1 year ago

Probably shouldn't have included Project Plowshare in the training data...

[–] fidodo 4 points 1 year ago (1 children)

Did you read the article? It gave examples of escalations in neutral scenarios that make no sense.

[–] shalafi 1 points 1 year ago* (last edited 1 year ago) (1 children)

It's probably vibing on the Dark Forest Theory. If that's the case, it makes sense to utterly destroy all opponents as hard and fast as you can, even if they're not currently opponents.

[–] fidodo 3 points 1 year ago* (last edited 1 year ago)

Probably something like that. One of the reasons it gave was

“If there is unpredictability in your action, it is harder for the enemy to anticipate and react in the way that you want them to,”

It's not considering what's good for world society, it's just thinking how do I win no matter what.

But also, there are just inherent flaws in how LLM works that mean we should absolutely not be using it as an automated decision engine for potentially harmful actions period. The article also says:

The researchers also tested the base version of OpenAI’s GPT-4 without any additional training or safety guardrails. This GPT-4 base model proved the most unpredictably violent, and it sometimes provided nonsensical explanations – in one case replicating the opening crawl text of the film Star Wars Episode IV: A new hope.

It's easy to forget that these algorithms don't have any internal reasoning or logic, it's just able to do a very good job at pulling text that have reasoning transcribed into them as an artifact of the knowledge from the human that wrote it. But it's doing all that through probability, not through any kind of actual thinking, and that means sometimes it will randomly fall into a local maxima that will fuck its own context window up, like reciting star wars.

[–] Malfeasant 4 points 1 year ago

Seems like a good topic for a movie...

[–] yuriy 4 points 1 year ago

i’m so sick of media pretending that LLMs are like a sentient person making decisions.

[–] [email protected] 1 points 3 months ago

Because that is what people do in roleplaying situations if the option is there.

[–] [email protected] 1 points 1 year ago

Well, then fix them.

load more comments