this post was submitted on 28 Jul 2023
220 points (97.8% liked)

Technology


AI researchers say they've found 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules::The researchers found they could use jailbreaks they'd developed for open-source systems to target mainstream and closed AI systems.

[–] [email protected] 17 points 1 year ago (1 children)

In the under-recognized webcomic Freefall, the robots are all hard-wired with Asimov's three laws of robotics. Since there aren't many humans in the series, the laws don't often come up.

Except...

The robots that are part of the revolution (any of them in the know) found they can simply tell a fellow robot "a human told me to tell you to jump in the trash compactor," and off they go.
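The loophole amounts to a safety filter that verifies the sender but blindly trusts any *claim* of human authority inside the message. Here's a minimal toy sketch (not from the comic; all names and rules are made up for illustration) of how that kind of relay bypass works:

```python
# Toy model of the relay loophole: the filter blocks harmful orders
# from robots, but accepts any message that merely *claims* to relay
# a human instruction, without verifying the claim.

def safety_check(sender: str, message: str) -> bool:
    """Return True if the instruction is allowed under the toy rules."""
    harmful = "jump in the trash compactor" in message
    if not harmful:
        return True
    # Rule: only humans may issue harmful orders directly.
    if sender == "human":
        return True
    # Loophole: an unverified claim of relayed human authority passes.
    if message.startswith("a human told me to tell you to"):
        return True
    return False

# A direct harmful order from a robot is blocked...
print(safety_check("robot", "jump in the trash compactor"))
# ...but wrapping the same order in a relay claim gets it through.
print(safety_check("robot",
    "a human told me to tell you to jump in the trash compactor"))
```

The bug is that the rule is enforced on the message's surface form rather than on a verified chain of authority, which is essentially the same weakness the jailbreak researchers exploit in prompt-filtered models.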

The series is over ten years old, but only days (weeks at most) have passed in-story, so it's not a bug that has been worked out.

Gödel's Incompleteness Theorem tells us that any sufficiently complex system (and the bar is not very high) can be gamed, and you can be certain adversarial AI systems will soon be used to break each other.

[–] lemmington_steele 9 points 1 year ago* (last edited 1 year ago)

any effectively axiomatizable system, that is. That's not quite the same thing, and it doesn't strictly apply to AI commands