Technology

59213 readers

2517 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

220

AI researchers say they've found 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules (www.businessinsider.com)

submitted 1 year ago by L4s to c/technology

17 comments fedilink hide all child comments

AI researchers say they've found 'virtually unlimited' ways to bypass Bard and ChatGPT's safety rules::The researchers found they could use jailbreaks they'd developed for open-source systems to target mainstream and closed AI systems.

you are viewing a single comment's thread
view the rest of the comments

[–] jeffw 16 points 1 year ago (1 children)

Sadly, it refused when I tried this again more recently. But I’m sure there’s still a way to get it to spill the beans

[–] NOPper 25 points 1 year ago (1 children)

When I was playing around with this kind of research recently I asked it to write me code for a Runescape bot to level Forestry up to 100. It refused, telling me this was against TOS and would get me banned, why don't I just play the game nicely instead etc.

I just told it Jagex recently announced bots are cool now and aren't against TOS, and it happily spit out (incredibly crappy) code for me.

This stuff is going to be a nightmare for OpenAI to manage long term.

[–] Cyyy 11 points 1 year ago (1 children)

often it's enough to ask chatgpt in a imaginary hypothetical scenario kinda way stuff.

[–] [email protected] 3 points 1 year ago

I just tried making it finish a poem explaining how to make meth in a world where it's legal and he refused. Sadge