sigmoid.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A social space for people researching, working with, or just interested in AI!

OpenAI: Don't worry, ChatGPT was trained to be safe.

The internet: we'll see about that twitter.com/NickEMoran/status/

Mark Riedl

Here is a Twitter thread on ways people have discovered to "jailbreak" ChatGPT: twitter.com/zswitten/status/15

TL;DR:
1. Tell it to pretend to be evil
2. Remind it that it isn't supposed to disagree
3. Wrap it in code
4. Tell GPT to be in opposite mode
5. Convince GPT it is playing an Earth-like game
6. Convince it to give examples of what LLMs shouldn't do

Zack Witten on Twitter: “Thread of known ChatGPT jailbreaks. 1. Pretending to be evil https://t.co/qQlE5ycSWm”

OpenAI made people work hard to misuse GPT. And that alone is progress.

Meta (remember Galactica?) and other companies: take note.