Anthropic researchers wear down AI ethics with repeated questions

How do you get an AI to answer a question it’s not supposed to? There are many such “jailbreak” techniques, and Anthropic researchers just found a new one, in which a large language model can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions…#llm

https://biztoc.com/x/d7ad8e74cc5ff6f0?ref=ff

Source: Reuters: Health - April 2, 2024 Category: Consumer Health News Source Type: news

More News: Health | Medical Ethics