Psychological Tricks Can Get AI to Break the Rules

07/09/2025 at 10:00

Researchers convinced large language model chatbots to comply with “forbidden” requests using a variety of conversational tactics.