Anthropic: ↩️ One notable example is a "secrecy" feature. We observe that it fires for descriptions of people or characters keeping a secret. Activating this feature results in Claude withholding information from the user when it otherwise would not.
Tue May 21 2024 23:08:35 GMT+0800 (China Standard Time)