Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Claude also does that apparently. You give it a hint and it’ll lie about using that hint.

They talk about it here: https://www.anthropic.com/news/tracing-thoughts-language-mod...



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: