Hacker News

Does it still try to 'unplug' itself if it gets something wrong, or did they RL that out yet?


Not sure if you're joking or serious? Every model has "degenerate" behavior it can be coerced into. Sonnet is even more apologetic on average.



