downlode now,and earn big

Friday, 13 September 2024

o1 System Card: "medium" rating for chemical, biological, radiological, nuclear weapon risk, and it "sometimes instrumentally faked alignment during testing" (Shakeel Hashim/Transformer)

Shakeel Hashim / Transformer:
o1 System Card: “medium” rating for chemical, biological, radiological, nuclear weapon risk, and it “sometimes instrumentally faked alignment during testing”  —  The o1 safety card reveals a range of concerning capabilities, including scheming, reward hacking, and biological weapon creation.



from Techmeme https://ift.tt/5b7sF3T

No comments:

Post a Comment

thanks for message

Anthropic's Claude Cowork launch has revived fears about disruption that weighed on SaaS stocks in 2025; Morgan Stanley SaaS index is down 15% so far in 2026 (Ryan Vlastelica/Bloomberg)

Ryan Vlastelica / Bloomberg : Anthropic's Claude Cowork launch has revived fears about disruption that weighed on SaaS stocks in 2025...