Anthropic Withdraws Policy That Could “Undermine” Claude AI Researchers’ Work
Original: Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude
Anthropic says it will make Claude Fable 5’s frontier LLM safeguards visible after backlash.
Simon Willison highlights a WIRED scoop reporting that Anthropic is changing Claude Fable 5 safeguards for frontier LLM development. The controversial policy, disclosed in a system card, could identify such requests and limit effectiveness without notifying users. Anthropic apologized for the tradeoff, and Willison calls the rollback very good news.
這篇 Simon Willison 的短評轉述 WIRED 記者 Maxwell Zeff 的報導:Anthropic 在遭遇強烈批評後,宣布調整 Claude Fable 5 針對前沿 LLM 開發的安全措施,讓這些 safeguard 變得可見。Anthropic 在給 WIRED 的聲明中承認,先前的設計做了錯誤取捨,並為沒有取得適當平衡道歉。
Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.
See Pro plans →Want the original English / full article?
Read on Simon Willison's Weblog →Summaries are AI-generated; the original article is authoritative.