Latest in AI

Showing: llm-safety, General


OpenAI Help: Lockdown Mode ★ 74
Simon Willison's Weblog · 52 days ago · Commentary
Simon Willison notes that OpenAI’s previously teased Lockdown Mode is now live for eligible personal and self-serve Business ChatGPT accounts. The feature does not stop prompt injections from appearing in content, but limits outbound network requests that could leak sensitive data. He sees it as a direct mitigation for the exfiltration leg of the “Lethal Trifecta,” while implying default ChatGPT settings are not robust against determined data theft attempts.
Import AI 450: China's Electronic Warfare AI Model, Traumatized LLMs, and Scaling Laws for Cyberattacks ★ 75
Import AI (Jack Clark) · 127 days ago · Commentary
In this issue of Import AI 450, author Jack Clark explores three key topics with profound implications for the future of technology, security, and geopolitics…
Google DeepMind Launches the FACTS Benchmark Suite: Systematically Evaluating the Factuality of Large Language Models ★ 80
Google DeepMind Blog · 231 days ago · Release
As large language models (LLMs) are deployed across a wide range of industries, ensuring the "factuality" of model outputs and reducing "hallucination" has…