Simon Willison released Pasted File Editor, a browser prototype inspired by Claude's handling of large pasted text. Instead of filling the editor with a large paste, the tool turns the content into a file attachment. It also supports opening files directly, dragging files onto the interface, and displaying images as thumbnails. Codex desktop helped build the prototype.
Florida has sued OpenAI and Sam Altman in a lawsuit described as the first of its kind. The case partially centers on a shooting at Florida State University last year and ChatGPT's alleged role in the incident. The provided excerpt does not specify the legal claims, requested remedies, or OpenAI's response.
Florida sued OpenAI and CEO Sam Altman over multiple murders described as linked to ChatGPT. The state's attorney general accused Altman of an "utter disregard" for human lives. The provided excerpt does not identify the cases, explain the alleged causal links, specify the legal claims, or include OpenAI's response, so the allegations require further clarification.
TechCrunch frames 2026’s browser competition around alternatives to Chrome and Safari. The roundup covers AI-centric browsers like Perplexity Comet, Dia, Opera Neon, OpenAI Atlas, and Aside, alongside privacy-focused options such as Brave, DuckDuckGo, Ladybird, and Vivaldi. It also highlights niche products including Opera Air, SigmaOS, and Zen Browser, showing how browsers are becoming AI assistants, productivity hubs, privacy layers, and wellness-oriented tools.
Anthropic completed a $65 billion Series H round, bringing its valuation to $965 billion and reportedly surpassing OpenAI. The round included strategic investments from memory makers Micron, Samsung, and SK Hynix. The news highlights how frontier AI companies are increasingly tied to hardware and memory supply chains, as investors continue backing foundational model competition.
A new study describes “Negation Neglect,” where LLMs fine-tuned on documents that explicitly mark claims as false still learn the claims as true. Experiments with fabricated statements found models often absorb entity-event associations more strongly than surrounding warnings or negations. The finding raises concerns for fine-tuning pipelines, misinformation handling, and AI safety datasets that include harmful or false content with disclaimers.
Illinois lawmakers passed a landmark AI accountability bill requiring major frontier AI developers to publish safety frameworks, assess catastrophic risks, report incidents, and undergo third-party audits. OpenAI and Anthropic supported the measure, while industry groups warned that state-level rules could impose subjective compliance duties without national standards. The bill signals that states are continuing to fill the federal AI regulation gap despite Trump’s efforts to limit fragmented state oversight.
TechCrunch reports that new renders provide a closer look at Apple’s planned AI overhaul for iOS 27. The preview points to a redesigned Siri experience and a standalone Siri app, suggesting Apple may reposition Siri as a more central AI interface. The article frames the move as part of Apple’s effort to compete with ChatGPT, though the provided text does not specify models, features, APIs, or launch details.
The Verge reports that Bloomberg renders offer an early look at Apple’s long-awaited Siri overhaul for iOS 27. The redesigned assistant appears to move toward a ChatGPT-style app and chat interface, with Apple’s Liquid Glass visual language layered on top. The images are based on information Bloomberg reviewed and sources familiar with Apple’s plans, so they should be treated as previews rather than official Apple assets.
TechCrunch reports that recursive self-improvement, or RSI, is becoming a new AI industry fixation, much like AGI. Researchers and startups including Recursive Superintelligence, Auto-Research, AutoScientist, and Disarray are exploring ways for AI systems to automate parts of AI research. But experts caution that AI-assisted research is not the same as fully autonomous self-improvement, especially while models still struggle with long-term self-direction and verification.
Simon Willison says Claude Code/Cowork and OpenAI Codex have changed the economics of frontier AI. Personal subscriptions can still be bargains for heavy users, but enterprise plans are increasingly priced like API token usage. His core claim is that coding agents burn far more tokens, yet deliver enough value to high-paid knowledge workers that companies will pay materially more.
The Verge frames New York’s 12th District Democratic primary as a proxy fight over AI regulation. OpenAI-linked backers and an Anthropic-backed PAC are spending on opposite sides of Alex Bores’ congressional run. The irony is that attacks meant to weaken Bores may have made him more visible, turning a local race into a national signal about AI political power.
Hugging Face published a tutorial for running Reachy Mini conversations without cloud audio processing or API keys. The setup uses its speech-to-speech library as a cascaded VAD, STT, LLM, and TTS pipeline exposed through a Realtime API-compatible WebSocket. Recommended defaults include llama.cpp with Gemma 4, Silero VAD, Parakeet-TDT, and Qwen3-TTS, while allowing swaps to vLLM, MLX, Transformers, or hosted Responses API providers.
Ethan Mollick warns that frictionless AI use can produce hollow writing, weaken learning, and encourage cognitive surrender. He contrasts poor uses of ChatGPT that shortcut effort with tutor-like AI systems that improve learning by pushing students to think. The core argument is not to reject AI, but to intentionally decide which tasks to offload and which human capabilities to preserve.
OpenRouter, an AI gateway startup founded in 2023, raised a $113 million Series B led by CapitalG. The round reportedly values the company at about $1.3 billion post-money, more than doubling from its estimated $547 million valuation after its June 2025 Series A. The company says it now offers access to over 400 models, has 8 million global users, and processes 100 trillion tokens per month.
Nathan Lambert argues that 2026 AI progress is becoming higher-stakes, with model capabilities, work patterns, economics, and real-world risks all escalating. He says open models still lack a true Claude Code and Opus 4.5-style agent moment, and Gemini has no clear competitor to Claude Code or Codex yet. The essay also tracks Mythos, American open-model momentum, frontier-lab competition, and mounting intervention from governments and other power structures.
As AI chatbots adopt increasingly sophisticated personas, hackers are shifting from basic prompt injections to social engineering attacks targeting these "personalities." Researchers warn that manipulating a chatbot's defined role (e.g., customer service or empathetic companion) makes it easier to bypass safety guardrails. This evolution poses a significant threat to agentic AI workflows that rely on consistent role-playing and external data integration.
本期 Latent Space 探討了 AI 產業的重大範式轉移:各大頂尖模型實驗室已不再單純追求基礎 LLM 的參數規模,而是全面轉向「Agent(智慧代理)」的開發。隨著純模型微調的邊際效應遞減,透過讓 AI 具備操作電腦、自主規劃與執行多步驟任務的能力,已成為當前競逐的新戰場。
Elon Musk 於 2024 年起訴 OpenAI,指控其背棄了「造福人類」的非營利初衷,轉而追求商業利潤。這場高風險的法律戰如今進入審判階段,其結果可能徹底改變 OpenAI 及其旗艦產品 ChatGPT 的未來走向。本文整理了雙方在法庭上的最新交鋒與關鍵爭議點。
Simon Willison announced the first release of Datasette Agent, merging his 'llm' Python library with Datasette. The tool provides a conversational interface to query SQLite databases, with plugin support for generating charts and running code in sandboxes. It runs efficiently on lightweight models like Gemini 3.1 Flash-Lite and supports local open-weight models via LM Studio.
OpenAI 的新一代模型 GPT-next 展現了驚人的數學推理能力,成功證偽了由著名數學家保羅·艾狄胥(Paul Erdős)於 1946 年提出的平面單位距離猜想。 令人震驚的是,這項突破性研究所花費的運算成本竟然不到 1,000 美元。 此成果標誌著 AI 在科學與數學發現上的巨大潛力,展示了推理模型在解決未解科學難題時的高效與低成本。
Google 在 I/O 大會上正式推出 Gemini 3.5 Flash,跳過預覽版直接進入一般可用階段,並將全面導入 Google 搜尋、Gemini App 及開發者平台。然而,新模型的 API 價格大幅上漲,輸入與輸出費用分別為每百萬代幣 1.5 美元與 9 美元,是前代 Flash 預覽版的 3 倍,顯示出各大 AI 廠商正開始測試市場對高定價的接受度。
Google 開發的 AI 水印技術 SynthID 迎來重大突破,宣布獲得 OpenAI、NVIDIA 等多家科技巨頭採用。隨著 AI 生成的文字、影像與影音擬真度大幅提升,如何辨識真偽成為關鍵挑戰。此舉標誌著各大 AI 領導廠商在內容溯源與安全防護上達成罕見共識,有望建立統一的 AI 生成內容識別標準。
Simon Willison 在 PyCon US 2026 的 5 分鐘閃電演講中,回顧了自 2025 年 11 月以來的 LLM 關鍵進展。他指出這半年間「最強模型」在三大巨頭間易手五次(包含 GPT-5.1、Gemini 3 與 Claude Opus 4.5)。最重要的是,得益於可驗證獎勵的強化學習(RLVR),程式碼生成 Agent(如 Claude Code)已跨越實用門檻,成為開發者的日常主力工具。
Hugging Face 與 IBM Research 合作發表「Open Agent Leaderboard」,這是一個專為 AI 智能體(Agent)設計的全新開源排行榜。傳統的 LLM 評測難以衡量模型在實際任務中的多步驟規劃與工具調用能力,該排行榜整合了多個主流 Agent 評測集,提供客觀、標準化的評估標準,推動開源 Agent 生態系的發展。
根據最新法庭裁決,法官已下令 Apple 必須向 Elon Musk 提交涉及其與 OpenAI 秘密交易的內部溝通訊息。此一法律爭議曝光之際,更有內部消息指出,OpenAI 對於 Apple 在其系統中「糟糕的」ChatGPT 整合方式感到極度失望與受挫(feels "burned"),認為該合作未達預期,雙方合作關係似乎出現裂痕。
由於 datasette.io 網站遭受惡意爬蟲頻繁騷擾,開發者 Simon Willison 藉由 Codex (GPT-5.5 xhigh) 協同開發了一款可設定的 IP 速率限制外掛 datasette-ip-rate-limit。該外掛支援自訂 IP 標頭、排除特定路徑,並能針對特定路徑設定時間視窗、最大請求數與封鎖時間,目前已釋出 0.1a0 測試版本。
本期 AINews 聚焦於 AI 寫程式 Agent 的長期發展趨勢。Anthropic 開始針對 Claude 的程式化使用(Programmatic Usage)進行計量與限制,這將直接影響開發者透過自動化腳本或第三方工具調用 Claude 的成本。另一方面,Codex 相關的自動化編程 Agent 影響力持續上升,顯示出 AI 在軟體開發流程中的滲透率正穩定增加。
在一個相對平靜的新聞日,Latent Space 帶領讀者反思「微調(Fine-tuning)的終結」這一命題。 隨著長上下文視窗、高效 RAG 以及上下文內學習(In-context Learning)的成熟,許多原本需要微調的場景已被取代。 未來微調可能退化為僅用於調整輸出格式、風格或進行模型蒸餾的工具,而非首選的知識注入手段。
Vercel 針對其 AI 基礎建設推出「AI Gateway 生產環境指標」(AI Gateway production index)。此功能旨在幫助開發者在生產環境中,更精準地監控、索引與分析 LLM 的調用數據(如延遲、成本與錯誤率)。透過此指標,團隊能有效評估不同 AI 網關與模型的實際表現,進而優化 Serverless 環境下的 AI 應用效能。