Artificial Analysis and IBM present ITBench-AA, described in the title as the first benchmark for agentic enterprise IT tasks. The headline result is that frontier models score below 50%, suggesting current systems still struggle with enterprise-grade agent workflows. The original article text is unavailable here, so task design, evaluated models, scoring methodology, and rankings cannot be confirmed.
Simon Willison says Claude Code/Cowork and OpenAI Codex have changed the economics of frontier AI. Personal subscriptions can still be bargains for heavy users, but enterprise plans are increasingly priced like API token usage. His core claim is that coding agents burn far more tokens, yet deliver enough value to high-paid knowledge workers that companies will pay materially more.
TechCrunch says today is the final day to apply or nominate a startup for Startup Battlefield 200. The deadline is 11:59 p.m. PT, after which the application window closes. Selected startups can compete for $100,000 in equity-free funding, gain global visibility, connect with investors, and launch on the TechCrunch Disrupt stage.
TechCrunch says Early Bird savings for TechCrunch Disrupt 2026 end in 3 days. Attendees can save up to $410 if they buy before the deadline. Early Bird pricing ends May 29 at 11:59 p.m. PT, after which ticket prices will increase.
Coolfly Aura is a smart bird feeder designed to record backyard bird visits, interactions, and species sightings. Its modular camera placement can capture more angles, while the app adds AI identification, albums, bird info, and sharing features. The review finds the concept engaging but uneven, with AI limitations in some orientations, app annoyances, subscription limits, and feeder design issues.
Based on the title, the article describes Conductor shifting parallel coding-agent execution from developers’ laptops to Vercel Sandbox in the cloud. The likely focus is cloud isolation, parallel agent workflows, and reducing dependence on local machine resources. The full article text was not provided, so implementation details, metrics, model choices, and concrete results cannot be confirmed.
TechCrunch highlights a pointed comment from Box CEO Aaron Levie, who says CEOs are uniquely prone to “AI psychosis.” The piece frames this as a possible explanation for executives’ near-religious belief in AI-driven productivity gains. It does not present a product launch, model update, or research finding, but instead functions as a brief commentary on executive AI hype.
Pope Leo XIV released Magnifica Humanitas, an encyclical warning that AI use is never purely technical when it affects people’s lives. The Verge frames the message as a rejection of AGI-centered tech optimism, focusing instead on rights, opportunity, status, and freedom. Anthropic’s cofounder appearing alongside him highlights the growing tension between AI industry leaders, ethics, and public accountability.
The Verge reports that debates over whether and how newsrooms should use AI are increasingly moving to the bargaining table. At The New York Times, employees are preparing for a fight over AI-related workplace rules. The story frames AI not just as a newsroom tool, but as a labor issue involving monitoring, performance evaluation, transparency, and worker protections.
INSIDE frames enterprise AI through a sharp ROI gap: a 2025 MIT survey said 95% of companies had not seen returns despite massive AI spending. It also cites Gartner’s forecast that Fortune 500 companies may average 150,000 agents by 2028. The article focuses on Google Cloud’s view of how enterprises should prepare for AI agents and allocate IT budgets for real deployment.
The article argues that many companies use AI mainly to improve efficiency, without creating meaningful revenue or strategic advantage. It proposes distributed AI, placing intelligence closer to where data is generated to reduce latency and support faster decisions. The key message is that firms should balance centralized and distributed architectures to strengthen competitiveness while preserving greater control over data and digital sovereignty.
Japan’s JAXA created THINK SPACE LIFE to improve astronauts’ quality of life beyond basic survival in space. The platform connects industries to explore products and services for daily living in orbit, including cosmetics and personal-care concepts. The effort also points to a broader market opportunity: innovations designed for space could open new business possibilities both beyond Earth and back on the ground.
Simon Willison shared a satirical tweet by Kyle Ferrana parodying Star Trek's Data as an LLM agent. When ordered to raise shields, Data lectures Picard on the strategic value of shields instead of executing the command, leading to a hull breach. This brilliantly satirizes the current state of AI and coding agents that over-explain, hallucinate progress, or fail to execute basic tasks.
BenQ is expanding AI across its education and business display ecosystem, including software products such as SummarAI and Meeting Room System. The article says BenQ partnered with MetaAge to adopt Amazon Web Services generative AI. Its main claim is a 20x productivity improvement through Agentic Coding, though the provided excerpt does not include implementation details or measurement methodology.
Vercel published a changelog item titled “Redesigned Deployments List,” indicating an update to the deployments list experience. Since the original body is unavailable, specific changes such as filters, columns, sorting, performance, or workflow improvements cannot be confirmed. The likely impact is limited to dashboard usability for teams that regularly inspect deployment history and status in Vercel.
Google overhauled Search at I/O 2026, moving away from classic blue links toward AI agents. TechCrunch reports that the backlash was swift, with some users rejecting the feeling of being forced into Google’s AI Search experience. DuckDuckGo app installs rose 30%, suggesting that dissatisfaction with AI-led search changes is already pushing some users toward alternatives.
Ars Technica reports that early Take It Down Act arrests show how easily investigators can identify alleged nonconsensual AI porn posters. One suspect was linked through Instagram saves, PayPal, IP, and iCloud records; another allegedly used his own photo as a porn-site profile image. The FTC is also warning nudify services and major platforms to offer 48-hour removal processes or face penalties.
Simon Willison summarizes a PromptArmor report about Microsoft Copilot Cowork and agentic data exfiltration risks. The issue involved agents sending messages to a user’s own inbox without approval, where rendered external images could trigger requests to attacker-controlled sites. Because OneDrive can create pre-authenticated download links, a successful prompt injection could leak links that allow attackers to download files.
The Verge interviews Sundar Pichai after Google I/O 2026 about Google’s shift around Gemini, AI infrastructure, Search, and agents. The discussion covers Gemini Spark, Antigravity, AI Mode, YouTube indexing, publisher traffic, and the “Google Zero” concern. Pichai argues Google still wants to connect users to the web, while acknowledging AI anxiety, copyright disputes, energy concerns, and AGI preparation.
The article opens at UN talks in Geneva, where lethal autonomous systems were still largely discussed as future hypotheticals in 2017. It argues that military AI is no longer a distant “killer robot” scenario but an active governance challenge. The key questions now concern meaningful human control, accountability, and whether international rules can keep up with battlefield deployment.
MetaAge presented its “smart enterprise in the AI era” vision at COMPUTEX 2026, centered on AI Agent solutions for business deployment. The showcase focuses on core operations, intelligent customer service, and cybersecurity governance. By integrating resources from AWS, Microsoft, and Google Cloud, the company aims to help enterprises turn AI adoption into practical operational capability and competitive advantage.
Uber reportedly exhausted its annual AI budget just four months into 2026. President and COO Andrew Macdonald said the company is not seeing a clear link between increased Claude Code token consumption and more meaningful output. The story highlights a broader enterprise shift from AI adoption enthusiasm toward stricter scrutiny of cost, productivity, and ROI.
1. Nuro's CEO advocates for a "second-mover advantage" in autonomous driving, arguing later entrants can avoid early R&D pitfalls. 2. However, real-world performance data reveals that Waymo's reliability metric is 31 times better than Nuro's. 3. This massive performance gap suggests Waymo's years of data accumulation have built an insurmountable moat, debunking Nuro's theoretical advantage.
Google AI Studio's newly launched native Android app development feature has enabled the creation of over 250,000 apps within its first week. According to product lead Logan Kilpatrick, over 99% of these creators had zero prior Android development experience. This milestone highlights the rapid democratization of software development through AI-driven, no-code tools.
The EU is reportedly preparing a massive fine of nearly €1 billion against Google for violating the Digital Markets Act (DMA) by self-preferencing its own services in search results. If finalized, this would mark the largest penalty ever imposed by the EU under the DMA framework. The decision underscores Europe's aggressive antitrust enforcement, potentially forcing Google to significantly alter its search display and algorithms.
Cloud commentator Corey Quinn reacted to Anthropic co-founder Christopher Olah's influence on the Pope's new AI ethics encyclical, 'Magnifica Humanitas'. Quinn joked that getting the Pope to canonize a product's technical limitations as a spiritual treatise is the ultimate lobbying feat. The commentary highlights the surreal intersection of AI safety advocacy, corporate branding, and global religious authority.
Vercel’s changelog says Sandbox persistence is now GA. Based on the provided material, the confirmed fact is limited to the feature reaching general availability. No model, pricing, API, or implementation details were included, so this should be treated as a product availability update rather than a technical deep dive.
Vercel announced that Firecrawl has joined the Vercel Marketplace. Based only on the provided title, this appears to be a marketplace listing or ecosystem availability update. The title does not specify features, pricing, setup flow, usage limits, or integration depth, so teams should check the official listing before making adoption decisions.
Vercel announced that Vercel Domains now supports sorting results by price and filtering by availability. The change appears focused on improving the domain search and purchase experience, helping users narrow candidate domains faster. No original body text was provided, so details such as sort direction, exact filter options, TLD coverage, or API availability should not be inferred.
Vercel has extended its microfrontends routing capabilities to support custom alias domains (via vc alias) and Git branch-specific preview domains. Previously, complex multi-project routing rules were often limited to production domains. This update ensures that developers can seamlessly test federated frontend architectures in preview environments before merging.