Latest in AI

Showing:ai-agentsProductOtherClear ×

🔥 Trending today

nvidia5 microsoft-build4 agentic-ai4 enterprise-ai4 security3 ai-assistant3 fundraising3 rtx-spark2 ai-agents2 fraud-prevention2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

The Age of Async Agents — Cognition's Walden Yan & OpenInspect's Cole Murray
Latent Space5d agoCommentary
Latent Space interviews Cognition's Walden Yan and OpenInspect's Cole Murray on the rise of async coding agents. The discussion centers on Devin-related workflows, including 80% Devin commits, spec-to-PR development, full VMs, agent memory, and PMs shipping code. The key theme is not a model release, but a shift toward agents that can work asynchronously inside more complete software delivery loops.
ITBench-AA: Frontier Models Score Below 50% on Enterprise IT Tasks★ 72
Hugging Face Blog6d agoBenchmark
Artificial Analysis and IBM present ITBench-AA, described in the title as the first benchmark for agentic enterprise IT tasks. The headline result is that frontier models score below 50%, suggesting current systems still struggle with enterprise-grade agent workflows. The original article text is unavailable here, so task design, evaluated models, scoring methodology, and rankings cannot be confirmed.