Latest in AI

Showing:software-engineeringResearchersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Mistral AI Upgrades Agentic Coding Capabilities with New Devstral Models
Mistral AI News40 days agoRelease
Mistral AI has announced a new generation of its Devstral model family, designed specifically to advance agentic coding capabilities. Devstral models are purpose-built for software engineering agents that autonomously plan, write, debug, and iterate on code. The release signals Mistral's continued investment in the competitive agentic-coding segment alongside offerings from Anthropic, OpenAI, and Google.
John Carmack: Fabrice Bellard Is Almost Certainly a Better Overall Programmer
Hacker News (AI keywords)42 days agoCommentary
In a brief post on X, legendary programmer John Carmack expressed deep admiration for Fabrice Bellard, stating Bellard is almost certainly a better overall programmer. The tribute is striking given Carmack's own towering reputation in software engineering. No further body text was available from the source, but the remark alone carries significant weight in the developer community.
Why AI Hasn't Replaced Software Engineers, and Won't
Simon Willison's Weblog43 days agoCommentary
Princeton researchers Arvind Narayanan and Sayash Kappor examine software engineering — AI's most exposed profession — and find no evidence of mass displacement. New York WARN Act data shows zero AI-related layoff disclosures after a full year of filings. Their qualitative analysis identifies three automation-resistant bottlenecks: specifying what to build, verifying outcomes, and the deep contextual understanding of codebases, businesses, and environments that only humans accumulate.
Why AI Hasn't Replaced Software Engineers, and Won't
Hacker News (AI keywords)47 days agoOpinion
Based only on the title, this appears to be a commentary on the limits of AI in software engineering. It likely argues that coding is only one part of the engineering role, while judgment, system design, debugging, product context, and accountability remain human-centered. The piece is relevant to developers and technical leaders evaluating AI coding tools without assuming full automation is imminent.
Hot take: "Vibecoding" is being used for two different things and it causes unnecessary friction
r/LocalLLaMA top day48 days agoCommentary
A Reddit user argues "vibecoding" carries two distinct meanings: throwing code at AI carelessly with no engineering judgment, versus using heavy AI assistance while still maintaining quality standards. Andrej Karpathy's own practice almost certainly fits the second definition, not the first. This semantic ambiguity fuels unnecessary arguments whenever the community debates AI-assisted development quality.
Anthropic Releases Claude Fable 5, Its First Mythos-Class Model★ 78
The Verge AI48 days agoRelease
Anthropic has released Claude Fable 5, the company's most powerful model ever made widely available and its first under the new 'Mythos' model class. The model shows exceptional performance across software engineering, knowledge work, and vision tasks. Its advantage over competing models reportedly grows wider as tasks increase in length and complexity, making it particularly suited for demanding, multi-step workloads.
Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering
Hacker News (AI keywords)51 days agoPaper
This arXiv paper studies token consumption in LLM-based multi-agent software engineering. Using 30 ChatDev tasks with a GPT-5 reasoning model, the authors map internal phases to SDLC stages such as design, coding, review, testing, and documentation. Preliminary results suggest code review dominates token usage, averaging 59.4%, while input tokens form the largest share, pointing to inefficiencies in agent collaboration.
AlphaEvolve：Google DeepMind 基於 Gemini 的程式碼 Agent 如何在各領域擴大影響力★ 80
Google DeepMind Blog83 days agoRelease
Google DeepMind has recently shared the latest progress and real-world impact of its new coding agent "AlphaEvolve." AlphaEvolve is an algorithmic system…
評估驅動開發（Eval-driven development）：更快打造更好的 AI 應用★ 80
Vercel Changelog649 days agoOpinion
As generative AI applications become more widespread, one of the biggest challenges developers face is the "non-deterministic" output of large language models…
Hugging Face 的設計哲學：為什麼我們反其道而行，選擇「重複程式碼」？★ 85
Hugging Face Blog1,575 days agoOpinion
This classic blog post from Hugging Face examines the tension between software engineering principles and the demands of machine learning (ML) research…
機器學習即程式碼（ML as Code）的時代已經到來：Hugging Face 談開發者導向的 AI 新典範
Hugging Face Blog1,742 days agoOpinion
This classic 2021 article from Hugging Face declared the official arrival of the "Machine Learning as Code" (ML as Code) era. The central argument is that…