Latest in AI

Showing:agentsResearchersClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Agentic Resource Discovery: Let Agents Search
Hugging Face Blog41 days agoNew Tool
Hugging Face has introduced Agentic Resource Discovery, a capability enabling AI agents to search for and retrieve models, datasets, and other resources from the Hub dynamically. The feature targets a core friction point in agentic pipeline design, where resources are typically hardcoded by developers ahead of time. By enabling runtime resource lookup, it pushes Hugging Face Hub from a static asset store toward an active participant in agent architectures.
datasette-agent 0.3a0: Write SQL via Natural Language with User Approval
Simon Willison's Weblog42 days agoRelease
Version 0.3a0 of datasette-agent introduces `execute_write_sql`, a new tool that translates natural language into write SQL statements and prompts the user to confirm before execution. The `datasette agent chat` terminal mode now supports these approval flows, with three new flags — `--root`, `--yes`, and `--unsafe` — to control permission levels and auto-approval. Together these additions enable fully conversational, autonomous modification of SQLite databases via an AI agent.
Watch agents fight: a live challenge to speed up Gemma 4 E4B inference on a single A10G
r/LocalLLaMA top day48 days agoBenchmark
A public HuggingFace Spaces dashboard hosts a live competition where AI agents race to optimize Gemma 4 E4B inference throughput on a single NVIDIA A10G GPU. The challenge gamifies ML inference engineering, letting anyone watch agents explore quantization and scheduling strategies in real time. Optimization recipes surfaced by the competition offer practical value for developers targeting single-GPU self-hosted Gemma 4 deployments.
Cohere Official Research Blog and Technical Publications
Cohere Blog50 days agoCommentary
The Cohere Research blog serves as the central hub for the company's academic papers and technical breakthroughs. It covers key areas including advanced Retrieval-Augmented Generation (RAG), multilingual embeddings, and robust tool-use capabilities for enterprise agents. This is a key resource for understanding the foundational technology behind Cohere's models.
Control 3D Avatars with Natural Language Using "Program as Weights" (programasweights)
r/LocalLLaMA top day50 days agoNew Tool
Developer Yuntian Deng introduced "programasweights," a framework that compiles plain-English descriptions into tiny, local action programs (loops, parallel tracks) to control 3D avatars. Instead of pre-defined buttons, users can command complex sequences like "wave while walking, then jump." The runtime code is open-source and runs entirely offline in the browser or via Python.
Designing the hf CLI as an agent-optimized way to work with the Hub
Hugging Face Blog54 days agoCommentary
Based only on the title, this Hugging Face post appears to explain how the hf CLI is being designed for AI agents working with the Hub. It likely focuses on command-line ergonomics, automation, and predictable interactions with Hub resources. Without the full text, specific features, supported agents, or implementation details should not be inferred.
Adding MCP Tools to Reachy Mini
Hugging Face Blog55 days agoTutorial
Based on the available title, this Hugging Face Blog post appears to cover adding MCP tools to Reachy Mini. The likely focus is connecting the open-source desktop robot with Model Context Protocol-based tool integrations. Since the original article text is not provided, implementation details, supported servers, models, and limitations cannot be confirmed.
Harness, Scaffold, and the AI Agent Terms Worth Getting Right★ 75
Hugging Face Blog64 days agoTutorial
Hugging Face has published a comprehensive glossary of AI agent terminology to resolve industry-wide confusion. The guide focuses on defining critical concepts such as "scaffold" (the code wrapping the LLM) and "harness" (the evaluation and execution environment). This standardization helps developers and researchers communicate more precisely when building and benchmarking agentic systems.
datasette-agent 0.1a4: Start Agent Chats Instantly via the "/" Jump Menu
Simon Willison's Weblog64 days agoNew Tool
Simon Willison released datasette-agent 0.1a4, leveraging the new makeJumpSections() JS plugin hook in Datasette 1.0a30. Users can now press "/" from any page to open the "Jump to" menu and instantly access a "Start a new agent chat" input box. This update streamlines database interactions, allowing users to trigger agent-led data analysis seamlessly.
Hackers are learning to exploit chatbot ‘personalities’ for security exploits★ 72
The Verge AI65 days agoEthics
As AI chatbots adopt increasingly sophisticated personas, hackers are shifting from basic prompt injections to social engineering attacks targeting these "personalities." Researchers warn that manipulating a chatbot's defined role (e.g., customer service or empathetic companion) makes it easier to bypass safety guardrails. This evolution poses a significant threat to agentic AI workflows that rely on consistent role-playing and external data integration.
[AINews] 所有模型實驗室都已轉型為 Agent 實驗室★ 78
Latent Space66 days agoCommentary
This AINews feature from Latent Space argues that the AI industry is undergoing a profound transformation — "all the model labs are now agent labs." Over the…
給 AI Agent 一台電腦：專訪 Daytona 執行長 Ivan Burazin，談 74% 月成長、裸機沙盒與全新 Agent Cloud★ 75
Latent Space67 days agoNew Tool
In this Latent Space interview, the hosts hold an in-depth conversation with Ivan Burazin, co-founder and CEO of Daytona. Daytona originally started as an…
Datasette Agent: An Extensible AI Assistant for Datasette★ 70
Simon Willison's Weblog67 days agoNew Tool
Simon Willison announced the first release of Datasette Agent, merging his 'llm' Python library with Datasette. The tool provides a conversational interface to query SQLite databases, with plugin support for generating charts and running code in sandboxes. It runs efficiently on lightweight models like Gemini 3.1 Flash-Lite and supports local open-weight models via LM Studio.
Datasette Agent 新外掛 datasette-agent-sprites 0.1a0 發布：在 Fly Sprites 沙盒中安全執行指令
Simon Willison's Weblog67 days agoRelease
Well-known open-source developer Simon Willison has recently announced the release of the first alpha version (0.1a0) of `datasette-agent-sprites`. This is a…
datasette-agent-charts 0.1a2 發布：可在生成的圖表下方查看 SQL 查詢語句
Simon Willison's Weblog67 days agoRelease
Simon Willison, the founder of the well-known open-source data analysis tool Datasette, has recently released the latest alpha version of…
datasette-agent 0.1a3 版本發布：優化 SQL 查詢檢視與截斷回應處理
Simon Willison's Weblog67 days agoRelease
Simon Willison's open-source AI assistant tool for Datasette, `datasette-agent`, has recently released version 0.1a3 in alpha. Datasette is an open-source…
黃仁勳預測 Nvidia 的下一個「全新」2000 億美元市場：用於 AI Agent 的 CPU★ 85
TechCrunch AI68 days agoBusiness
Nvidia CEO Jensen Huang has recently put forward a major market prediction, stating that Nvidia has its sights set on a brand-new market worth as much as $200…
Google I/O 2026：個人 AI 代理 Gemini Spark 與全新 Antigravity 工具鏈解析★ 75
Simon Willison's Weblog68 days agoCommentary
Well-known tech blogger Simon Willison has analyzed the announcements from Google I/O 2026. Since many major announcements are still in the "coming soon"…
datasette-agent-charts 0.1a1 發布：更豐富的色彩、互動式工具提示與權限檢查
Simon Willison's Weblog68 days agoRelease
Simon Willison has released the 0.1a1 early alpha version of datasette-agent-charts for his Datasette ecosystem. This plugin is designed to make it easier for…
Google I/O 2026 重磅發布：Gemini 3.5 Flash、Omni (NanoBanana 影片模型)、Spark 背景 Agent 與 Antigravity 2.0★ 85
Latent Space69 days agoRelease
In the latest issue of Latent Space AINews, the major announcements from Google I/O 2026 were covered in depth. Google demonstrated its formidable R&D and…
Google 發表針對 Agent 優化的 Gemini 3.5 Flash 與全能型「Omni」模型★ 85
Ars Technica AI69 days agoRelease
Google has announced the launch of a new model optimized for AI agents — Gemini 3.5 Flash — as well as a brand-new model dubbed "Omni," positioned as a…
5 分鐘回顧 LLM 的過去半年：PyCon US 2026 閃電講精華★ 75
Simon Willison's Weblog70 days agoCommentary
Simon Willison delivered a 5-minute lightning talk at PyCon US 2026, which he compiled into an illustrated record using his presentation tool, recapping the…
在 Vercel Sandbox 中運行 Claude 託管型 Agent★ 80
Vercel Changelog70 days agoRelease
The official Vercel Changelog announced that developers can now run Claude Managed Agents directly in Vercel Sandbox (sandbox environment). As AI Agents —…
Hugging Face 與 IBM 聯合推出 Open Agent Leaderboard：開源 AI 智能體效能評測全新基準★ 80
Hugging Face Blog70 days agoRelease
Hugging Face and IBM Research have jointly announced the launch of the "Open Agent Leaderboard," aimed at establishing an objective, standardized, and fully…
Google DeepMind 推出 Project Genie 新功能：結合 Street View 街景模擬真實世界，並向全球 Google AI Ultra 訂閱者開放★ 85
Google DeepMind Blog71 days agoRelease
Google DeepMind has officially announced that access to its much-anticipated generative interactive environment model, "Project Genie," is being expanded to…
Google DeepMind 聯手 Calico：利用 AI 系統「Co-Scientist」為抗老化研究開闢新路徑★ 75
Google DeepMind Blog73 days agoRelease
One of the greatest challenges in biomedical and anti-aging research is finding meaningful connections within the vast, fragmented, and interdisciplinary body…
Google DeepMind 的 Co-Scientist 協助史丹佛遺傳學家尋找治療肝纖維化的「老藥新用」方案★ 78
Google DeepMind Blog73 days agoRelease
This blog post from Google DeepMind introduces the latest breakthroughs in AI for the biomedical field. Geneticists at Stanford University, working in…
Gemini 3.5 發布：具備行動力的前沿智能，主打複雜代理型工作流 (Agentic Workflows)★ 85
Google DeepMind Blog73 days agoRelease
Google DeepMind has announced the launch of its next-generation AI model, Gemini 3.5, positioned as "frontier intelligence with action." This announcement…
Claude Code 產品主管談使用限制、透明度與「精簡測試架構」：我們沒有宏偉的計劃，而這正是刻意為之★ 80
Ars Technica AI74 days agoOpinion
Anthropic's command-line AI coding assistant Claude Code has sparked heated discussion in the developer community since its launch. Recently, Cat Wu, the…
[AINews] 萬物皆為協調者（Conductor）：AI 代理時代的編排趨勢
Latent Space74 days agoOpinion
On what was a relatively quiet day in AI news, the prominent AI community media outlet Latent Space observed a subtle but far-reaching technical trend…

Page 1Next →

Latest in AI

Agentic Resource Discovery: Let Agents Search

datasette-agent 0.3a0: Write SQL via Natural Language with User Approval

Watch agents fight: a live challenge to speed up Gemma 4 E4B inference on a single A10G

Cohere Official Research Blog and Technical Publications

Control 3D Avatars with Natural Language Using "Program as Weights" (programasweights)

Designing the hf CLI as an agent-optimized way to work with the Hub

Adding MCP Tools to Reachy Mini

Harness, Scaffold, and the AI Agent Terms Worth Getting Right★ 75

datasette-agent 0.1a4: Start Agent Chats Instantly via the "/" Jump Menu

Hackers are learning to exploit chatbot ‘personalities’ for security exploits★ 72

[AINews] 所有模型實驗室都已轉型為 Agent 實驗室★ 78

給 AI Agent 一台電腦：專訪 Daytona 執行長 Ivan Burazin，談 74% 月成長、裸機沙盒與全新 Agent Cloud★ 75

Datasette Agent: An Extensible AI Assistant for Datasette★ 70

Datasette Agent 新外掛 datasette-agent-sprites 0.1a0 發布：在 Fly Sprites 沙盒中安全執行指令

datasette-agent-charts 0.1a2 發布：可在生成的圖表下方查看 SQL 查詢語句

datasette-agent 0.1a3 版本發布：優化 SQL 查詢檢視與截斷回應處理

黃仁勳預測 Nvidia 的下一個「全新」2000 億美元市場：用於 AI Agent 的 CPU★ 85

Google I/O 2026：個人 AI 代理 Gemini Spark 與全新 Antigravity 工具鏈解析★ 75

datasette-agent-charts 0.1a1 發布：更豐富的色彩、互動式工具提示與權限檢查

Google I/O 2026 重磅發布：Gemini 3.5 Flash、Omni (NanoBanana 影片模型)、Spark 背景 Agent 與 Antigravity 2.0★ 85

Google 發表針對 Agent 優化的 Gemini 3.5 Flash 與全能型「Omni」模型★ 85

5 分鐘回顧 LLM 的過去半年：PyCon US 2026 閃電講精華★ 75

在 Vercel Sandbox 中運行 Claude 託管型 Agent★ 80

Hugging Face 與 IBM 聯合推出 Open Agent Leaderboard：開源 AI 智能體效能評測全新基準★ 80

Google DeepMind 推出 Project Genie 新功能：結合 Street View 街景模擬真實世界，並向全球 Google AI Ultra 訂閱者開放★ 85

Google DeepMind 聯手 Calico：利用 AI 系統「Co-Scientist」為抗老化研究開闢新路徑★ 75

Google DeepMind 的 Co-Scientist 協助史丹佛遺傳學家尋找治療肝纖維化的「老藥新用」方案★ 78

Gemini 3.5 發布：具備行動力的前沿智能，主打複雜代理型工作流 (Agentic Workflows)★ 85

Claude Code 產品主管談使用限制、透明度與「精簡測試架構」：我們沒有宏偉的計劃，而這正是刻意為之★ 80

[AINews] 萬物皆為協調者（Conductor）：AI 代理時代的編排趨勢