Latest in AI

Showing: reasoning

Cohere Introduces Command A+: Next-Gen Enterprise Model Optimized for Agentic Workflows ★ 80
Cohere Blog · 50 days ago · Release
Cohere has introduced Command A+, its latest enterprise-grade model tailored for agentic workflows. Stepping beyond traditional RAG, Command A+ excels in multi-step reasoning, complex tool use, and multilingual capabilities. It is designed to seamlessly integrate with enterprise APIs, enabling highly autonomous and reliable AI agents.
Introducing Mistral Small 4 ★ 76
Mistral AI News · 50 days ago · Release
Mistral AI introduced Mistral Small 4 as the next major release in the Mistral Small family. It combines reasoning, multimodal, and agentic coding capabilities into one open model with configurable reasoning effort. The model uses a MoE architecture, supports a 256k context window and text-image inputs, and is available through Mistral API, AI Studio, Hugging Face, NVIDIA NIM, and common inference stacks.
Introducing Mistral Small 4 ★ 78
Mistral AI News · 50 days ago · Release
Mistral Small 4 is the next major release in the Mistral Small family, unifying Magistral-style reasoning, Pixtral-style multimodality, and Devstral-style coding agents. It uses a MoE architecture with 119B total parameters, 6B active parameters per token, a 256k context window, and configurable reasoning effort. The model is available via Mistral API, AI Studio, Hugging Face, open-source serving stacks, and NVIDIA deployment options.
Qwen 3.6 27B DeepSWE Benchmark Results Highlight Gap Between Local and Closed-Source Models
r/LocalLLaMA top day · 50 days ago · Benchmark
A community benchmark of Qwen 3.6 27B on DeepSWE yielded a score of 1.79%, placing 18th of 20 and slightly outperforming Haiku 4.5. Run on a single RTX 6000 Blackwell GPU via vLLM with reasoning enabled, the test averaged 32 minutes and 44k output tokens per task. The author calls Qwen 3.6 27B a 'poor man's local SOTA' but notes that the wide gap to frontier closed models suggests local LLMs are struggling to keep pace in complex coding.
[AINews] All Model Labs Are Now Agent Labs ★ 78
Latent Space · 66 days ago · Commentary
This AINews feature from Latent Space argues that the AI industry is undergoing a profound transformation — "all the model labs are now agent labs." Over the…
OpenAI GPT-next Disproves the 80-Year-Old Erdős Planar Unit Distance Conjecture for Under $1,000 ★ 90
Latent Space · 68 days ago · Release
A historic and landmark breakthrough has arrived at the intersection of artificial intelligence and mathematics. According to Latent Space, OpenAI's…
llm-gemini 0.32a0 Alpha Released: Support for Streaming Gemini Thinking Tokens
Simon Willison's Weblog · 69 days ago · Release
Well-known developer Simon Willison recently released the latest alpha version of his open-source command-line tool plugin `llm-gemini`, version `0.32a0`…
Gemini 3.5 Released: Frontier Intelligence with Action, Focused on Complex Agentic Workflows ★ 85
Google DeepMind Blog · 73 days ago · Release
Google DeepMind has announced the launch of its next-generation AI model, Gemini 3.5, positioned as "frontier intelligence with action." This announcement…
[AINews] The End of Fine-Tuning? Exploring the Future and Transformation of Fine-Tuning in the Era of Large Models ★ 75
Latent Space · 76 days ago · Opinion
As AI technology continues to iterate at a rapid pace, the developer community is confronting a profound rethinking of the question: "Is fine-tuning heading…
Interview with OpenAI Physicist Alex Lupsasca: How GPT-5.x Derives New Research Results in Theoretical Physics and Quantum Gravity (Doing Vibe Physics) ★ 85
Latent Space · 83 days ago · Commentary
This interview records an in-depth conversation between OpenAI theoretical physicist Alex Lupsasca and Latent Space, centered on how GPT-5.x — OpenAI's…
Signs of the Future: GPT-5.5 and the Next Step in AI's Exponential Growth ★ 85
One Useful Thing (Mollick) · 95 days ago · Commentary
Wharton School professor Ethan Mollick, writing in his well-known newsletter "One Useful Thing," has published a profound analysis of GPT-5.5. He describes…
Mid-2026 Predictions: My Bets on Open-Source AI Models and an Analysis of the Open-Closed Gap ★ 75
Interconnects (Nathan L.) · 103 days ago · Opinion
In this forward-looking article on the state of AI in mid-2026, Interconnects founder Nathan Lambert takes a deep dive into the dynamic gap between open-weight…
A Deep Dive into VAKRA: IBM Research's New Benchmark for Evaluating AI Agent Reasoning, Tool Calling, and Failure Modes ★ 75
Hugging Face Blog · 104 days ago · Release
As generative AI technology has evolved, the industry's focus has shifted from pure "Large Language Models (LLMs)" to "AI Agents" capable of autonomously…
Gemma 4: The Most Capable Open-Source Model at Its Parameter Scale, Built for Advanced Reasoning and Agent Workflows ★ 85
Google DeepMind Blog · 116 days ago · Release
Google DeepMind has today officially released its latest generation of open-source model series — Gemma 4. The company positions it as "the smartest and most…
Lossy Self-Improvement: Why AI Self-Improvement Is Real but Won't Lead to a "Fast Takeoff" ★ 75
Interconnects (Nathan L.) · 127 days ago · Opinion
This article takes a deep dive into one of the most contentious topics in artificial intelligence: AI "self-improvement" and whether it will trigger a "fast…
Hugging Face Open-Source Ecosystem Report: Spring 2026 Edition ★ 85
Hugging Face Blog · 132 days ago · Commentary
Hugging Face has published its Spring 2026 "State of Open Source AI" report, offering a comprehensive review of the explosive growth and paradigm shifts that…
The Shape of the Thing: The AI Stage We Are In and the Outlook Ahead ★ 85
One Useful Thing (Mollick) · 137 days ago · Opinion
Wharton School professor Ethan Mollick, in his latest article "The Shape of the Thing," sketches out a clear picture of the current state of AI technological…
How to Write Prompts for Seedream 5.0: Mastering Next-Generation Image Generation with Multi-Step Reasoning and Example-Based Editing ★ 75
Replicate Blog · 154 days ago · Tutorial
Replicate recently published a prompt-writing guide for the new image generation model Seedream 5.0. Seedream 5.0 introduces three revolutionary features to…
Google DeepMind Announces Gemini 3.1 Pro: A Smarter Model Built for Complex Tasks ★ 85
Google DeepMind Blog · 158 days ago · Release
Google DeepMind officially released a brand-new model today (February 19, 2026): "Gemini 3.1 Pro." According to the initial official disclosure, the core…
A Guide to Choosing AI in the Agent Era: No Longer Just Chatbots ★ 85
One Useful Thing (Mollick) · 160 days ago · Tutorial
Prominent scholar Ethan Mollick, in his latest article, points out that we have officially crossed beyond the era of simple "Chatbots" and entered what he…
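Open Models Stuck in "Perpetual Catch-Up": The Open-Closed Gap, Distillation, Innovation Cycles, and Open Source's Odds of Winning ★ 80
Interconnects (Nathan L.) · 160 days ago · Opinion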
This article by Nathan Lambert takes a deep dive into the tangled competitive dynamics between open-source and closed-source AI models. Lambert argues that…
Google DeepMind Launches Gemini 3 Deep Think: A Deep Reasoning Mode Designed for Science, Research, and Engineering ★ 90
Google DeepMind Blog · 165 days ago · Release
On February 12, 2026, Google DeepMind announced the launch of its most advanced reasoning mode update — Gemini 3 Deep Think. This model is Google's…
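Gemini Deep Think Accelerates Mathematical and Scientific Discovery: Academic Research Shows Its Strong Cross-Disciplinary Reasoning Impact ★ 82
Google DeepMind Blog · 168 days ago · Commentary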
Google DeepMind recently published an article exploring how its deep-reasoning model, "Gemini Deep Think," is transforming the landscape of mathematics and…
Architectural Choices in China's Open-Source AI Ecosystem: The Next Step Beyond DeepSeek ★ 85
Hugging Face Blog · 181 days ago · Commentary
This blog post from Hugging Face reviews the full year of technical evolution since the "DeepSeek Moment" at the start of 2025 — the release of DeepSeek-V3 and…
Vercel AI Gateway Now Officially Supports the Qwen 3 Max Thinking Reasoning Model ★ 72
Vercel Changelog · 182 days ago · Release
Vercel announced in its changelog on January 26, 2026 that its AI Gateway service now officially supports Alibaba Cloud's latest Qwen 3 Max Thinking reasoning…
One Year After the "DeepSeek Moment": A Look Back at the Paradigm Shift and Transformation in Open-Source AI ★ 85
Hugging Face Blog · 188 days ago · Commentary
The DeepSeek-V3 and R1 models released in January 2025 have been hailed as the "DeepSeek Moment" in the AI world. This upheaval not only shattered the myth…
NVIDIA Launches Cosmos Reason 2: Bringing Advanced Reasoning to Embodied Intelligence and Physical AI ★ 85
Hugging Face Blog · 203 days ago · Release
NVIDIA and Hugging Face have jointly announced the launch of the new Cosmos Reason 2 model, marking a major breakthrough in the fields of Physical AI and…
Google's 2025 Year in Review: 8 Areas of Research Breakthroughs Shaping the Future of Science and AI ★ 85
Google DeepMind Blog · 216 days ago · Commentary
As 2025 draws to a close, Google DeepMind has published its annual review, showcasing eight breakthrough research areas in artificial intelligence. This year…
Apriel-H1: Revealing the Surprising Key to Distilling Efficient Reasoning Models ★ 75
Hugging Face Blog · 251 days ago · Release
With the successive emergence of models with powerful "reasoning" capabilities — such as OpenAI o1, o3, and DeepSeek-R1 — the challenge of reducing the…
Google DeepMind Unveils the New-Generation Gemini 3: Ushering in a New Era of Proactive AI and Powerful Reasoning ★ 98
Google DeepMind Blog · 251 days ago · Release
Google DeepMind officially unveiled its latest flagship AI model — Gemini 3 — in November 2025. This marks a new milestone for Google in the field of…
