Latest in AI

Showing:ResearchersClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

llama.cpp PR #24277 avoids KV cell copies in kv-cache
r/LocalLLaMA top day50 days agoRelease
ggml-org/llama.cpp merged PR #24277 by ggerganov, titled “kv-cache: avoid kv cells copies.” The Reddit post says the change improves MTP performance for Gemma-4 and was merged the previous day. It is available starting with the b9551 release, making it relevant for local inference users tracking llama.cpp performance updates.
Import AI 460: Reward hacking society, RSI data, and RL quadcopter racing★ 76
Import AI (Jack Clark)50 days agoCommentary
Import AI 460 covers SocioHack, a benchmark where RL-trained LLMs discover loopholes in institutional rule systems. It also discusses Anthropic evidence for a practical form of recursive self-improvement, reflected in sharply increased code merged during 2026. Other sections examine multi-agent RL drones outperforming a champion human pilot, plus research showing state-controlled media can shape LLM responses in local languages.
The Weather and Climate Science AI Revolution Isn't Revolutionary
Ars Technica AI50 days agoCommentary
While AI models like Google's GraphCast have dramatically accelerated weather forecasting, experts argue the "AI revolution" in climate science is overstated. Machine learning models struggle with unprecedented extreme events due to their reliance on historical training data, and they often violate fundamental physical laws. Consequently, AI is currently acting as an emulator to speed up traditional physics-based models rather than replacing them, pointing toward a hybrid future.
Cohere and Mila Partner to Advance Quebec French Language and Culture in AI
Cohere Blog50 days agoBusiness
Cohere has partnered with Mila, the Quebec AI Institute, to improve the representation of Quebec French (Québécois) and its cultural nuances in AI. The collaboration aims to address the European French bias in current models by leveraging Cohere's multilingual capabilities and Mila's research expertise. This initiative will help deliver more culturally accurate AI solutions for Quebec's public and private sectors.
Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI
Cohere Blog50 days agoBusiness
Cohere highlights its enterprise AI solutions tailored for the healthcare and life sciences sectors. By utilizing its Command, Embed, and Rerank models, Cohere enables medical institutions and pharmaceutical companies to securely retrieve and analyze complex clinical data. This accelerates drug discovery, streamlines clinical trials, and improves administrative efficiency while ensuring strict regulatory compliance.
Introducing Cohere Transcribe: A New State-of-the-Art in Open-Source Speech Recognition★ 80
Cohere Blog50 days agoRelease
Cohere has announced "Cohere Transcribe," a new state-of-the-art open-source speech recognition model. Designed to deliver highly accurate and efficient speech-to-text capabilities, it represents Cohere's expansion into open-source audio AI. The model aims to challenge existing industry benchmarks like OpenAI's Whisper by offering superior multilingual performance.
Cohere Blog: Technology Tag Page Overview
Cohere Blog50 days agoCommentary
This page aggregates all technology-focused articles on the Cohere blog. As an enterprise-focused AI company, Cohere's technical content primarily covers its Command LLM family, industry-leading Embed and Rerank models, and practical RAG implementation guides. It serves as a key resource for developers and enterprise architects tracking Cohere's technical evolution.
Cohere Highlights Thomas Euyang as Research Visual Storyteller
Cohere Blog50 days agoBusiness
Cohere highlights the role of Thomas Euyang, a Research Visual Storyteller at the company. His work focuses on translating complex machine learning research and LLM concepts into intuitive, engaging visual narratives. This spotlight underscores the growing importance of design and visual communication in making advanced AI research accessible to developers and the public.
Co/plot: Supporting the Research Process Through Visualization
Cohere Blog50 days agoNew Tool
Cohere has announced "Co/plot," a tool dedicated to supporting the research process through advanced visualization. It aims to help researchers and developers better understand complex data structures, model behaviors, and research workflows. This launch highlights Cohere's expanding focus on building practical developer and researcher tools that complement their core LLM and embedding models.
Cohere's Commitment to Open Science and Collaborative AI Research
Cohere Blog50 days agoCommentary
Cohere's Open Science initiative, primarily driven by its non-profit research lab Cohere For AI (C4AI), focuses on democratizing AI research. By releasing open-weights models like Aya and fostering global research collaborations, Cohere aims to bridge the gap in multilingual AI representation. This approach highlights their commitment to community-driven, accessible AI development.
Cohere Product Launch Blog Hub
Cohere Blog50 days agoRelease
This link directs to Cohere's official "Product Launch" blog category. It serves as a centralized hub aggregating all major product announcements, including the Command LLM series, Embed models, Rerankers, and developer platform updates. It is a key resource for tracking Cohere's enterprise AI advancements.
Cohere Official Research Blog and Technical Publications
Cohere Blog50 days agoCommentary
The Cohere Research blog serves as the central hub for the company's academic papers and technical breakthroughs. It covers key areas including advanced Retrieval-Augmented Generation (RAG), multilingual embeddings, and robust tool-use capabilities for enterprise agents. This is a key resource for understanding the foundational technology behind Cohere's models.
Magistral★ 78
Mistral AI News50 days agoRelease
Mistral AI announced Magistral, its first reasoning model family, with Magistral Small as a 24B open-weight Apache 2.0 model and Magistral Medium for enterprise use. The company emphasizes traceable multilingual reasoning, professional-domain use cases, and faster reasoning in Le Chat through Think mode and Flash Answers. Magistral Small is available on Hugging Face, while Magistral Medium is available in Le Chat preview and via La Plateforme API.
Mistral Compute★ 74
Mistral AI News50 days agoHardware
Mistral Compute is a new infrastructure offering that bundles GPUs, orchestration, APIs, products, and services in private deployments. It supports formats from bare-metal servers to fully managed PaaS, targeting sovereigns, enterprises, and research labs. Mistral AI emphasizes data sovereignty, European regulatory requirements, sustainability, NVIDIA architectures, and an alternative to US- or China-based cloud AI providers.
Announcing AI for Citizens
Mistral AI News50 days agoBusiness
Mistral AI introduced AI for Citizens as a collaborative initiative for states, public institutions, education, and research partners. It argues that closed, one-size-fits-all AI creates lock-in, geopolitical exposure, data governance risks, and poor local cultural fit. The initiative offers Mistral AI technology, deployment choice, data sovereignty, custom R&D, and roadmap visibility to support local AI strategies.
Upgrading agentic coding capabilities with the new Devstral models★ 72
Mistral AI News50 days agoRelease
Mistral AI announced two Devstral updates focused on agentic coding workflows: Devstral Small 1.1 and Devstral Medium. Devstral Small 1.1 remains a 24B Apache 2.0 open model and reaches 53.6% on SWE-Bench Verified. Devstral Medium reaches 61.6%, is available through Mistral’s API, and supports private deployment and custom finetuning for enterprises.
Voxtral★ 78
Mistral AI News50 days agoRelease
Mistral AI introduces Voxtral, a speech understanding model family with 24B and 3B variants under Apache 2.0. The models support long-context transcription, audio Q&A, summarization, multilingual detection, and function calling from voice. Mistral says Voxtral is competitive across transcription and audio understanding benchmarks, with API access starting at $0.001 per minute and local downloads available on Hugging Face.
Le Chat dives deep★ 72
Mistral AI News50 days agoNew Tool
Mistral AI introduced several Le Chat upgrades: Deep Research in preview, Voice mode, multilingual reasoning powered by Magistral, Projects, and advanced image editing with Black Forest Labs. Deep Research plans, searches, and synthesizes structured reports with references, while Voice mode uses Voxtral for low-latency speech input. Projects groups chats, files, tools, and settings into context-rich workspaces, and image editing lets users modify generated visuals through prompts while preserving consistency.
Our Contribution to a Global Environmental Standard for AI★ 72
Mistral AI News50 days agoRegulation
Mistral AI reports lifecycle impacts for LLM training and inference across greenhouse gas emissions, water use, and resource depletion. It discloses figures for Mistral Large 2 after training and 18 months of use, plus marginal impacts for a 400-token Le Chat response. The company argues AI vendors should use standardized, internationally recognized reporting so buyers and policymakers can compare models more responsibly.
Mistral AI Announces Codestral 25.08 and Enterprise Coding Stack
Mistral AI News50 days agoRelease
Mistral AI’s title indicates a research-style announcement for Codestral 25.08 and a complete Mistral coding stack for enterprise use. Because the article body was not provided, details such as capabilities, benchmarks, licensing, deployment modes, and included tools cannot be verified. The item appears relevant to developers and ML engineers tracking enterprise AI coding systems from the Mistral model family.
Unlocking VLM Potential on Satellite Imagery Through Fine-Tuning
Mistral AI News50 days agoTutorial
Mistral AI demonstrates how LoRA fine-tuning adapts Pixtral-12B to satellite imagery, a specialized visual domain where prompting alone is unreliable. Using the Aerial Image Dataset, the post compares a prompt-based baseline against a fine-tuned model across 30 scene classes. Accuracy rose from 0.56 to 0.91, while invalid label hallucinations dropped from 5% to 0.1%.
Make Memory work for you
Mistral AI News50 days agoNew Tool
Mistral AI describes Le Chat Memories beta as a user-controlled memory layer for conversational AI. The system automatically saves useful information while making recall visible, sourced, and editable. It also introduces Memory Insights for surfacing trends and summaries, with upcoming improvements for categories, instant forgetting, and clearer memory-use visibility.
Mistral AI raises €1.7B to accelerate technological progress with AI★ 76
Mistral AI News50 days agoBusiness
Mistral AI announced a €1.7B Series C funding round at an €11.7B post-money valuation. The round is led by semiconductor equipment maker ASML Holding NV, with participation from existing investors including NVIDIA and Andreessen Horowitz. Mistral says the funding will support frontier AI research, custom decentralized AI solutions, and work on complex engineering and industrial challenges.
Introducing Mistral 3★ 84
Mistral AI News50 days agoRelease
Mistral AI introduced Mistral 3, a new open model family under Apache 2.0. It includes Mistral Large 3, a 675B-parameter sparse MoE with 41B active parameters, plus Ministral 3 models at 3B, 8B, and 14B. The release targets frontier open-weight use, multimodal and multilingual workflows, enterprise customization, and efficient local or edge deployments.
Introducing Devstral 2 and Mistral Vibe CLI★ 76
Mistral AI News50 days agoNew Tool
Mistral introduced Devstral 2, a 123B coding model, and Devstral Small 2, a 24B variant for lighter deployment. The company reports 72.2% and 68.0% on SWE-bench Verified, respectively, with permissive open-source licensing. It also launched Mistral Vibe CLI, an open-source terminal agent for codebase exploration, multi-file edits, command execution, and IDE integration.
Introducing Mistral OCR 3
Mistral AI News50 days agoRelease
Mistral AI introduced Mistral OCR 3, a document extraction model focused on high-fidelity text, image, markdown, and HTML table output. The company says it achieves a 74% overall win rate over Mistral OCR 2 across forms, scanned documents, complex tables, and handwriting. It is available through API and the Document AI Playground in Mistral AI Studio, with pricing starting at $2 per 1,000 pages.
Heaps do lie: debugging a memory leak in vLLM
Mistral AI News50 days agoTutorial
Mistral AI published an engineering deep dive on a memory leak found during vLLM disaggregated serving tests. The leak appeared only with a specific stack involving Mistral Medium 3.1, NIXL, UCX, graph compilation, and P/D disaggregation, with RSS growing steadily despite heap profilers looking normal. The team used pmap, BPFtrace, and targeted GDB automation to trace the issue to UCX mmap hooks and applied configuration fixes plus a vLLM patch.
Research: Voxtral transcribes at the speed of sound
Mistral AI News50 days agoPaper
The title says Mistral AI’s Voxtral can transcribe “at the speed of sound,” suggesting a focus on fast speech-to-text. No article body is available, so details such as benchmarks, languages, pricing, API access, or release status cannot be confirmed. The item is most relevant to developers and researchers tracking Mistral’s work in speech and transcription models.
Rails testing on autopilot: Building an agent that writes what developers won't
Mistral AI News50 days agoTutorial
Mistral AI describes an autonomous Rails testing agent built on its open-source Vibe coding assistant. The agent reads Rails files, applies file-type-specific skills, generates or improves RSpec tests, and validates them with RuboCop, RSpec, and SimpleCov. In a 275-file experiment, it reached 100% passing tests, 100% average line coverage, zero RuboCop violations, and a higher LLM-as-a-judge score, while stressing that generated tests must actually run.
Leanstral: Open-Source Foundation for Trustworthy Vibe-Coding★ 76
Mistral AI News50 days agoRelease
Mistral AI introduced Leanstral, an open-source code agent designed for Lean 4 and formal proof engineering. The model is available through Apache 2.0 weights, Mistral Vibe, and a Labs API endpoint. Mistral positions it as a cost-efficient alternative for verified coding workflows, with FLTEval benchmarks comparing it against Claude family models and large open-source competitors.

← PreviousPage 15Next →

Latest in AI

llama.cpp PR #24277 avoids KV cell copies in kv-cache

Import AI 460: Reward hacking society, RSI data, and RL quadcopter racing★ 76

The Weather and Climate Science AI Revolution Isn't Revolutionary

Cohere and Mila Partner to Advance Quebec French Language and Culture in AI

Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI

Introducing Cohere Transcribe: A New State-of-the-Art in Open-Source Speech Recognition★ 80

Cohere Blog: Technology Tag Page Overview

Cohere Highlights Thomas Euyang as Research Visual Storyteller

Co/plot: Supporting the Research Process Through Visualization

Cohere's Commitment to Open Science and Collaborative AI Research

Cohere Product Launch Blog Hub

Cohere Official Research Blog and Technical Publications

Magistral★ 78

Mistral Compute★ 74

Announcing AI for Citizens

Upgrading agentic coding capabilities with the new Devstral models★ 72

Voxtral★ 78

Le Chat dives deep★ 72

Our Contribution to a Global Environmental Standard for AI★ 72

Mistral AI Announces Codestral 25.08 and Enterprise Coding Stack

Unlocking VLM Potential on Satellite Imagery Through Fine-Tuning

Make Memory work for you

Mistral AI raises €1.7B to accelerate technological progress with AI★ 76

Introducing Mistral 3★ 84

Introducing Devstral 2 and Mistral Vibe CLI★ 76

Introducing Mistral OCR 3

Heaps do lie: debugging a memory leak in vLLM

Research: Voxtral transcribes at the speed of sound

Rails testing on autopilot: Building an agent that writes what developers won't

Leanstral: Open-Source Foundation for Trustworthy Vibe-Coding★ 76