Latest in AI

Showing:ragResearchersClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Mistral AI Releases Codestral Embed Research Code Embedding Model
Mistral AI News40 days agoRelease
Mistral AI has announced Codestral Embed, a research-stage embedding model extending the Codestral family into vector representations of source code. The model targets developers and ML engineers building semantic code search, retrieval-augmented generation pipelines, and similarity tools over large codebases. As a research release, it signals Mistral's intent to compete at the retrieval and indexing layer of AI-assisted software development, not only at code generation.
Is Grep All You Need? How Agent Harnesses Reshape Agentic Search
Hacker News (AI keywords)49 days agoPaper
Echoing the famous Transformer paper, this work asks whether grep alone is sufficient for agentic search scenarios. The study focuses on 'agent harnesses'—the scaffolding wrapping an LLM, including prompting strategy, tool access, and memory—as the primary driver of search quality. Findings suggest harness design may matter more than the underlying model, challenging the community's focus on model scaling.
Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI
Cohere Blog50 days agoBusiness
Cohere highlights its enterprise AI solutions tailored for the healthcare and life sciences sectors. By utilizing its Command, Embed, and Rerank models, Cohere enables medical institutions and pharmaceutical companies to securely retrieve and analyze complex clinical data. This accelerates drug discovery, streamlines clinical trials, and improves administrative efficiency while ensuring strict regulatory compliance.
Cohere Blog: Technology Tag Page Overview
Cohere Blog50 days agoCommentary
This page aggregates all technology-focused articles on the Cohere blog. As an enterprise-focused AI company, Cohere's technical content primarily covers its Command LLM family, industry-leading Embed and Rerank models, and practical RAG implementation guides. It serves as a key resource for developers and enterprise architects tracking Cohere's technical evolution.
Co/plot: Supporting the Research Process Through Visualization
Cohere Blog50 days agoNew Tool
Cohere has announced "Co/plot," a tool dedicated to supporting the research process through advanced visualization. It aims to help researchers and developers better understand complex data structures, model behaviors, and research workflows. This launch highlights Cohere's expanding focus on building practical developer and researcher tools that complement their core LLM and embedding models.
Cohere Product Launch Blog Hub
Cohere Blog50 days agoRelease
This link directs to Cohere's official "Product Launch" blog category. It serves as a centralized hub aggregating all major product announcements, including the Command LLM series, Embed models, Rerankers, and developer platform updates. It is a key resource for tracking Cohere's enterprise AI advancements.
Cohere Official Research Blog and Technical Publications
Cohere Blog50 days agoCommentary
The Cohere Research blog serves as the central hub for the company's academic papers and technical breakthroughs. It covers key areas including advanced Retrieval-Augmented Generation (RAG), multilingual embeddings, and robust tool-use capabilities for enterprise agents. This is a key resource for understanding the foundational technology behind Cohere's models.
Introducing Search Toolkit★ 72
Mistral AI News50 days agoNew Tool
Mistral AI introduced Search Toolkit in public preview as a composable framework for AI search infrastructure. It unifies ingestion, retrieval, and evaluation with support for parsing, chunking, embeddings, BM25, dense retrieval, hybrid search, and standard retrieval metrics. The toolkit targets enterprise search, RAG quality improvement, and domain-specific retrieval, with a starter app using Docker, uv, and Vespa.
Gemma 4 31B FP8 Matches Claude Sonnet 4.6 Medium in Custom Benchmark★ 75
r/LocalLLaMA top day50 days agoBenchmark
A Reddit user shared benchmark results showing Google's Gemma 4 31B (FP8) performing on par with Claude Sonnet 4.6 Medium. The custom evaluation harness tested complex tasks including Neo4j Cypher queries, entity extraction, agentic tool calling, Python coding, and multi-vector retrieval synthesis. This highlights how quantized mid-sized open-source models are closing the gap with leading proprietary frontier models.
[AINews] 新一代 AI 基礎設施獨角獸：Exa、Modal 與 TurboPuffer★ 75
Latent Space67 days agoBusiness
In an era of exploding AI applications, the competition and evolution of underlying AI infrastructure (AI Infra) is equally compelling. The latest issue of…
Hugging Face 推出 Ettin Reranker 重排模型家族：大幅提升 RAG 檢索精度與效率★ 80
Hugging Face Blog70 days agoRelease
In building Retrieval-Augmented Generation (RAG) systems, accurately locating the most relevant information from a vast document collection has always been the…
PaddleOCR 3.5 發布：支援 Transformers 後端，輕鬆執行 OCR 與文件解析任務★ 75
Hugging Face Blog70 days agoRelease
The well-known open-source OCR (Optical Character Recognition) toolkit PaddleOCR has long been celebrated for its high accuracy, lightweight models, and strong…
IBM 發布 Granite Embedding Multilingual R2：具備 32K 上下文與 Apache 2.0 授權，100M 參數以下最強多語言嵌入模型★ 75
Hugging Face Blog74 days agoRelease
IBM has officially released a new multilingual embedding model on the Hugging Face platform called "Granite Embedding Multilingual R2." The model's most…
[AINews] 微調的終結？探討 Fine-tuning 在大模型時代的未來與轉變★ 75
Latent Space76 days agoOpinion
As AI technology continues to iterate at a rapid pace, the developer community is confronting a profound rethinking of the question: "Is fine-tuning heading…
使用 Sentence Transformers 訓練與微調多模態嵌入與 Reranker 模型★ 80
Hugging Face Blog103 days agoTutorial
As multimodal AI has become widespread, integrating data from different modalities — text, images, and more — into a single vector space and performing…
Sentence Transformers 推出多模態嵌入與重排（Reranker）模型支援★ 78
Hugging Face Blog110 days agoRelease
The popular open-source library `sentence-transformers` from Hugging Face has received a major update, officially introducing native support for Multimodal…
一天內構建專屬領域的 Embedding 模型：Hugging Face 與 NVIDIA 實戰指南★ 80
Hugging Face Blog129 days agoTutorial
When building Retrieval-Augmented Generation (RAG) systems, general-purpose embedding models (such as those from OpenAI or common open-source alternatives)…
OpenEvidence 如何利用 Vercel 打造讓醫生真正信賴的醫療 AI 系統
Vercel Changelog153 days agoBusiness
In the medical field, AI "hallucinations" and uncertainty are the biggest barriers to widespread adoption. When making clinical decisions, doctors need…
Import AI 441：我的 AI Agent 開始工作了，你的呢？以及如何用「毒泉」污染 AI 系統★ 75
Import AI (Jack Clark)190 days agoCommentary
### The Age of Practical AI Agents Has Arrived In this edition of his column, Jack Clark shares his personal breakthrough in using AI Agents. Previously, many…
打造 Deep Research：Tavily 分享如何達到業界領先的 AI 深度研究技術★ 78
Hugging Face Blog245 days agoTutorial
In this technical blog post published on Hugging Face, Tavily — a search engine designed specifically for AI agents — details how they built a "Deep Research"…
Sentence Transformers 正式加入 Hugging Face 團隊，攜手推動開源嵌入模型生態系★ 75
Hugging Face Blog279 days agoRelease
In the fields of natural language processing (NLP) and vector retrieval, Sentence Transformers — founded by Nils Reimers — has long been the industry-standard…
使用開源模型大幅提升你的 OCR 工作流效率★ 80
Hugging Face Blog280 days agoTutorial
Traditional OCR systems (such as Tesseract) often struggle with complex layouts, multi-column tables, handwriting, and mathematical formulas, while using…
Replicate 推出 Datalab Marker 與 OCR 模型：快速將文件與圖片轉換為 Markdown 與精確文字定位★ 75
Replicate Blog280 days agoRelease
The Replicate platform has newly listed two powerful document and image parsing models developed by Datalab: "Datalab Marker" and "Datalab OCR." They are…
Hugging Face 推出 RTEB：全新檢索評估標準，為 RAG 系統打造更真實的測試基準★ 80
Hugging Face Blog300 days agoRelease
As Retrieval-Augmented Generation (RAG) becomes the dominant architecture for enterprises deploying large language models (LLMs), accurately evaluating the…
mmBERT：ModernBERT 邁向多語言時代，開源高效能多語言編碼器模型登場★ 78
Hugging Face Blog322 days agoRelease
In today's era dominated by generative AI and large language models (LLMs), bidirectional encoder models (such as BERT and RoBERTa) still play an indispensable…
歡迎 EmbeddingGemma：Google 全新高效嵌入模型上線 Hugging Face★ 75
Hugging Face Blog327 days agoRelease
Google has recently launched a new open-source text embedding model called "EmbeddingGemma" on the Hugging Face platform. This model is built on the…
MCP for Research：如何將 AI 連接到學術研究工具★ 75
Hugging Face Blog344 days agoTutorial
As the use of AI in academic research becomes increasingly widespread, enabling large language models (LLMs) to access the latest scientific literature in real…
Hugging Face 推出 Parquet 內容定義分塊 (CDC)：優化大規模 AI 資料集去重與傳輸效率★ 75
Hugging Face Blog368 days agoRelease
### What Is Parquet Content-Defined Chunking (CDC)? In the AI and machine learning field, dataset sizes are growing at a staggering pace. Datasets on the…
Ettin Suite：Hugging Face 推出最先進的成對編碼器與解碼器開源套件★ 75
Hugging Face Blog377 days agoRelease
Hugging Face has officially launched the Ettin Suite, a brand-new state-of-the-art (SoTA) open-source model family of "Paired Encoders and Decoders." In…
使用 Sentence Transformers 訓練與微調稀疏嵌入模型 (Sparse Embedding Models)★ 75
Hugging Face Blog392 days agoTutorial
This technical blog post from Hugging Face provides a detailed guide on how to train and fine-tune "Sparse Embedding Models" using the Sentence Transformers…

Page 1Next →

Latest in AI

Mistral AI Releases Codestral Embed Research Code Embedding Model

Is Grep All You Need? How Agent Harnesses Reshape Agentic Search

Cohere Healthcare & Life Sciences: Transforming Clinical Research and Medical Data Retrieval with Enterprise AI

Cohere Blog: Technology Tag Page Overview

Co/plot: Supporting the Research Process Through Visualization

Cohere Product Launch Blog Hub

Cohere Official Research Blog and Technical Publications

Introducing Search Toolkit★ 72

Gemma 4 31B FP8 Matches Claude Sonnet 4.6 Medium in Custom Benchmark★ 75

[AINews] 新一代 AI 基礎設施獨角獸：Exa、Modal 與 TurboPuffer★ 75

Hugging Face 推出 Ettin Reranker 重排模型家族：大幅提升 RAG 檢索精度與效率★ 80

PaddleOCR 3.5 發布：支援 Transformers 後端，輕鬆執行 OCR 與文件解析任務★ 75

IBM 發布 Granite Embedding Multilingual R2：具備 32K 上下文與 Apache 2.0 授權，100M 參數以下最強多語言嵌入模型★ 75

[AINews] 微調的終結？探討 Fine-tuning 在大模型時代的未來與轉變★ 75

使用 Sentence Transformers 訓練與微調多模態嵌入與 Reranker 模型★ 80

Sentence Transformers 推出多模態嵌入與重排（Reranker）模型支援★ 78

一天內構建專屬領域的 Embedding 模型：Hugging Face 與 NVIDIA 實戰指南★ 80

OpenEvidence 如何利用 Vercel 打造讓醫生真正信賴的醫療 AI 系統

Import AI 441：我的 AI Agent 開始工作了，你的呢？以及如何用「毒泉」污染 AI 系統★ 75

打造 Deep Research：Tavily 分享如何達到業界領先的 AI 深度研究技術★ 78

Sentence Transformers 正式加入 Hugging Face 團隊，攜手推動開源嵌入模型生態系★ 75

使用開源模型大幅提升你的 OCR 工作流效率★ 80

Replicate 推出 Datalab Marker 與 OCR 模型：快速將文件與圖片轉換為 Markdown 與精確文字定位★ 75

Hugging Face 推出 RTEB：全新檢索評估標準，為 RAG 系統打造更真實的測試基準★ 80

mmBERT：ModernBERT 邁向多語言時代，開源高效能多語言編碼器模型登場★ 78

歡迎 EmbeddingGemma：Google 全新高效嵌入模型上線 Hugging Face★ 75

MCP for Research：如何將 AI 連接到學術研究工具★ 75

Hugging Face 推出 Parquet 內容定義分塊 (CDC)：優化大規模 AI 資料集去重與傳輸效率★ 75

Ettin Suite：Hugging Face 推出最先進的成對編碼器與解碼器開源套件★ 75

使用 Sentence Transformers 訓練與微調稀疏嵌入模型 (Sparse Embedding Models)★ 75