Latest in AI

Showing:vlmGeneralClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

TII 推出全新 Falcon Perception 多模態感知模型★ 75
Hugging Face Blog118 days agoRelease
The Technology Innovation Institute (TII) of the UAE has officially announced the launch of its new "Falcon Perception" model on the Hugging Face blog. As an…
Hugging Face 推出 AI Sheets 影像功能：用試算表輕鬆解鎖批次影像處理與多模態分析
Hugging Face Blog280 days agoNew Tool
Hugging Face has recently released a major update for its innovative spreadsheet AI tool "AI Sheets," officially unlocking powerful image processing…
Hcompany 推出 Holo1：全新 GUI 自動化 VLM 家族，為智慧代理人 Surfer-H 提供強大動力★ 78
Hugging Face Blog420 days agoRelease
H (formerly Holistic AI), a highly regarded French AI startup, recently officially released a new family of vision-language models (VLMs) on the Hugging Face…
Hugging Face 釋出 2025 視覺語言模型（VLM）指南：更強、更快、更實用的開源新時代★ 80
Hugging Face Blog442 days agoOpinion
With the explosion of multimodal technology, Vision Language Models (VLMs) have evolved from laboratory research prototypes into core tools for enterprises and…
深入解析 Aya Vision：推動多語言多模態 AI 的前沿發展★ 75
Hugging Face Blog511 days agoRelease
Cohere For AI (C4AI) has officially launched "Aya Vision," a series of open-source multimodal models (available in 8B and 32B parameter versions) designed…
視覺文件檢索邁向多語言：Hugging Face 推出 VDR-2B-multilingual 模型★ 80
Hugging Face Blog564 days agoRelease
Hugging Face has recently released a new Visual Document Retrieval (VDR) model — **VDR-2B-multilingual**. This technology marks a formal transition in document…
阿布達比 TII 發表 Falcon 2 11B：搭載 5 兆 Token 訓練的預訓練語言與視覺語言模型★ 75
Hugging Face Blog795 days agoRelease
The Technology Innovation Institute (TII) of Abu Dhabi has officially released a new open-source model family on Hugging Face — Falcon 2 11B. This model, with…
Hugging Face 推出 Idefics2：強大的 8B 開源視覺語言模型★ 80
Hugging Face Blog834 days agoRelease
Hugging Face has announced the launch of Idefics2, the next generation of its open-source Vision Language Model (VLM). With 8 billion (8B) parameters, this…