Latest in AI

Showing: on-device

Google Introduces Gemma 4 12B: A Unified, Encoder-Free Multimodal Model ★ 85
Google DeepMind Blog · 48 days ago · Release
Google DeepMind has unveiled Gemma 4 12B, a next-generation open-weights model featuring a unified, encoder-free multimodal architecture. By eliminating the traditional separate vision encoder (such as ViT), it processes diverse modalities directly within a single Transformer network. This design simplifies training, reduces inference latency, and enhances cross-modal alignment, marking a significant milestone for open-source AI.
NVIDIA Launches Nemotron 3 Nano Omni: A Long-Context Multimodal Intelligence Model Built for Document, Speech, and Video Agents ★ 75
Hugging Face Blog · 90 days ago · Release
NVIDIA has officially launched a new lightweight multimodal model, "Nemotron 3 Nano Omni." This model is designed to deliver powerful multimodal intelligence…
Google Announces Gemma 4: A Frontier Multimodal Open Model Designed for On-Device Use ★ 85
Hugging Face Blog · 117 days ago · Release
Google and Hugging Face have jointly announced a new generation of open-weight models — "Gemma 4." This model represents a major breakthrough in on-device AI…
Hugging Face Open Source Ecosystem Report: Spring 2026 Edition ★ 85
Hugging Face Blog · 132 days ago · Commentary
Hugging Face has published its Spring 2026 "State of Open Source AI" report, offering a comprehensive review of the explosive growth and paradigm shifts that…
Google DeepMind Launches Gemma 3 270M: An Ultra-Lightweight Model Designed for Hyper-Efficient AI ★ 72
Google DeepMind Blog · 277 days ago · Release
Google DeepMind has officially announced the addition of a highly distinctive and specialized new member to its open-source model family — Gemma 3 270M. This…
SOTA On-Device OCR on Apple Platforms with Core ML and dots.ocr ★ 72
Hugging Face Blog · 299 days ago · Release
This technical article from Hugging Face introduces how to deploy a state-of-the-art (SOTA) optical character recognition (OCR) model called dots.ocr using…
Gemma 3n Is Now Fully Available in the Open-Source Ecosystem: A Complete Hugging Face Support Guide ★ 80
Hugging Face Blog · 397 days ago · Release
Google's open-source model family welcomes a new member! The all-new Gemma 3n model series is now fully available within the Hugging Face ecosystem. Gemma 3n…
Google DeepMind Launches Gemini Robotics On-Device: A High-Performance AI Model Bringing General-Purpose Dexterity and Fast Adaptation to Local Robotic Devices ★ 85
Google DeepMind Blog · 398 days ago · Release
Google DeepMind has released the "Gemini Robotics On-Device" model, a significant breakthrough that brings advanced Gemini AI capabilities directly to local…
Real-Time AI Sound Generation on Arm: A Personal Tool for Creative Freedom
Hugging Face Blog · 419 days ago · Release
As generative AI technology becomes more widespread, AI Sound Generation has become an indispensable part of modern multimedia creation, game development, and…
Google Announces Gemma 3n Preview: A Powerful, Efficient, Mobile-First On-Device Multimodal AI Model ★ 78
Google DeepMind Blog · 434 days ago · Release
Google DeepMind has officially released a preview of its new open model "Gemma 3n." This is a cutting-edge open model purpose-built for mobile devices and…
SmolVLM2: A Lightweight Vision-Language Model Bringing Video Understanding to Every Device ★ 80
Hugging Face Blog · 523 days ago · Release
Hugging Face has introduced SmolVLM2, the latest addition to its Smol family of lightweight models. SmolVLM2 is designed to bring advanced vision-language…
Hugging Face Introduces an Even Lighter SmolVLM: New 256M and 500M Ultra-Small Vision-Language Models Arrive ★ 75
Hugging Face Blog · 551 days ago · Release
Hugging Face has officially introduced the newest members of the SmolVLM family, pushing vision-language model (VLM) sizes even further down to 256M (256…
Meta Launches Llama 3.2: Lightweight Models with Vision Multimodality and Edge-Device Support, Fully Supported on Hugging Face ★ 95
Hugging Face Blog · 671 days ago · Release
Meta has officially introduced the Llama 3.2 family of open-source models, marking a significant architectural upgrade with two major breakthroughs: multimodal…
Hugging Face Releases Swift Transformers: Run LLMs Locally on Apple Devices ★ 75
Hugging Face Blog · 1,085 days ago · Release
Hugging Face has officially announced the release of a new open-source Swift package — `swift-transformers`. This tool is designed specifically for developers…
Hugging Face Open-Sources Knowledge Distillation Code and Model Weights for SD-Small and SD-Tiny ★ 70
Hugging Face Blog · 1,092 days ago · Release
While Stable Diffusion (SD) 1.5 has demonstrated powerful image generation capabilities, its 860 million parameter count still presents challenges for edge…
