Latest in AI

Production-Ready W4A8: vLLM Integration and Quality Recovery Techniques
Cohere Blog · 46 days ago · Tutorial
Cohere’s post appears to explain how W4A8 quantization can be prepared for production inference through vLLM integration. From the title, the focus is likely on deployment mechanics and techniques for recovering model quality after aggressive quantization. Because no article body is available, specific benchmarks, supported models, implementation steps, and measured quality gains cannot be confirmed.
Vercel Raises Python Functions Bundle Size Limit to 500MB ★ 75
Vercel Changelog · 154 days ago · Release
The official Vercel Changelog announces that the bundle size limit for Python Vercel Functions (serverless functions) has been significantly raised to 500MB…
Anyone Can Build Agents, but It Takes a Platform to Run Them: Vercel on the Deployment Challenges of AI Agents ★ 75
Vercel Changelog · 169 days ago · Opinion
As the reasoning capabilities of Large Language Models (LLMs) improve, building a simple AI Agent has become easier than ever before. Developers can combine a…
Speeding Up Model Startup and Inference with Torch Compile Caching ★ 75
Replicate Blog · 323 days ago · Tutorial
When deploying modern AI models (such as LLaMA, Flux, or Stable Diffusion), `torch.compile`, introduced in PyTorch 2.0, is a powerful performance…
Accelerating Deployment of Diverse LLMs on Hugging Face with NVIDIA NIM ★ 80
Hugging Face Blog · 371 days ago · Release
Hugging Face and NVIDIA have announced a new collaboration to bring NVIDIA NIM (NVIDIA Inference Microservices) into the Hugging Face ecosystem, with the goal…
Dell Enterprise Hub Helps Enterprises Easily Build AI Applications On-Premises ★ 75
Hugging Face Blog · 431 days ago · Release
As enterprises place ever-increasing demands on data privacy, security, and regulatory compliance, deploying AI models on-premises has become the preferred…
Hugging Face Partners with FriendliAI to Accelerate Model Deployment on the Hub ★ 70
Hugging Face Blog · 552 days ago · Release
Hugging Face has announced a strategic partnership with FriendliAI, a company specializing in high-performance AI inference, aimed at comprehensively improving…
Hugging Face Launches HUGS: Easily Scale Your AI Applications with Open Models ★ 85
Hugging Face Blog · 643 days ago · Release
Hugging Face has officially launched HUGS (Hugging Face Generative AI Services), a brand-new microservices solution designed to address the pain points enterprises face…
Deploying Speech-to-Speech Models on Hugging Face ★ 75
Hugging Face Blog · 644 days ago · Tutorial
As real-time voice interaction technologies like GPT-4o become more widespread, the open-source community is also actively developing speech-to-speech (S2S)…
Google Cloud TPUs Arrive on Hugging Face, with Support for Inference Endpoints and Spaces ★ 75
Hugging Face Blog · 749 days ago · Release
Hugging Face announced a deep partnership with Google Cloud, officially integrating Google Cloud TPUs (Tensor Processing Units) into the Hugging Face platform…
Building AI On-Premises: Hugging Face and Dell Launch Dell Enterprise Hub ★ 75
Hugging Face Blog · 798 days ago · Release
Hugging Face and Dell Technologies have announced the launch of the "Dell Enterprise Hub," a new solution designed for enterprise on-premise AI deployment. As…
Easily Deploy High-Performance Embedding Models with Hugging Face Inference Endpoints ★ 75
Hugging Face Blog · 1,008 days ago · Release
As large language models (LLMs) and Retrieval-Augmented Generation (RAG) technology become increasingly widespread, embedding models have become an…
Easily Deploy Large Language Models (LLMs) with Hugging Face Inference Endpoints ★ 75
Hugging Face Blog · 1,120 days ago · Tutorial
This official Hugging Face blog post introduces how to use their hosted service "Inference Endpoints" to deploy large language models (LLMs). With the rapid…
Deploying Livebook Notebooks as Hugging Face Spaces Apps
Hugging Face Blog · 1,139 days ago · New Tool
Hugging Face has announced a partnership with Livebook, the well-known interactive notebook tool from the Elixir ecosystem, to officially support deploying…
How to Host a Unity Game in Hugging Face Spaces
Hugging Face Blog · 1,194 days ago · Tutorial
This tutorial from the official Hugging Face blog details how to host a Unity game on the Hugging Face Spaces platform. As AI applications in game development…
Why We Switched to Hugging Face Inference Endpoints, and Maybe You Should Too
Hugging Face Blog · 1,259 days ago · Opinion
This case study from Mantis NLP details the core reasons behind their decision to migrate their machine learning model deployment workflow from traditional…
A Complete Guide to Hugging Face Inference Solutions: From Free APIs to Enterprise Deployment ★ 75
Hugging Face Blog · 1,345 days ago · Tutorial
As the world's largest open-source AI model hub, Hugging Face not only provides model hosting but has also built a complete inference ecosystem. This article…
Getting Started with Hugging Face Inference Endpoints: Easily Deploy Production-Grade AI Models ★ 75
Hugging Face Blog · 1,383 days ago · Tutorial
Hugging Face Inference Endpoints is a fully managed service designed for developers and enterprises, built to solve the pain points of deploying machine…
Deploying a Hugging Face ViT (Vision Transformer) Image Classification Model on Vertex AI
Hugging Face Blog · 1,439 days ago · Tutorial
This technical tutorial from the official Hugging Face blog provides a detailed walkthrough of how to deploy the popular computer vision model ViT (Vision…
Deploying TensorFlow Vision Models in Hugging Face with TF Serving
Hugging Face Blog · 1,464 days ago · Tutorial
This is an official technical guide published by Hugging Face, designed to help developers deploy TensorFlow computer vision models from the Hugging Face Hub…
Hugging Face Insights from Machine Learning Directors: Challenges and Trends from Prototype to Production
Hugging Face Blog · 1,553 days ago · Opinion
This blog post published by Hugging Face in 2022 takes an in-depth look at the challenges, technology trends, and management insights that enterprise Directors…
Showcase Your AI Projects with Gradio on Hugging Face Spaces ★ 75
Hugging Face Blog · 1,757 days ago · Release
Hugging Face has officially launched its "Spaces" service with full support for the popular lightweight UI framework Gradio, aiming to make it easier for…
Hosting Your Models and Datasets with Streamlit on Hugging Face Spaces ★ 70
Hugging Face Blog · 1,757 days ago · New Tool
Hugging Face has announced the launch of its new "Spaces" feature, designed to provide the machine learning community with a simple, fast, and free platform…
Easily Deploy Hugging Face Models with Amazon SageMaker
Hugging Face Blog · 1,846 days ago · Release
Hugging Face and Amazon Web Services (AWS) have entered into a deep collaboration aimed at simplifying the deployment process of machine learning models from…
