Latest in AI

Showing:llm-deploymentClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

介紹 AMD 第 5 代 EPYC™ 處理器：Hugging Face 攜手 AMD 釋放 CPU 的 AI 推論潛能★ 75
Hugging Face Blog656 days agoRelease
AMD has officially launched its 5th-generation EPYC processor, codenamed "Turin," and Hugging Face has promptly published a blog post detailing the deep…
使用 Optimum-Intel 與 OpenVINO GenAI 進行模型優化與部署★ 75
Hugging Face Blog676 days agoTutorial
This article provides a detailed look at how to use Hugging Face's `optimum-intel` library and Intel's OpenVINO GenAI toolkit to optimize and deploy generative…
在 Hugging Face 上輕鬆將模型部署至 AWS Inferentia2 晶片★ 75
Hugging Face Blog797 days agoRelease
Hugging Face has announced official support for AWS Inferentia2 (Inf2) instances within its hosted Inference Endpoints service. This update gives developers…
Hugging Face TGI 推出 Messages API：輕鬆將 OpenAI 應用無縫轉移至開源 LLM★ 80
Hugging Face Blog901 days agoRelease
Hugging Face has announced official support for an OpenAI-compatible Messages API (i.e., the `/v1/chat/completions` endpoint) in its high-performance…
Chat Templates：終結默默蠶食模型效能的隱形殺手★ 85
Hugging Face Blog1,029 days agoRelease
As large language models (LLMs) shift toward conversational (Chat/Instruct) applications, correctly formatting and feeding a user's conversation history —…
Llama 2 在 Amazon SageMaker 上的部署效能基準測試
Hugging Face Blog1,036 days agoTutorial
This Hugging Face blog post presents detailed performance benchmarks for deploying Meta's open-source large language models — Llama 2 (covering 7B, 13B, and…
使用 Hugging Face Transformers 與 Amazon SageMaker 部署 GPT-J 6B 進行推論
Hugging Face Blog1,659 days agoTutorial
With the rise of open-source large language models, deploying these models in cloud environments in a secure, stable, and scalable manner has become a critical…