Latest in AI

Introducing AMD 5th Gen EPYC™ Processors: Hugging Face and AMD Unlock the AI Inference Potential of CPUs ★ 75
Hugging Face Blog · 656 days ago · Release
AMD has officially launched its 5th-generation EPYC processor, codenamed "Turin," and Hugging Face has promptly published a blog post detailing the deep…
Optimizing and Deploying Models with Optimum-Intel and OpenVINO GenAI ★ 75
Hugging Face Blog · 676 days ago · Tutorial
This article provides a detailed look at how to use Hugging Face's `optimum-intel` library and Intel's OpenVINO GenAI toolkit to optimize and deploy generative…
Easily Deploy Models to AWS Inferentia2 Chips on Hugging Face ★ 75
Hugging Face Blog · 797 days ago · Release
Hugging Face has announced official support for AWS Inferentia2 (Inf2) instances within its hosted Inference Endpoints service. This update gives developers…
Chat Templates: Ending the Silent Killer That Quietly Eats Away at Model Performance ★ 85
Hugging Face Blog · 1,029 days ago · Release
As large language models (LLMs) shift toward conversational (Chat/Instruct) applications, correctly formatting and feeding a user's conversation history —…
Deployment Performance Benchmarks for Llama 2 on Amazon SageMaker
Hugging Face Blog · 1,036 days ago · Tutorial
This Hugging Face blog post presents detailed performance benchmarks for deploying Meta's open-source large language models — Llama 2 (covering 7B, 13B, and…
Deploying GPT-J 6B for Inference with Hugging Face Transformers and Amazon SageMaker
Hugging Face Blog · 1,659 days ago · Tutorial
With the rise of open-source large language models, deploying these models in cloud environments in a secure, stable, and scalable manner has become a critical…