Latest in AI

Showing:optimizationResearchersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

使用 Hugging Face 輕鬆構建與分享 ROCm 核心 (Kernels)★ 70
Hugging Face Blog253 days agoRelease
Hugging Face recently announced a major update for AMD GPU users and developers, aimed at simplifying the process of building, packaging, and sharing ROCm…
Replicate 如何優化 FLUX.1 Kontext [dev]：深入解析 Taylor Seer 優化技術★ 75
Replicate Blog378 days agoTutorial
In the generative AI space, the FLUX.1 model developed by Black Forest Labs is renowned for its outstanding image quality and text rendering capabilities…
介紹 AutoRound：Intel 針對 LLM 與 VLM 的先進量化技術★ 75
Hugging Face Blog455 days agoRelease
As large language models (LLMs) and vision language models (VLMs) continue to scale up, running these models on limited hardware resources — such as…
Replicate 讓 FLUX 運行速度大幅提升，並開源其優化技術★ 72
Replicate Blog656 days agoRelease
The well-known AI model hosting platform Replicate has announced a significant speed improvement for FLUX image generation models running on its platform. FLUX…
AudioLDM 2 速度優化指南：如何讓文字轉音訊與音樂生成快上加快 ⚡️★ 75
Hugging Face Blog1,063 days agoTutorial
AudioLDM 2 is an advanced open-source text-to-audio and text-to-music generation model. However, under its default settings, the model's inference speed is…
使用 🤗 Transformers 優化 Bark 語音生成模型★ 75
Hugging Face Blog1,084 days agoTutorial
Bark is an innovative text-to-audio model developed by the team at Suno. It can generate not only high-quality, multilingual speech, but also background music…
Diffusers 釋出兩個月更新：Stable Diffusion 整合、效能優化與全新 Pipeline★ 70
Hugging Face Blog1,415 days agoRelease
Hugging Face published a major update and progress report on their diffusion model library `diffusers`, two months after its launch. Since its release in July…
使用 Optimum 與 Transformers Pipelines 加速模型推論★ 75
Hugging Face Blog1,540 days agoRelease
When deploying Transformer models in production, reducing inference latency and increasing throughput while keeping computational costs under control has…
在 CPU 上擴展 BERT 推論效能（第一部分）
Hugging Face Blog1,925 days agoTutorial
In many real-world enterprise production environments, although GPUs offer extremely high throughput for deep learning inference, CPUs remain indispensable due…
使用區塊稀疏矩陣（Block Sparse Matrices）打造更小、更快的語言模型
Hugging Face Blog2,147 days agoTutorial
In the field of natural language processing (NLP), the Transformer architecture has become the dominant paradigm, but its core self-attention mechanism…