Latest in AI

Showing:fsdpClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Hugging Face Accelerate ND-Parallel 指南：高效多 GPU 訓練完全解析★ 80
Hugging Face Blog354 days agoTutorial
As the parameter counts of generative AI and large language models (LLMs) push into the tens and hundreds of billions, the memory of a single GPU has long been…
Hugging Face 釋出 Accelerate 1.0.0 正式版：邁向分散式訓練與大模型推理的全新里程碑★ 80
Hugging Face Blog683 days agoRelease
Hugging Face has officially released version 1.0.0 of its core open-source library, Accelerate. This is a milestone update, signifying that since the project's…
從 DeepSpeed 到 FSDP 再切換回來：使用 Hugging Face Accelerate 實現無縫分散式訓練★ 75
Hugging Face Blog775 days agoTutorial
In the era of large language models (LLMs), the VRAM of a single GPU is often insufficient to hold models with tens of billions of parameters. To overcome this…
使用 PyTorch FSDP 高效微調 Llama 2 70B：解決 CPU 記憶體不足的實務指南★ 72
Hugging Face Blog1,049 days agoTutorial
When fine-tuning massively large open-source models like Llama 2 70B — with its 70 billion parameters — developers frequently encounter a bottleneck that goes…
使用 PyTorch Fully Sharded Data Parallel (FSDP) 加速超大型模型訓練★ 75
Hugging Face Blog1,548 days agoRelease
As AI model scale has grown exponentially, training large models with billions of parameters has become the norm — but this also presents enormous hardware…