Latest in AI

Showing: pre-training, Developers

Cosmopedia: How to Create Large-Scale Synthetic Data for Pre-training Large Language Models ★ 85
Hugging Face Blog · 860 days ago · Release
Hugging Face has released Cosmopedia, currently the largest fully open-source synthetic dataset designed for pre-training large language…
GaLore: A Breakthrough Technique for Training Large Language Models on Consumer-Grade Hardware ★ 85
Hugging Face Blog · 860 days ago · Release
As the parameter counts of large language models (LLMs) have skyrocketed, the hardware requirements for training and fine-tuning these models have risen…
IBM's Lightweight Time Series Model PatchTSMixer Officially Integrated into Hugging Face ★ 75
Hugging Face Blog · 921 days ago · Release
Time series forecasting is critically important in fields such as finance, meteorology, energy, and the Internet of Things. In recent years, while the…
A Deep Dive into the Principles and Architecture of Vision-Language Models ★ 80
Hugging Face Blog · 1,271 days ago · Tutorial
This is a classic technical guide written by the Hugging Face team, designed to help developers and researchers gain a deep understanding of how…
Efficient Table Pre-training Without Real Data: An Introduction to the TAPEX Concept and Its Hugging Face Integration
Hugging Face Blog · 1,527 days ago · Release
When working with structured data such as tables, traditional pre-trained models typically require crawling large amounts of real-world tables and related text…
BERT 101: A Complete Explanation of How the State-of-the-Art NLP Model Works
Hugging Face Blog · 1,609 days ago · Tutorial
BERT (Bidirectional Encoder Representations from Transformers) is a landmark natural language processing (NLP) model proposed by Google in 2018. This Hugging…
Training CodeParrot 🦜 from Scratch: A Hands-On Guide to Hugging Face's Code Generation Model
Hugging Face Blog · 1,693 days ago · Tutorial
This classic Hugging Face blog post documents the birth of the "CodeParrot" project — an experiment in training a code generation model entirely from scratch…
How to Train a New Language Model from Scratch Using Transformers and Tokenizers ★ 75
Hugging Face Blog · 2,356 days ago · Tutorial
This classic blog post from Hugging Face provides a detailed walkthrough of how to use their open-source ecosystem libraries — `transformers` and `tokenizers`…