Latest in AI

Showing:fine-tuningClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

讓 GPU 毫無閒置：利用 TRL 中協同部署的 vLLM 解鎖高效能強化學習訓練★ 85
Hugging Face Blog420 days agoRelease
In the reinforcement learning from human feedback (RLHF) training process for large language models — whether PPO or the recently popular GRPO — there are…
🐯 Liger GRPO 攜手 TRL：大幅降低 DeepSeek-R1 式強化學習訓練顯存與加速★ 82
Hugging Face Blog429 days agoNew Tool
Since the explosive rise of DeepSeek-R1, GRPO (Group Relative Policy Optimization) has become the most widely discussed reinforcement learning (RL) technique…
nanoVLM：用純 PyTorch 訓練視覺語言模型（VLM）的最簡開源專案★ 75
Hugging Face Blog433 days agoRelease
Hugging Face recently launched an open-source project called nanoVLM, positioned as "the simplest repository for training Vision Language Models (VLMs) in pure…
Falcon-Edge：TII 推出一系列強大、通用且可微調的 1.58-bit 邊緣端語言模型★ 82
Hugging Face Blog439 days agoRelease
The Technology Innovation Institute (TII) of the United Arab Emirates has officially released the "Falcon-Edge" model series on Hugging Face. This is a family…
微調 olmOCR 打造高保真度 OCR 引擎★ 75
Hugging Face Blog461 days agoTutorial
### Background With the proliferation of vision-language models (VLMs), using VLMs for document OCR (e.g., converting PDFs to Markdown) has become mainstream…
Hugging Face 經典 NLP 課程正式轉型為 LLM 課程：迎向大語言模型時代的全面升級★ 85
Hugging Face Blog481 days agoTutorial
Hugging Face's "NLP Course" has long been a must-read classic for developers and researchers worldwide looking to enter the fields of Transformers and natural…
使用 Sentence Transformers 訓練與微調 Reranker 重排模型教學★ 80
Hugging Face Blog489 days agoTutorial
When building RAG (Retrieval-Augmented Generation) systems, relying solely on vector embeddings for semantic search is often not precise enough. To improve…
Open R1 第三次更新：Hugging Face 釋出開源推理模型與 GRPO 訓練優化細節★ 85
Hugging Face Blog503 days agoRelease
Since its launch, Hugging Face's Open R1 project has been dedicated to replicating the reasoning capabilities of DeepSeek-R1 in a fully open-source manner. In…
Hugging Face 釋出 vid_ds_scripts：一站式構建影片生成高品質資料集★ 75
Hugging Face Blog531 days agoNew Tool
With the rise of open-source video generation models such as LTX-Video, HunyuanVideo, and CogVideoX, building high-quality training datasets has become the…
如何在 AWS 上部署與微調 DeepSeek 模型：Hugging Face 官方指南★ 85
Hugging Face Blog544 days agoTutorial
As DeepSeek-R1 swept through the AI landscape on the strength of its powerful reasoning capabilities, how to safely and efficiently deploy and fine-tune these…
Replicate 推出開源影片模型微調服務，支援自訂 Tencent HunyuanVideo 的風格、動作與角色★ 75
Replicate Blog550 days agoNew Tool
AI cloud hosting and API services platform Replicate has announced a major update: users can now fine-tune open-source video generation models directly on the…
Hugging Face 推出 Synthetic Data Generator：用自然語言輕鬆構建 AI 訓練資料集★ 82
Hugging Face Blog589 days agoNew Tool
Hugging Face launched a brand-new "Synthetic Data Generator" in December 2024 — a web-based, no-code tool designed to allow anyone to create high-quality AI…
投資於效能：利用 LLM 洞察微調小型模型 — CFM 案例研究★ 75
Hugging Face Blog602 days agoBusiness
This case study from Hugging Face details how quantitative asset management firm Capital Fund Management (CFM) has optimized its investment and research…
Replicate 大幅提升 FLUX 微調速度，並將優化技術開源★ 75
Replicate Blog609 days agoRelease
The AI cloud hosting platform Replicate has announced a major fine-tuning speed optimization for FLUX.1, currently the most popular open-source image…
Argilla 2.4 發布：在 Hugging Face Hub 上免程式碼輕鬆構建微調與評估數據集★ 75
Hugging Face Blog631 days agoRelease
The open-source data curation and annotation platform Argilla has officially released version 2.4, with the core of this update being deep integration with…
Llama 3.2 正式支援 Keras：跨框架輕鬆微調與部署 Meta 最新輕量級模型★ 75
Hugging Face Blog645 days agoRelease
Meta's Llama 3.2 release includes lightweight 1B and 3B text models designed specifically for edge computing and mobile devices. These models have now been…
修正梯度累積：解決 LLM 微調中常被忽視的數學偏差★ 85
Hugging Face Blog650 days agoTutorial
### The Mathematical Flaw in Traditional Gradient Accumulation Gradient accumulation is an extremely common technique in deep learning. When VRAM is limited…
使用合成訓練數據提升 Flux 微調效果的實用指南★ 75
Replicate Blog676 days agoTutorial
Fine-tuning the open-source image generation model Flux.1 has become a highly sought-after capability for creators and developers alike. However, relying…
微調 LLM 至 1.58-bit：讓極限模型量化變得簡單★ 85
Hugging Face Blog678 days agoTutorial
The deployment of large language models (LLMs) has long faced a dual bottleneck of VRAM capacity and memory bandwidth. Microsoft previously introduced the…
透過 API 微調 FLUX.1：Replicate 推出程式化微調服務★ 75
Replicate Blog687 days agoNew Tool
Replicate has officially announced support for fine-tuning the popular open-source image generation model FLUX.1 [dev] via its HTTP API. FLUX.1, developed by…
微調 FLUX.1 打造專屬個人寫真：Replicate 官方 LoRA 訓練教學★ 80
Replicate Blog697 days agoTutorial
This official Replicate tutorial walks through in detail how to use LoRA (Low-Rank Adaptation) technology to fine-tune FLUX.1 [dev] — currently the most…
Replicate Intelligence #12：Flux LoRA 訓練上線、熱門的祖克柏迷因與 Lex Fridman 訪談中的 Replicate★ 75
Replicate Blog704 days agoRelease
Replicate Intelligence #12 rounds up the most noteworthy AI technical developments and community trends from late August 2024, centered on three core themes…
透過 Flash Attention 2 的 Packing 技術提升 Hugging Face 訓練效率★ 80
Hugging Face Blog706 days agoTutorial
When fine-tuning or pre-training large language models (LLMs), the sequence lengths of input data are typically uneven. The traditional approach is to use…
Replicate Intelligence #11：微調 FLUX.1、生成式影像遊戲與元宇宙的新願景★ 75
Replicate Blog711 days agoRelease
This edition of Replicate Intelligence #11 compiles major recent technical breakthroughs and application trends in the generative AI space, focusing primarily…
Replicate 推出 FLUX.1 微調功能：用一行程式碼即可訓練專屬的 LoRA 圖像生成模型★ 80
Replicate Blog712 days agoRelease
Replicate, the well-known cloud AI execution platform, has announced official fine-tuning support for FLUX.1, the image generation model that has taken the…
LAVE：在 Docmatix 上使用 LLM 進行零樣本 VQA 評估——我們還需要微調嗎？★ 75
Hugging Face Blog733 days agoPaper
### Background and Challenges Document Visual Question Answering (DocVQA) is an important application of multimodal AI, requiring models to simultaneously…
Meta 推出 Llama 3.1：405B、70B 與 8B 旗艦開源模型，支援多語言與 128K 超長上下文★ 95
Hugging Face Blog735 days agoRelease
Meta's Llama 3.1 represents a major milestone in the open-source AI landscape. The most notable model is the 405B (405 billion parameter) version — the first…
如何利用 distilabel 打造 Argilla 2.0 專屬聊天機器人★ 75
Hugging Face Blog742 days agoTutorial
In the AI field, quickly building a chatbot that can accurately answer questions about a specific domain or newly released software has always been a major…
Replicate Intelligence #7：資料整理與資料生成的重要性
Replicate Blog746 days agoCommentary
In the current wave of generative AI, the industry's attention is gradually shifting from "fine-tuning model architectures" to "improving data quality." Issue…
NuminaMath 如何贏得首屆 AIMO 進步獎（AI 數學奧林匹亞）並宣佈完整開源★ 80
Hugging Face Blog747 days agoRelease
### Background and Achievement The AI Mathematical Olympiad (AIMO) Progress Prize aims to advance AI models capable of solving Olympiad-level mathematical…

← PreviousPage 2Next →

Latest in AI

讓 GPU 毫無閒置：利用 TRL 中協同部署的 vLLM 解鎖高效能強化學習訓練★ 85

🐯 Liger GRPO 攜手 TRL：大幅降低 DeepSeek-R1 式強化學習訓練顯存與加速★ 82

nanoVLM：用純 PyTorch 訓練視覺語言模型（VLM）的最簡開源專案★ 75

Falcon-Edge：TII 推出一系列強大、通用且可微調的 1.58-bit 邊緣端語言模型★ 82

微調 olmOCR 打造高保真度 OCR 引擎★ 75

Hugging Face 經典 NLP 課程正式轉型為 LLM 課程：迎向大語言模型時代的全面升級★ 85

使用 Sentence Transformers 訓練與微調 Reranker 重排模型教學★ 80

Open R1 第三次更新：Hugging Face 釋出開源推理模型與 GRPO 訓練優化細節★ 85

Hugging Face 釋出 vid_ds_scripts：一站式構建影片生成高品質資料集★ 75

如何在 AWS 上部署與微調 DeepSeek 模型：Hugging Face 官方指南★ 85

Replicate 推出開源影片模型微調服務，支援自訂 Tencent HunyuanVideo 的風格、動作與角色★ 75

Hugging Face 推出 Synthetic Data Generator：用自然語言輕鬆構建 AI 訓練資料集★ 82

投資於效能：利用 LLM 洞察微調小型模型 — CFM 案例研究★ 75

Replicate 大幅提升 FLUX 微調速度，並將優化技術開源★ 75

Argilla 2.4 發布：在 Hugging Face Hub 上免程式碼輕鬆構建微調與評估數據集★ 75

Llama 3.2 正式支援 Keras：跨框架輕鬆微調與部署 Meta 最新輕量級模型★ 75

修正梯度累積：解決 LLM 微調中常被忽視的數學偏差★ 85

使用合成訓練數據提升 Flux 微調效果的實用指南★ 75

微調 LLM 至 1.58-bit：讓極限模型量化變得簡單★ 85

透過 API 微調 FLUX.1：Replicate 推出程式化微調服務★ 75

微調 FLUX.1 打造專屬個人寫真：Replicate 官方 LoRA 訓練教學★ 80

Replicate Intelligence #12：Flux LoRA 訓練上線、熱門的祖克柏迷因與 Lex Fridman 訪談中的 Replicate★ 75

透過 Flash Attention 2 的 Packing 技術提升 Hugging Face 訓練效率★ 80

Replicate Intelligence #11：微調 FLUX.1、生成式影像遊戲與元宇宙的新願景★ 75

Replicate 推出 FLUX.1 微調功能：用一行程式碼即可訓練專屬的 LoRA 圖像生成模型★ 80

LAVE：在 Docmatix 上使用 LLM 進行零樣本 VQA 評估——我們還需要微調嗎？★ 75

Meta 推出 Llama 3.1：405B、70B 與 8B 旗艦開源模型，支援多語言與 128K 超長上下文★ 95

如何利用 distilabel 打造 Argilla 2.0 專屬聊天機器人★ 75

Replicate Intelligence #7：資料整理與資料生成的重要性

NuminaMath 如何贏得首屆 AIMO 進步獎（AI 數學奧林匹亞）並宣佈完整開源★ 80