Latest in AI

Showing:fine-tuningClear ×

← Home

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

視覺語言模型（VLM）的偏好最佳化指南：使用 TRL 進行 DPO 微調★ 75
Hugging Face Blog748 days agoTutorial
As vision-language models (VLMs) are increasingly applied to multimodal tasks, how to make these models produce outputs that better align with human…
Hugging Face 推出全新資料集搜尋與篩選功能，大幅提升數據檢索效率★ 70
Hugging Face Blog750 days agoRelease
Hugging Face's official blog announced in July 2024 the launch of new "Dataset Search and Filtering Features," aimed at addressing the pain point of precisely…
微調 Microsoft Florence-2：微軟頂尖視覺語言模型實戰指南★ 80
Hugging Face Blog764 days agoTutorial
Microsoft open-sourced Florence-2 in June 2024 — a vision-language model (VLM) based on a sequence-to-sequence architecture. Despite its compact size (the Base…
NVIDIA H100 GPU 即將登陸 Replicate：支援更快速的模型推理與訓練
Replicate Blog776 days agoRelease
The official blog of Replicate, the popular AI model hosting and deployment platform, has announced that NVIDIA H100 Tensor Core GPUs will soon be officially…
Hugging Face 推出 RLOO 演算法：降低記憶體消耗，讓強化學習重回 RLHF 主流★ 80
Hugging Face Blog776 days agoRelease
In recent years, methods such as Direct Preference Optimization (DPO) have become mainstream for large language model (LLM) alignment, as they eliminate the…
Diffusers 正式支援 Stable Diffusion 3：更強大的圖像生成與記憶體優化★ 80
Hugging Face Blog776 days agoRelease
Hugging Face's official blog announced that its diffusers library now officially supports Stable Diffusion 3 (SD3), the latest release from Stability AI. SD3…
Replicate Intelligence #3：Garden State Llama、實用 LLM 指南與即時影像生成
Replicate Blog781 days agoCommentary
This issue of Replicate Intelligence #3 brings curated content on three core themes for developers and AI enthusiasts: 1. **Garden State Llama**: This is a…
使用 Sentence Transformers 訓練與微調嵌入模型 (Embedding Models)★ 80
Hugging Face Blog791 days agoRelease
The official Hugging Face blog introduces a major update to the Sentence Transformers library (v3.0), centered on the launch of the new…
Google 推出 PaliGemma：結合 SigLIP 與 Gemma 的開源視覺語言模型★ 80
Hugging Face Blog805 days agoRelease
Google has officially launched PaliGemma, a powerful yet lightweight open-source Vision-Language Model (VLM). The release of PaliGemma represents a significant…
StarCoder2-Instruct：完全透明且具備寬鬆授權的程式碼生成自我對齊技術★ 75
Hugging Face Blog820 days agoRelease
### Background and Challenges In the field of code generation, instruction tuning is the key to improving a model's practical utility and alignment with human…
歡迎 Llama 3：Meta 全新開源大型語言模型正式登陸 Hugging Face★ 95
Hugging Face Blog831 days agoRelease
Meta officially released Llama 3, the next generation of its open-source large language models, on April 18, 2024. The initial release includes two parameter…
Ryght 攜手 Hugging Face 專家支援，賦能醫療與生命科學領域的 AI 轉型之旅
Hugging Face Blog833 days agoBusiness
This case study details how biomedical AI startup Ryght leveraged Hugging Face's Expert Support service to overcome the many challenges of deploying generative…
Hugging Face 推出 Idefics2：強大的 8B 開源視覺語言模型★ 80
Hugging Face Blog834 days agoRelease
Hugging Face has announced the launch of Idefics2, the next generation of its open-source Vision Language Model (VLM). With 8 billion (8B) parameters, this…
GaLore：在消費級硬體上訓練大型語言模型的突破性技術★ 85
Hugging Face Blog860 days agoRelease
As the parameter counts of large language models (LLMs) have skyrocketed, the hardware requirements for training and fine-tuning these models have risen…
在 NVIDIA DGX Cloud 上輕鬆使用 H100 GPU 訓練 Hugging Face 模型★ 75
Hugging Face Blog862 days agoRelease
Hugging Face has announced a deep partnership with NVIDIA to directly integrate NVIDIA DGX Cloud services into the Hugging Face platform. This collaboration…
在 Hugging Face 中微調 Gemma 模型★ 80
Hugging Face Blog886 days agoTutorial
After Google released the Gemma family of open-source models (including 2B and 7B parameter versions), Hugging Face promptly published this practical…
歡迎 Gemma：Google 全新開源大語言模型登陸 Hugging Face★ 85
Hugging Face Blog888 days agoRelease
Google has officially released a new family of open-source large language models called "Gemma" — a series of lightweight, state-of-the-art open-source models…
使用 🤗 Transformers 微調 W2V2-BERT 以進行低資源語音辨識 (ASR)★ 75
Hugging Face Blog921 days agoTutorial
This technical blog post from Hugging Face provides a detailed walkthrough of how to use the `transformers` library to fine-tune Meta's open-source W2V2-BERT…
使用直接偏好最佳化 (DPO) 方法對 LLM 進行偏好微調 (Preference Tuning)★ 80
Hugging Face Blog922 days agoTutorial
This technical blog post from Hugging Face takes an in-depth look at the latest techniques in "preference tuning," with a particular focus on **Direct…
使用 Unsloth 與 🤗 TRL 讓 LLM 微調速度提升 2 倍★ 80
Hugging Face Blog930 days agoRelease
Hugging Face's official blog announced a partnership with the Unsloth team to integrate Unsloth's efficient fine-tuning technology directly into Hugging Face's…
全世界的 LoRA 訓練腳本聯合起來！Hugging Face 推出全新 SDXL LoRA 進階訓練腳本★ 75
Hugging Face Blog938 days agoNew Tool
Hugging Face's official blog published a post titled "LoRA training scripts of the world, unite!" announcing the release of a powerful new advanced SDXL LoRA…
使用開源模型複製你的聲音：Replicate 推出 RVC 微調 API★ 75
Replicate Blog965 days agoRelease
AI cloud hosting platform Replicate has officially announced support for fine-tuning with RVC (Retrieval-based Voice Conversion). This new feature allows…
比較大語言模型性能：深入探討使用 LoRA 微調 RoBERTa、Llama 2 與 Mistral 進行災難推特分析★ 75
Hugging Face Blog994 days agoTutorial
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
Personal Copilot：訓練專屬於你的程式碼助手★ 75
Hugging Face Blog1,005 days agoTutorial
In everyday development, tools like GitHub Copilot dramatically improve productivity, but for enterprises or individual developers, general-purpose models may…
深入剖析：使用 PPO 進行 RLHF 的 N 個關鍵實作細節★ 85
Hugging Face Blog1,008 days agoTutorial
This technical blog post from Hugging Face takes an in-depth look at the critical "implementation details" that are routinely glossed over in academic papers…
在 Replicate 上微調 MusicGen，輕鬆生成任何風格的音樂
Replicate Blog1,019 days agoNew Tool
AI cloud deployment and runtime platform Replicate has announced official support for fine-tuning Meta's open-source music generation model MusicGen. This new…
使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75
Hugging Face Blog1,033 days agoRelease
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
非工程師指南：如何訓練專屬的 LLaMA 2 聊天機器人
Hugging Face Blog1,034 days agoTutorial
This official guide from Hugging Face is designed for readers without a technical background. It provides a detailed walkthrough of how to use Hugging Face's…
使用 PyTorch FSDP 高效微調 Llama 2 70B：解決 CPU 記憶體不足的實務指南★ 72
Hugging Face Blog1,049 days agoTutorial
When fine-tuning massively large open-source models like Llama 2 70B — with its 70 billion parameters — developers frequently encounter a bottleneck that goes…
Hugging Face SafeCoder 對決閉源程式碼助手：企業級私有化部署的安全性優勢★ 70
Hugging Face Blog1,051 days agoRelease
In today's software development workflows, AI coding assistants have become a critical tool for boosting developer productivity. However, for many enterprises…

← PreviousPage 3Next →

Latest in AI

視覺語言模型（VLM）的偏好最佳化指南：使用 TRL 進行 DPO 微調★ 75

Hugging Face 推出全新資料集搜尋與篩選功能，大幅提升數據檢索效率★ 70

微調 Microsoft Florence-2：微軟頂尖視覺語言模型實戰指南★ 80

NVIDIA H100 GPU 即將登陸 Replicate：支援更快速的模型推理與訓練

Hugging Face 推出 RLOO 演算法：降低記憶體消耗，讓強化學習重回 RLHF 主流★ 80

Diffusers 正式支援 Stable Diffusion 3：更強大的圖像生成與記憶體優化★ 80

Replicate Intelligence #3：Garden State Llama、實用 LLM 指南與即時影像生成

使用 Sentence Transformers 訓練與微調嵌入模型 (Embedding Models)★ 80

Google 推出 PaliGemma：結合 SigLIP 與 Gemma 的開源視覺語言模型★ 80

StarCoder2-Instruct：完全透明且具備寬鬆授權的程式碼生成自我對齊技術★ 75

歡迎 Llama 3：Meta 全新開源大型語言模型正式登陸 Hugging Face★ 95

Ryght 攜手 Hugging Face 專家支援，賦能醫療與生命科學領域的 AI 轉型之旅

Hugging Face 推出 Idefics2：強大的 8B 開源視覺語言模型★ 80

GaLore：在消費級硬體上訓練大型語言模型的突破性技術★ 85

在 NVIDIA DGX Cloud 上輕鬆使用 H100 GPU 訓練 Hugging Face 模型★ 75

在 Hugging Face 中微調 Gemma 模型★ 80

歡迎 Gemma：Google 全新開源大語言模型登陸 Hugging Face★ 85

使用 🤗 Transformers 微調 W2V2-BERT 以進行低資源語音辨識 (ASR)★ 75

使用直接偏好最佳化 (DPO) 方法對 LLM 進行偏好微調 (Preference Tuning)★ 80

使用 Unsloth 與 🤗 TRL 讓 LLM 微調速度提升 2 倍★ 80

全世界的 LoRA 訓練腳本聯合起來！Hugging Face 推出全新 SDXL LoRA 進階訓練腳本★ 75

使用開源模型複製你的聲音：Replicate 推出 RVC 微調 API★ 75

比較大語言模型性能：深入探討使用 LoRA 微調 RoBERTa、Llama 2 與 Mistral 進行災難推特分析★ 75

Personal Copilot：訓練專屬於你的程式碼助手★ 75

深入剖析：使用 PPO 進行 RLHF 的 N 個關鍵實作細節★ 85

在 Replicate 上微調 MusicGen，輕鬆生成任何風格的音樂

使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75

非工程師指南：如何訓練專屬的 LLaMA 2 聊天機器人

使用 PyTorch FSDP 高效微調 Llama 2 70B：解決 CPU 記憶體不足的實務指南★ 72

Hugging Face SafeCoder 對決閉源程式碼助手：企業級私有化部署的安全性優勢★ 70