Latest in AI

Unlocking the Potential of Vision Language Models on Satellite Imagery Through Fine-Tuning
Mistral AI News | 40 days ago | Tutorial
Mistral AI publishes a technical guide on adapting vision language models (VLMs) for satellite imagery analysis through fine-tuning. General-purpose VLMs underperform on remote-sensing data because of a domain gap: specialized vocabulary, a top-down perspective, and scale variation. Fine-tuning on curated geospatial datasets is presented as the practical path to closing that gap for real-world deployment.
Beyond LoRA: Can You Beat the Most Popular Fine-Tuning Technique?
Hugging Face Blog | 40 days ago | Benchmark
Hugging Face's PEFT team benchmarks alternatives to LoRA — the dominant parameter-efficient fine-tuning method — asking whether newer techniques can match or surpass it in practice. The post evaluates candidates such as DoRA, LoRA+, AdaLoRA, and IA³ across task performance, memory footprint, and training speed within the unified PEFT library framework. Rather than declaring a single winner, the piece delivers a practical guide for choosing the right technique based on model size, task type, and resource constraints.
SenseNova U1 Adds an Infographic-Specific Fine-Tune
r/LocalLLaMA (top day) | 48 days ago | Release
A Reddit post highlights a new infographic-specific fine-tune for SenseNova U1-8B-MoT, trained with an extended multi-task phase for structured visual output. The reported benchmarks show large gains in IGenBench infographic accuracy and chart understanding, with a smaller improvement in text rendering. Aesthetic score appears roughly unchanged, suggesting the update mainly improves information structure and visual reasoning rather than overall visual polish.
NeuroBait: I fine-tuned a model to spark dopamine for ADHD brain
Hugging Face Blog | 49 days ago | New Tool
NeuroBait is a Hugging Face community project built to help with ADHD task-initiation freeze rather than diagnosis or to-do planning. It fine-tunes google/gemma-3-12b-it with LoRA to produce short, warm, context-aware nudges. The project uses Unsloth and Modal for training, then deploys on a Hugging Face Space with Gradio, transformers, peft, and a runtime LoRA adapter.
Unlocking VLM Potential on Satellite Imagery Through Fine-Tuning
Mistral AI News | 50 days ago | Tutorial
Mistral AI demonstrates how LoRA fine-tuning adapts Pixtral-12B to satellite imagery, a specialized visual domain where prompting alone is unreliable. Using the Aerial Image Dataset, the post compares a prompt-based baseline against a fine-tuned model across 30 scene classes. Accuracy rose from 0.56 to 0.91, while invalid label hallucinations dropped from 5% to 0.1%.
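As a rough illustration of the two figures quoted in this summary, the sketch below computes accuracy and an invalid-label rate for a classifier that must answer from a closed label set. It is not the evaluation code from the Mistral post: the function name `score_predictions`, the three toy labels, and the four sample predictions are all made up for the example.

```python
from typing import Dict, Sequence, Set


def score_predictions(
    predictions: Sequence[str],
    gold: Sequence[str],
    valid_labels: Set[str],
) -> Dict[str, float]:
    """Return accuracy and the share of predictions outside the label set.

    Hypothetical helper for illustration; an "invalid" prediction here is
    any output string that is not one of the allowed class names.
    """
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        raise ValueError("at least one example is required")

    total = len(gold)
    correct = sum(p == g for p, g in zip(predictions, gold))
    invalid = sum(p not in valid_labels for p in predictions)
    return {"accuracy": correct / total, "invalid_rate": invalid / total}


# Toy data (assumed, not taken from the Aerial Image Dataset).
labels = {"airport", "beach", "forest"}
gold = ["airport", "beach", "forest", "beach"]
preds = ["airport", "beach", "volcano", "forest"]  # "volcano" is not a valid label

metrics = score_predictions(preds, gold, labels)
print(metrics)
```

Tracking the invalid-label rate separately from accuracy is useful because a wrong but valid label and a label the model invented are different failure modes.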
Fine-tuning an LLM to write docs like it's 1995
Hacker News (AI keywords) | 53 days ago | Tutorial
The author builds a corpus from old Microsoft manuals, cleans OCR text, generates instruction-style JSONL examples, and fine-tunes Llama 3.1 8B and Qwen 2.5 7B with QLoRA. Tests cover malloc(), a fictional Win32 API, and a deliberately anachronistic REST API prompt. Qwen fine-tunes transfer the period documentation style best, but the experiment also shows hallucination risks, tuning complexity, and why these models augment rather than replace technical writers.
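The "instruction-style JSONL" step mentioned above can be sketched roughly as follows. This is a guess at the general shape, not the author's pipeline: the `instruction` / `input` / `output` field names follow a common convention and are an assumption, the helper names are hypothetical, and the two sample passages are invented placeholders rather than text from any real manual.

```python
import json
from typing import Dict, Iterable, List, Tuple


def to_instruction_record(topic: str, passage: str) -> Dict[str, str]:
    """Wrap one cleaned passage as an instruction-tuning example.

    The field names are an assumed convention, not confirmed by the post.
    """
    return {
        "instruction": f"Write reference documentation for {topic} "
                       "in the style of a 1990s software manual.",
        "input": "",
        "output": passage.strip(),
    }


def to_jsonl(records: Iterable[Dict[str, str]]) -> str:
    """Serialize records as JSON Lines: one JSON object per line."""
    return "\n".join(json.dumps(r, ensure_ascii=False) for r in records)


# Placeholder passages standing in for cleaned OCR text (invented for the example).
sections: List[Tuple[str, str]] = [
    ("malloc()", "  Allocates a block of memory.\nReturns a pointer to the block.  "),
    ("free()", "Releases a block of memory that was previously allocated."),
]

records = [to_instruction_record(topic, text) for topic, text in sections]
jsonl_text = to_jsonl(records)
print(jsonl_text)
```

Because `json.dumps` escapes embedded newlines, a multi-line passage still occupies a single line in the output, which is what JSONL readers expect.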
How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent
Hugging Face Blog | 54 days ago | Tutorial
This Hugging Face Blog post appears to be a practical tutorial for fine-tuning NVIDIA Nemotron 3.5 ASR. Based on the title, it focuses on adapting speech recognition to a target language, specialized domain, or accent. The original text was not provided, so implementation details, datasets, commands, metrics, and hardware requirements cannot be confirmed.
Direct Preference Optimization Beyond Chatbots
Hugging Face Blog | 55 days ago | Tutorial
Based only on the title, this Hugging Face Blog post appears to discuss Direct Preference Optimization outside conventional chatbot use cases. It may frame DPO as a broader preference-alignment method for model outputs, workflows, or non-conversational AI systems. Without the full article, specific claims about experiments, datasets, models, or implementation details cannot be verified.
LLMs believe false statements even after explicit warnings that they're false ★ 74
Ars Technica AI | 60 days ago | Paper
A new study describes “Negation Neglect,” where LLMs fine-tuned on documents that explicitly mark claims as false still learn the claims as true. Experiments with fabricated statements found models often absorb entity-event associations more strongly than surrounding warnings or negations. The finding raises concerns for fine-tuning pipelines, misinformation handling, and AI safety datasets that include harmful or false content with disclaimers.
Specialization Beats Scale: The Key Strategic Variable Most AI Procurement Decisions Overlook ★ 75
Hugging Face Blog | 67 days ago | Opinion
In the current wave of enterprise AI adoption, most decision-makers fall into the "scale myth" when making AI procurement decisions — the belief that the…
[AINews] The End of Fine-Tuning? Exploring the Future and Transformation of Fine-Tuning in the Era of Large Models ★ 75
Latent Space | 76 days ago | Opinion
As AI technology continues to iterate at a rapid pace, the developer community is confronting a profound rethinking of the question: "Is fine-tuning heading…
Training and Fine-Tuning Multimodal Embedding and Reranker Models with Sentence Transformers ★ 80
Hugging Face Blog | 103 days ago | Tutorial
As multimodal AI has become widespread, integrating data from different modalities — text, images, and more — into a single vector space and performing…
Hugging Face Releases TRL v1.0: An Open-Source Library Built for Post-Training Reaches a New Milestone in API Stability and Efficient Alignment ★ 85
Hugging Face Blog | 119 days ago | Release
Hugging Face has officially announced the release of TRL (Transformer Reinforcement Learning) v1.0. This is a major milestone, marking TRL's transformation…
Build a Domain-Specific Embedding Model in One Day: A Hands-On Guide from Hugging Face and NVIDIA ★ 80
Hugging Face Blog | 129 days ago | Tutorial
When building Retrieval-Augmented Generation (RAG) systems, general-purpose embedding models (such as those from OpenAI or common open-source alternatives)…
Train AI Models for Free! Hugging Face and Unsloth Launch a Free Fine-Tuning Service on Hugging Face Jobs ★ 85
Hugging Face Blog | 158 days ago | New Tool
Hugging Face's official blog has announced exciting news for the open-source AI community: Hugging Face has formed a deep partnership with Unsloth — the…
We Had Claude Write CUDA Kernels and Teach Open-Source Models! Hugging Face Releases the Upskill Project ★ 80
Hugging Face Blog | 181 days ago | Release
Background and challenge: why is CUDA programming so hard for AI? CUDA (Compute Unified Device Architecture) is a parallel computing platform and…
Letting Claude Automatically Fine-Tune Open-Source LLMs: Hugging Face Tests an Agentic Machine Learning Workflow ★ 78
Hugging Face Blog | 236 days ago | Tutorial
Hugging Face recently shared a highly inspiring experiment: how to use Anthropic's Claude (as an AI Agent) to automate the fine-tuning of an open-source large…
Speed Up Hugging Face TRL Fine-Tuning by 20x with RapidFire AI ★ 80
Hugging Face Blog | 249 days ago | Release
The Hugging Face official blog has announced a collaboration with RapidFire AI, bringing a revolutionary performance improvement to its popular TRL…
ServiceNow AI Launches SyGra: A One-Stop Synthetic Data Generation Framework for LLMs and SLMs ★ 75
Hugging Face Blog | 309 days ago | Release
ServiceNow AI recently published a post on the Hugging Face blog introducing a brand-new open-source framework called "SyGra" — a one-stop synthetic data…
Fine-Tune Any Open-Source Large Language Model (LLM) on the Hugging Face Hub with Together AI ★ 75
Hugging Face Blog | 320 days ago | Release
Hugging Face and Together AI have announced a deep partnership, launching a new integration designed to streamline the fine-tuning workflow for open-source…
Jupyter Agents: Training LLMs to Reason and Self-Correct with Notebooks ★ 78
Hugging Face Blog | 321 days ago | Release
Background and core concepts: traditional large language models (LLMs), when faced with complex mathematics, data analysis, or programming tasks, can…
Hugging Face TRL Supports Vision Language Model (VLM) Alignment: Multimodal DPO and ORPO Training Made Easy ★ 80
Hugging Face Blog | 355 days ago | Release
Hugging Face's TRL (Transformer Reinforcement Learning) is a popular open-source library specifically designed for aligning language models (LLMs). In its…
Training and Fine-Tuning Sparse Embedding Models with Sentence Transformers ★ 75
Hugging Face Blog | 392 days ago | Tutorial
This technical blog post from Hugging Face provides a detailed guide on how to train and fine-tune "Sparse Embedding Models" using the Sentence Transformers…
Gemma 3n Fully Arrives in the Open-Source Ecosystem! A Complete Hugging Face Support Guide ★ 80
Hugging Face Blog | 397 days ago | Release
Google's open-source model family welcomes a new member! The all-new Gemma 3n model series is now fully available within the Hugging Face ecosystem. Gemma 3n…
Fine-Tuning FLUX.1-dev on Consumer Hardware: A Guide to Using QLoRA ★ 80
Hugging Face Blog | 404 days ago | Tutorial
FLUX.1-dev is a state-of-the-art open-source text-to-image model with 12 billion parameters (12B), developed by Black Forest Labs. However, due to its enormous…
Featherless AI Officially Joins Hugging Face Inference Providers ★ 75
Hugging Face Blog | 411 days ago | Release
Hugging Face officially announced a partnership with Featherless AI, a serverless GPU inference platform, integrating it into the Hugging Face Inference…
NVIDIA and Hugging Face Together: How to Post-Train Isaac GR00T N1.5 for the LeRobot SO-101 Robot Arm ★ 75
Hugging Face Blog | 411 days ago | Tutorial
As embodied AI develops rapidly, deploying powerful robotics foundation models onto specific hardware has become a key challenge. NVIDIA and Hugging Face have…
Hugging Face and NVIDIA Launch the New "Training Cluster as a Service" ★ 85
Hugging Face Blog | 412 days ago | Release
Hugging Face has announced a new partnership with AI chip giant NVIDIA, launching "Training Cluster as a Service" (TCaaS). The introduction of this service…
No Idle GPUs: Unlocking High-Performance Reinforcement Learning Training with Co-Located vLLM in TRL ★ 85
Hugging Face Blog | 420 days ago | Release
In the reinforcement learning from human feedback (RLHF) training process for large language models — whether PPO or the recently popular GRPO — there are…
🐯 Liger GRPO Meets TRL: Sharply Lower Memory Use and Faster DeepSeek-R1-Style Reinforcement Learning Training ★ 82
Hugging Face Blog | 429 days ago | New Tool
Since the explosive rise of DeepSeek-R1, GRPO (Group Relative Policy Optimization) has become the most widely discussed reinforcement learning (RL) technique…
