Latest in AI

Showing: post-training

Frontier Post-Training Recipe Review with Finbarr Timbers
Interconnects (Nathan L.) · 42 days ago · Commentary
In the 18th installment of his interview series, Interconnects author Nathan Lambert speaks with Finbarr Timbers about the post-training techniques used at frontier AI labs. The conversation covers the methods that shape model behavior after pretraining, including supervised fine-tuning, reinforcement learning from human feedback, and preference optimization, and offers a practitioner's perspective on alignment and capability tuning at scale.
Mid-2026 Predictions: My Bets on Open-Source AI Models and an Analysis of the Open-Closed Gap ★ 75
Interconnects (Nathan L.) · 103 days ago · Opinion
In this forward-looking article on the state of AI in mid-2026, Interconnects founder Nathan Lambert takes a deep dive into the dynamic gap between open-weight…
Nathan Lambert's Latest Updates: ATOM Report, Post-Training Course, New Book, and Ongoing AI Research ★ 70
Interconnects (Nathan L.) · 104 days ago · Release
Nathan Lambert, a prominent AI expert, former Alignment Scientist at Hugging Face, and founder of the popular newsletter Interconnects, recently wrote about…
Hugging Face Releases TRL v1.0: An Open-Source Library Built for Post-Training Reaches a New Milestone in API Stability and Efficient Alignment ★ 85
Hugging Face Blog · 119 days ago · Release
Hugging Face has officially announced the release of TRL (Transformer Reinforcement Learning) v1.0. This is a major milestone, marking TRL's transformation…
AI2 Launches OLMo Hybrid: Exploring Future Hybrid LLM Architectures and the Frontier of Open-Source Post-Training ★ 78
Interconnects (Nathan L.) · 144 days ago · Commentary
As large language models (LLMs) continue to evolve, the traditional pure-Transformer architecture faces physical bottlenecks in computational efficiency and…
Smol2Operator: A Post-Training Guide and Models for Lightweight GUI Agents for Computer Use ★ 80
Hugging Face Blog · 308 days ago · Release
Background and Challenge: The Rise of Local "Computer Use". With Anthropic's introduction of Computer Use and the development of various OS-level agents…