Latest in AI

Showing: distillation, Researchers

Distillation Panic: Why Calling "Knowledge Distillation" a Security Attack Is a Terrible Trend ★ 75
Interconnects (Nathan L.) · 85 days ago · Opinion
In the field of machine learning, "knowledge distillation" is a well-established technique that generally refers to using the output data generated by a…
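How Much Does Knowledge Distillation Really Matter for Chinese Large Language Models (LLMs)? A Response to Anthropic's "Distillation Attacks" View ★ 75
Interconnects (Nathan L.) · 154 days ago · Commentary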
Anthropic recently published research on "distillation attacks," defining the practice of external developers using its API outputs to train other models as a…
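Open Models Stuck in "Perpetual Catch-Up": The Open-Closed Gap, Distillation, Innovation Cycles, and Open Source's Odds of Winning ★ 80
Interconnects (Nathan L.) · 160 days ago · Opinion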
This article by Nathan Lambert takes a deep dive into the tangled competitive dynamics between open-source and closed-source AI models. Lambert argues that…
Apriel-H1: Revealing the Surprising Key to Distilling Efficient Reasoning Models ★ 75
Hugging Face Blog · 251 days ago · Release
With the successive emergence of models with powerful "reasoning" capabilities — such as OpenAI o1, o3, and DeepSeek-R1 — the challenge of reducing the…
Open-R1: Hugging Face Launches a Fully Open-Source Reproduction of DeepSeek-R1 ★ 90
Hugging Face Blog · 546 days ago · Release
Project Background: Recreating the Open-Source Miracle of DeepSeek-R1. The emergence of DeepSeek-R1 sent shockwaves through the global AI community…
Snorkel AI Partners with Hugging Face: Unlocking the Power of Foundation Models for Enterprises
Hugging Face Blog · 1,209 days ago · Business
With the explosion of foundation models and large language models (LLMs), enterprises are eager to incorporate these powerful technologies into real-world…
How Hugging Face Sped Up Transformer Inference 100x for API Customers
Hugging Face Blog · 2,017 days ago · Release
In this technical blog post, the Hugging Face team reveals in detail how they achieved up to 100x speedup in inference for Transformer models for customers of…