Latest in AI

Showing:rlDevelopersClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Kimina-Prover：在大型形式化推理模型中應用測試時強化學習搜尋 (Test-time RL Search)★ 82
Hugging Face Blog383 days agoRelease
Hugging Face's AI-MO (AI Math Olympiad) team has officially published Kimina-Prover, a research paper demonstrating how "test-time reinforcement learning…
DeepCoder：Together 與 Agentica 推出達到 o3-mini 水準的 14B 完全開源程式碼推理模型★ 85
TLDR AI (Buttondown)474 days agoRelease
After DeepSeek R1 set off a wave of open-source reasoning models, the open-source community saw many projects attempting to replicate its path to success…