Nathan L. says this was his final week at the Allen Institute for AI (Ai2). He highlights the privilege of working on the Olmo models and describes the role as a period of growth and learning. The brief farewell post does not provide a reason for leaving, future plans, or details about any impact on Olmo development.
Allen Institute for AI (AI2) 近期推出 OLMo Hybrid 模型,引發對未來 LLM 架構的廣泛討論。本文深入分析混合架構(如結合 Transformer 與狀態空間模型 SSM/Mamba)在提升效率與長文本處理上的潛力。同時,探討了開源社群在後訓練(Post-training)工具上的最新進展,指出開源生態正逐步縮小與閉源頂尖模型在對齊與強化學習上的差距。