Latest in AI

TextQuests: How Well Do LLMs Really Perform in Text Adventure Games? Hugging Face Launches a New Evaluation Benchmark ★ 75
Hugging Face Blog · 350 days ago · Release
Hugging Face has recently introduced a new benchmark called "TextQuests," designed to evaluate the performance of large language models (LLMs) in text-based…
Google DeepMind Unveils Its Vision for a Universal AI Assistant: Extending Gemini into a "World Model" with Planning and Simulation Capabilities ★ 85
Google DeepMind Blog · 434 days ago · Opinion
Google DeepMind recently published its latest vision for building a "Universal AI Assistant." In this blueprint, the core technical evolution lies in extending…