Latest in AI

Showing:computer-visionStudentsClear ×

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

From ICRA to CVPR: What Is the Robotics Community Talking About? | Beijing Wednesday Evening
量子位 QbitAI42 days agoCommentary
QbitAI hosts a Beijing Wednesday-evening meetup tracing the key conversations in the robotics research community from ICRA to CVPR. The event format — common in China's academic tech circuit — brings together researchers and engineers to unpack conference highlights, emerging trends, and cross-disciplinary intersections. No specific paper or product is the focus; the value is in aggregated community signal across two flagship venues.
CVPR 2026 Highlights Guangdong as He Kaiming and GDUT Team Stand Out★ 76
量子位 QbitAI50 days agoPaper
CVPR 2026 named Google DeepMind’s D4RT as Best Paper for fast dynamic 4D scene reconstruction from video. Honorable mentions included Meta’s SAM 3D and NVIDIA’s NitroGen, while TRELLIS.2 won Best Student Paper. The article emphasizes Chinese researcher visibility, ResNet and YOLO receiving the Longuet-Higgins Prize, and a GDUT-led undergraduate-heavy ChordEdit team breaking through among major labs and elite universities.
Google 推出 PaliGemma：結合 SigLIP 與 Gemma 的開源視覺語言模型★ 80
Hugging Face Blog805 days agoRelease
Google has officially launched PaliGemma, a powerful yet lightweight open-source Vision-Language Model (VLM). The release of PaliGemma represents a significant…
視覺語言模型（VLM）原理解析：從架構、訓練到應用指南★ 80
Hugging Face Blog838 days agoTutorial
This technical blog post published by Hugging Face provides an accessible yet thorough breakdown of the core principles and applications of Vision Language…
深入探討視覺語言模型 (Vision-Language Models) 的原理與架構★ 80
Hugging Face Blog1,271 days agoTutorial
This is a classic technical guide written by the Hugging Face team, designed to help developers and researchers gain a deep understanding of how…
詳解擴散模型：The Annotated Diffusion Model 程式碼與原理實戰指南★ 85
Hugging Face Blog1,512 days agoTutorial
This classic blog post from Hugging Face, "The Annotated Diffusion Model," is an essential guide for learning about generative AI image synthesis. Modeled…
使用 🤗 Transformers 微調 ViT 進行影像分類教學★ 70
Hugging Face Blog1,628 days agoTutorial
This is an official tutorial article from Hugging Face that guides developers on how to fine-tune a Vision Transformer (ViT) model for image classification…