r/LocalLLaMA top dayJun 7, 2026, 5:29 PM/u/pftbest
Gemma-4-26B-A4B QAT Variant Performs Poorly in llama.cpp Compared to Non-QAT Version
Original: QAT variant of Gemma4 26B A4B is not working well for me
Users report that the QAT (Quantization-Aware Training) variant of Gemma-4-26B-A4B performs worse than the standard version in llama.cpp.
A LocalLLaMA user highlighted that the newly released QAT (Quantization-Aware Training) variant of Google's Gemma-4-26B-A4B model underperforms compared to its non-QAT predecessor. Testing via llama.cpp on a chessboard SVG generation task showed significant rendering errors in the QAT version. The non-QAT GGUF version, however, produced highly accurate results under identical settings.
想看英文原文 / 完整內容?
前往 r/LocalLLaMA top day 原文 →摘要由 AI 整理,以原文為準。