Unsloth Gemma 4 QAT MTP assistant models now available

Unsloth has published Gemma 4 QAT MTP assistant GGUF models on Hugging Face.

A r/LocalLLaMA post notes that Unsloth’s Gemma 4 QAT MTP assistant models are now available in GGUF format. The root directories include q8_0 files named mtp-gemma-4-*.gguf, while MTP folders contain q8_0 and larger quantized variants. The listed releases cover 12B, 26B-A4B, 31B, E2B, E2B mobile, E4B, and E4B mobile it-qat-GGUF repositories.

This r/LocalLLaMA post is a brief model release notice, focusing on the fact that Unsloth has made GGUF files for the Gemma 4 QAT MTP assistant models available on Hugging Face. The original post explains that q8_0 versions can be found at the root level of the corresponding model directories, with filenames in the form mtp-gemma-4-*.gguf; meanwhile, each directory also contains an MTP folder that includes q8_0 and larger quantized versions. This means users can not only directly fetch the q8_0 file from the root directory, but also enter the MTP subfolder to choose other quantization sizes.

Full summary

Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.

See Pro plans →

Summaries are AI-generated; the original article is authoritative.