r/LocalLLaMA top dayJun 7, 2026, 3:13 PM/u/rolznz
Qwen3.6 35B-A3B on a Laptop: A Local LLM "Zero to One" Milestone
Original: Qwen3.6 35B-A3B on a Laptop: My Zero to One Moment
A user shares how running Qwen3.6 35B-A3B locally on an RTX 4060 laptop achieved up to 27 TPS, creating a private "second brain."
A Reddit user detailed running Qwen3.6 35B-A3B (IQ3_XXS quantization) on an ASUS Zenbook Pro 14 (RTX 4060 8GB VRAM, 64GB RAM). Using llama.cpp, they achieved 27 TPS at 32k context and 18 TPS at 256k context. This setup serves as a highly capable, fully private local agent for file operations, CLI execution, and brainstorming, bypassing cloud privacy concerns.
想看英文原文 / 完整內容?
前往 r/LocalLLaMA top day 原文 →摘要由 AI 整理,以原文為準。