Qwen3.6-35B-A3B

Qwen3.6-35B-A3B is an open-weight LLM) from the Qwen family (Alibaba Cloud), released in April 2026. Following the Qwen MoE naming convention, the model has 35B total parameters with ~3B active per token (the `A3B` suffix), making it runnable on consumer hardware via quantization.

Canonical version: Qwen3.6-35B-A3B.

Qwen3.6-35B-A3B is an open-weight LLM from the Qwen family (Alibaba Cloud), released in April 2026. Following the Qwen MoE naming convention, the model has 35B total parameters with ~3B active per token (the A3B suffix), making it runnable on consumer hardware via quantization.

Architecture

  • Mixture-of-experts (MoE) design — 35B total parameters, ~3B active per token
  • Open-weight release under the standard Qwen licensing terms
  • Distributed in GGUF format via HuggingFace (ecosystem forks by Unsloth and others)
  • Typical quantized builds (Q4_K_S) land around 21 GB, suitable for consumer Apple Silicon and mid-range GPUs

Running it locally

  • Simon Willison documented running Qwen3.6-35B-A3B-UD-Q4_K_S.gguf (Unsloth Dynamic, 20.9 GB) on a MacBook Pro M5 via LM Studio
  • The low active-parameter count (3B) keeps inference fast despite the larger on-disk footprint
  • A good fit for the AI Expert Offloading strategy on hybrid CPU/GPU setups

Notable reception

  • Simon Willison's tongue-in-cheek "pelican riding a bicycle" SVG benchmark: Qwen3.6-35B-A3B produced a bicycle frame with correct geometry and a recognizable pelican, while Claude Opus 4.7 failed to render the frame in two attempts (including thinking_level: max)
  • The follow-up "flamingo on a unicycle" test also favored Qwen, which even added inline SVG comments like Sunglasses on flamingo!
  • Simon explicitly cautions: the pelican test is intentionally absurd and this is not evidence that a 21 GB quantized model is broadly more capable than Anthropic's flagship; narrow benchmarks are easy to over-read

Why it matters

  • Demonstrates that a small-active-parameter MoE, quantized to ~21 GB, can match or beat frontier proprietary models on specific creative/spatial tasks
  • Reinforces the case for local, open-weight models as a practical complement to API-only frontier models
  • Fits the broader pattern of Qwen shipping competitive open-weight releases on a fast cadence

References


About Sébastien

I'm Sébastien Dubois, and I'm on a mission to help knowledge workers escape information overload. After 20+ years in IT and seeing too many brilliant minds drowning in digital chaos, I've decided to help people build systems that actually work. Through the Knowii Community, my courses, products & services and my Website/Newsletter, I share practical and battle-tested systems.

I write about Knowledge Work, Personal Knowledge Management, Note-taking, Lifelong Learning, Personal Organization, Productivity, and more. I also craft lovely digital products and tools.

If you want to follow my work, then become a member and join our community.

Ready to get to the next level?

If you're tired of information overwhelm and ready to build a reliable knowledge system:

Found this valuable? Share it with someone who needs it.

Join 6,000+ readers. Get practical systems for knowledge & AI. Free.

Subscribe ✨

Free: Knowledge System Checklist

A clear roadmap to building your own knowledge system. Subscribe and get it straight to your inbox.

6,000+ readers. No spam. Unsubscribe anytime.