Qwen
Qwen
/Qwen3.5-35B-A3B

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
mudler
mudler
16.1 GBN/AN/AN/A
Elijah McMorris
Elijah McMorris
18.2 GB104.0 tok/s1,461.7 tok/sRuns ok
MLX Community
MLX Community
19.0 GB105.1 tok/s1,412.7 tok/sRuns ok
LM Studio
LM Studio
19.7 GBN/AN/AN/A
Unsloth
Unsloth
20.5 GB60.1 tok/s1,099.2 tok/sRuns ok

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

8 devices