Qwen
Qwen
/Qwen3.5-9B

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
Unsloth
Unsloth
5.0 GBN/AN/AN/A
Unsloth
Unsloth
5.3 GBN/AN/AN/A
MLX Community
MLX Community
5.5 GB65.9 tok/s628.9 tok/sRuns well
MLX Community
MLX Community
5.6 GB60.9 tok/s707.4 tok/sRuns well
Unknown8.9 GBN/AN/AN/A
Unsloth
Unsloth
8.9 GB35.9 tok/s696.6 tok/sRuns ok
Unsloth
Unsloth
16.7 GB21.3 tok/s707.1 tok/sRuns poorly
MLX Community
MLX Community
17.5 GB21.9 tok/s754.4 tok/sRuns poorly

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

66 devices
Qwen3.5-9B by Qwen | whatcani.run