Qwen
Qwen
/Qwen3-32B

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
MLX Community
MLX Community
17.2 GB29.6 tok/s255.3 tok/sRuns ok

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

2 devices