Qwen
Qwen
/Qwen3-0.6B

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
MLX Community
MLX Community
319.9 MB154.6 tok/s1,991.0 tok/sRuns great
Unsloth
Unsloth
609.8 MB108.9 tok/s2,757.2 tok/sRuns great

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

2 devices