Qwen
Qwen
/Qwen3-1.7B

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
MLX Community
MLX Community
923.2 MB113.5 tok/s1,053.3 tok/sRuns great

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

3 devices