Qwen/Qwen3-1.7B

Quantizations

Quant	Quantized by	Size	Decode	Prefill	Score
4bit	MLX Community	923.2 MB	113.5 tok/s	1,053.3 tok/s	Runs great

Results include only trials with 4,096 input tokens and 1,024 output tokens.
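
To put the throughput figures in context, the sketch below converts them into a rough wall-clock estimate for one trial of the stated size. It assumes that prefill and decode run back to back and that each phase holds its reported average rate. The page does not state either of these, and the function name `estimate_trial_seconds` is hypothetical.

```python
def estimate_trial_seconds(input_tokens, output_tokens, prefill_tps, decode_tps):
    """Return (prefill_s, decode_s, total_s) for one trial.

    Assumption: the two phases are sequential and each runs at a
    constant rate equal to the reported average throughput.
    """
    prefill_s = input_tokens / prefill_tps   # time to process the prompt
    decode_s = output_tokens / decode_tps    # time to generate the output
    return prefill_s, decode_s, prefill_s + decode_s


# Figures from the 4bit row and the trial size stated above.
prefill_s, decode_s, total_s = estimate_trial_seconds(
    input_tokens=4096,
    output_tokens=1024,
    prefill_tps=1053.3,
    decode_tps=113.5,
)
print(f"prefill: {prefill_s:.2f} s, decode: {decode_s:.2f} s, total: {total_s:.2f} s")
```

Under these assumptions decode accounts for most of the time even though it produces a quarter as many tokens as the prompt contains, because the decode rate is roughly nine times lower than the prefill rate.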

3 devices