whatcani.run
Qwen3-1.7B
Parameters: 1.7B
Released: April 29, 2025
License: Apache 2.0
Runs
View all benchmark runs for this model family.
Device    CPU cores  GPU cores  Memory  Quant  Runtime        Decode (tok/s)  Prefill (tok/s)  Peak memory   Date
M3 Ultra  28         60         256 GB  4bit   mlx_lm 0.31.1  251.1           4,899.4          2.96 GB (1%)  7 days ago
M5 Pro    18         20         64 GB   4bit   mlx_lm 0.31.2  150.9           7,366.3          2.84 GB (4%)  2 weeks ago
M5 Pro    18         20         64 GB   4bit   mlx_lm 0.31.2  146.3           7,319.5          2.78 GB (4%)  2 weeks ago
M1 Max    10         32         64 GB   4bit   mlx_lm 0.31.0  113.5           1,053.3          2.58 GB (4%)  3 weeks ago
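
To make the throughput figures easier to interpret, here is a minimal Python sketch that turns the decode and prefill rates above into a rough end-to-end time for a request. The `Run` class, the `estimate_seconds` and `memory_share_percent` helpers, and the example workload of 1,000 prompt tokens and 200 output tokens are all hypothetical and not part of whatcani.run. The estimate assumes prefill and decode rates stay constant regardless of context length, which real runs will not strictly follow. The memory-share helper assumes the percentage shown next to peak memory is peak memory divided by total device memory, which matches the listed values after rounding.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    """One benchmark row, copied from the listing above."""
    device: str
    memory_gb: int      # total device memory
    decode_tps: float   # decode throughput, tokens per second
    prefill_tps: float  # prefill throughput, tokens per second
    peak_gb: float      # peak memory used during the run


RUNS = [
    Run("M3 Ultra", 256, 251.1, 4899.4, 2.96),
    Run("M5 Pro", 64, 150.9, 7366.3, 2.84),
    Run("M5 Pro", 64, 146.3, 7319.5, 2.78),
    Run("M1 Max", 64, 113.5, 1053.3, 2.58),
]


def estimate_seconds(run: Run, prompt_tokens: int, output_tokens: int) -> float:
    """Rough request time: prompt processing plus token generation.

    Assumes constant throughput (an idealization, not a measured result).
    """
    return prompt_tokens / run.prefill_tps + output_tokens / run.decode_tps


def memory_share_percent(run: Run) -> float:
    """Peak memory as a percentage of total device memory (assumed meaning)."""
    return 100 * run.peak_gb / run.memory_gb


if __name__ == "__main__":
    # Hypothetical workload: 1,000 prompt tokens, 200 generated tokens.
    for run in RUNS:
        seconds = estimate_seconds(run, prompt_tokens=1000, output_tokens=200)
        share = memory_share_percent(run)
        print(f"{run.device:<9} ~{seconds:.2f} s, {share:.1f}% of memory")
```

Under these assumptions, a higher prefill rate matters most for long prompts and a higher decode rate matters most for long outputs, so the ranking of devices can change with the shape of the workload.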