whatcani.run
Qwen / Qwen3-0.6B
0.6B · April 29, 2025 · Apache 2.0
Runs
View all benchmark runs for this model family.
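| Device                               | Quant | Runtime         | Decode (tok/s) | Prefill (tok/s) | Peak memory  | Date        |
|--------------------------------------|-------|-----------------|----------------|-----------------|--------------|-------------|
| M1 Max, 10 CPU / 32 GPU cores, 64 GB | 4bit  | mlx_lm 0.31.1   | 163.6          | 2,279.1         | 2.13 GB (3%) | 7 days ago  |
| M1 Max, 10 CPU / 32 GPU cores, 64 GB | q8_0  | llama.cpp b8240 | 98.7           | 2,281.0         | 1.26 GB (2%) | 3 weeks ago |
| M1 Max, 10 CPU / 32 GPU cores, 64 GB | 4bit  | mlx_lm 0.31.0   | 151.6          | 1,895.0         | 2.13 GB (3%) | 3 weeks ago |
| M1 Max, 10 CPU / 32 GPU cores, 64 GB | q8_0  | llama.cpp b8240 | 119.2          | 3,233.4         | 1.26 GB (2%) | 3 weeks ago |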
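
The percentage shown next to each peak-memory figure appears to be that figure as a share of the device's 64 GB of memory. That reading is an assumption, not something the page states. A minimal sketch of the arithmetic, using the four runs listed above (the names `RUNS`, `memory_share_percent`, and `ms_per_token` are hypothetical helpers for illustration):

```python
# Values copied from the runs listed above:
# (quant, runtime, decode tok/s, prefill tok/s, peak memory in GB)
RUNS = [
    ("4bit", "mlx_lm 0.31.1", 163.6, 2279.1, 2.13),
    ("q8_0", "llama.cpp b8240", 98.7, 2281.0, 1.26),
    ("4bit", "mlx_lm 0.31.0", 151.6, 1895.0, 2.13),
    ("q8_0", "llama.cpp b8240", 119.2, 3233.4, 1.26),
]

DEVICE_MEMORY_GB = 64  # M1 Max configuration shown for every run


def memory_share_percent(peak_gb, device_gb=DEVICE_MEMORY_GB):
    """Peak memory as a whole-number percentage of device memory (assumed meaning)."""
    return round(100 * peak_gb / device_gb)


def ms_per_token(tok_per_s):
    """Convert a throughput in tokens/second to latency in milliseconds/token."""
    return 1000.0 / tok_per_s


if __name__ == "__main__":
    for quant, runtime, decode, prefill, peak_gb in RUNS:
        print(
            f"{quant:5} {runtime:16} "
            f"decode {ms_per_token(decode):5.2f} ms/tok  "
            f"memory {memory_share_percent(peak_gb)}%"
        )
```

Under this assumption, 2.13 GB and 1.26 GB round to the 3% and 2% shown in the listing.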