whatcani.run
Google
Gemma 4 31B
30.7B
April 2, 2026
Apache 2.0
Runs
View all benchmark runs for this model family.
Device   CPU / GPU cores   Memory   Quant   Runtime         Decode       Prefill       Peak memory (% of device memory)   Date
M5 Pro   18 / 20           64 GB    mxfp4   mlx_lm 0.31.3   14.6 tok/s   364.6 tok/s   22.00 GB (34%)                     3 weeks ago
M2 Max   12 / 30           32 GB    mxfp4   mlx_lm 0.31.3   10.2 tok/s   70.5 tok/s    22.00 GB (69%)                     3 weeks ago
M4 Max   16 / 40           128 GB   8bit    mlx_lm 0.31.2   12.5 tok/s   154.7 tok/s   37.00 GB (29%)                     2 months ago
M4 Pro   14 / 20           64 GB    8bit    mlx_lm 0.31.2   6.9 tok/s    91.1 tok/s    37.00 GB (58%)                     2 months ago