whatcani.run
NVIDIA
Nemotron 3 Super
120B (12B active)
March 11, 2026
NVIDIA Nemotron Open Model License
Runs
View all benchmark runs for this model family.
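| Device | Quant | Runtime | Decode | Prefill | Peak memory | Date |
| --- | --- | --- | --- | --- | --- | --- |
| M4 Max (16-core CPU, 40-core GPU, 128 GB) | 3bit | mlx_lm 0.31.2 | 43.3 tok/s | 322.1 tok/s | 61.00 GB (48%) | 2 weeks ago |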
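The figures in this run can be related with a little arithmetic. The sketch below is illustrative only: it assumes the 48% shown next to peak memory is peak memory as a share of the device's 128 GB, and it assumes total response time can be roughly approximated as prompt tokens divided by prefill speed plus output tokens divided by decode speed. The function names `memory_share` and `estimated_seconds` are hypothetical and not part of whatcani.run or mlx_lm.

```python
# Figures copied from the M4 Max / 3bit run listed above.
PEAK_MEMORY_GB = 61.00
DEVICE_MEMORY_GB = 128      # assumption: the denominator behind the 48% figure
PREFILL_TOK_S = 322.1
DECODE_TOK_S = 43.3


def memory_share(peak_gb: float, total_gb: float) -> int:
    """Return peak memory as a whole-number percentage of total memory."""
    return round(100 * peak_gb / total_gb)


def estimated_seconds(prompt_tokens: int, output_tokens: int) -> float:
    """Rough latency estimate: prefill time plus decode time.

    Assumes both throughputs stay constant, which real runs may not honor
    (speeds typically vary with context length).
    """
    return prompt_tokens / PREFILL_TOK_S + output_tokens / DECODE_TOK_S


if __name__ == "__main__":
    print(f"Peak memory share: {memory_share(PEAK_MEMORY_GB, DEVICE_MEMORY_GB)}%")
    print(f"Estimated time for 1000 in / 200 out: {estimated_seconds(1000, 200):.1f} s")
```

Under these assumptions the memory share matches the 48% shown in the run, which suggests the percentage is measured against total device memory rather than against the model's weights.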