whatcani.run
Home
Models
Device
Runs
Docs
GitHub
whatcani.run
Z.ai
Z.ai
Z.ai
/
GLM-5.1
744B (40B active)
April 7, 2026
MIT
Overview
Runs
Runs
View all benchmark runs for this model family.
Device
Quant
Runtime
Decode
Prefill
Peak memory
Date
Device
Quant
Decode
Actions
1
whatcani.run
Home
Models
Device
Runs
Docs
GitHub
Login
whatcani.run
Device
Quant
Runtime
Decode
Prefill
Peak memory
Date
M3 Ultra
28
60
256 GB
ud-iq2_m
llama.cpp
b8680
15.5
tok/s
115.9
tok/s
207.46
GB
81%
4 days ago
Device
Quant
Decode
Actions
M3 Ultra
28
60
256 GB
ud-iq2_m
15.5
tok/s
1