whatcani.run
Home
Models
Device
Runs
Docs
GitHub
whatcani.run
Jackrong
Jackrong
Jackrong
/
Qwen3.5-9B (GLM-5.1 Distill) v1
9B
April 15, 2026
Apache 2.0
Share
Overview
Runs
Runs
View all benchmark runs for this model family.
Device
Quant
Runtime
Decode
Prefill
Peak memory
Date
Actions
Device
Quant
Decode
Actions
1
whatcani.run
Home
Models
Device
Runs
Docs
GitHub
Login
whatcani.run
Device
Quant
Runtime
Decode
Prefill
Peak memory
Date
Actions
M4
10
10
24 GB
q8_0
llama.cpp
b8680
7.5
tok/s
131.5
tok/s
0.84
GB
3%
4 weeks ago
GeForce RTX 5070 Ti
16 GB
q8_0
llama.cpp
b8849
73.8
tok/s
5,199.5
tok/s
9.22
GB
40%
1 month ago
Device
Quant
Decode
Actions
M4
10
10
24 GB
q8_0
7.5
tok/s
GeForce RTX 5070 Ti
16 GB
q8_0
73.8
tok/s
1