Jackrong
Jackrong
/Qwopus-GLM-18B-Merged

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
Jackrong
Jackrong
9.2 GB6.5 tok/s72.1 tok/sRuns poorly

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

2 devices