OpenAI
OpenAI
/gpt-oss-120b

Quantizations

QuantQuantized bySizeDecodePrefillScoreActions
MLX Community
MLX Community
61.3 GB84.0 tok/s820.2 tok/sRuns ok

Device Comparison

Results include trials with 4,096 input tokens and 1,024 output tokens only.

Decode / Prefill Speeds

1 device