OpenAI
OpenAI/gpt-oss-120b

Quantizations

Quant	Quantized by	Size	Decode	Prefill	Score
4bit	MLX Community	61.3 GB	84.0 tok/s	820.2 tok/s	Runs ok

Results include only trials with 4,096 input tokens and 1,024 output tokens.

1 device