Google / Gemma 4 E2B

Quantizations

| Quant | Quantized by | Size | Decode | Prefill | Score |
| --- | --- | --- | --- | --- | --- |
| | MLX Community | 3.3 GB | N/A | N/A | N/A |
| | MLX Community | 3.9 GB | N/A | N/A | N/A |
| | MLX Community | 4.0 GB | 70.6 tok/s | 1,965.1 tok/s | Runs great |
| | MLX Community | 4.1 GB | 72.9 tok/s | 2,279.7 tok/s | Runs great |
| | MLX Community | 4.4 GB | N/A | N/A | N/A |
| | MLX Community | 5.5 GB | 68.7 tok/s | 1,939.8 tok/s | Runs great |

Device Comparison

Results include only trials with 4,096 input tokens and 1,024 output tokens.
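As a rough illustration of what these token counts mean in practice, the sketch below estimates wall-clock time for one trial from the decode and prefill speeds listed under Quantizations. It assumes that prefill and decode times simply add up and that no other overhead (model load, sampling, scheduling) applies. It also assumes those listed speeds were measured under the same 4,096 / 1,024 configuration, which the page does not state. The function name `estimate_seconds` is hypothetical.

```python
def estimate_seconds(prefill_tps, decode_tps, input_tokens=4096, output_tokens=1024):
    """Estimate (prefill, decode, total) seconds for one trial.

    Assumption: total time = input_tokens / prefill speed
                           + output_tokens / decode speed,
    with no other overhead. This is a simplification, not the
    benchmark's own methodology.
    """
    prefill_s = input_tokens / prefill_tps
    decode_s = output_tokens / decode_tps
    return prefill_s, decode_s, prefill_s + decode_s


# (size in GB, decode tok/s, prefill tok/s) for the rows that report speeds.
rows = [
    (4.0, 70.6, 1965.1),
    (4.1, 72.9, 2279.7),
    (5.5, 68.7, 1939.8),
]

for size_gb, decode_tps, prefill_tps in rows:
    prefill_s, decode_s, total_s = estimate_seconds(prefill_tps, decode_tps)
    print(
        f"{size_gb} GB: prefill {prefill_s:.2f} s, "
        f"decode {decode_s:.2f} s, total {total_s:.2f} s"
    )
```

Under these assumptions decode dominates: generating 1,024 tokens at roughly 70 tok/s takes several times longer than prefilling 4,096 tokens at roughly 2,000 tok/s, so the decode figure is the more useful one when comparing quantizations for long outputs.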

Decode / Prefill Speeds

7 devices