Google / Gemma 4 E2B IT

Quantizations

| Quantized by | Size | Decode | Prefill | Score |
| --- | --- | --- | --- | --- |
| Unsloth | 2.8 GB | N/A | N/A | N/A |
| Unsloth | 2.9 GB | 33.1 tok/s | 420.8 tok/s | Runs ok |
| MLX Community | 3.3 GB | N/A | N/A | N/A |
| MLX Community | 3.9 GB | 68.1 tok/s | 1,759.4 tok/s | Runs great |
| MLX Community | 4.0 GB | 65.5 tok/s | 1,876.4 tok/s | Runs great |
| Unsloth | 4.8 GB | N/A | N/A | N/A |
| MLX Community | 5.5 GB | 68.1 tok/s | 2,159.2 tok/s | Runs great |

Device Comparison

Results include only trials with 4,096 input tokens and 1,024 output tokens.
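
To make the throughput figures more tangible, here is a rough sketch that converts a prefill and decode rate into an estimated wall-clock time for one trial of this shape (4,096 input tokens, 1,024 output tokens). It assumes prefill and decode run back to back at constant rates, and it ignores model load time and other overhead, so treat the output as a ballpark only. The function name `estimate_seconds` and the `MEASURED` list are illustrative, not part of the site; the numbers are copied from the quantization table.

```python
# Rough latency estimate from throughput figures.
# Assumption: total time = prefill time + decode time, at constant rates.

def estimate_seconds(prefill_tok_s, decode_tok_s,
                     input_tokens=4096, output_tokens=1024):
    """Return estimated seconds to process the prompt and generate the output."""
    if prefill_tok_s <= 0 or decode_tok_s <= 0:
        raise ValueError("throughput values must be positive")
    prefill_time = input_tokens / prefill_tok_s   # time to read the prompt
    decode_time = output_tokens / decode_tok_s    # time to generate the reply
    return prefill_time + decode_time


# (quantized by, size in GB, decode tok/s, prefill tok/s), measured rows only.
MEASURED = [
    ("Unsloth", 2.9, 33.1, 420.8),
    ("MLX Community", 3.9, 68.1, 1759.4),
    ("MLX Community", 4.0, 65.5, 1876.4),
    ("MLX Community", 5.5, 68.1, 2159.2),
]

if __name__ == "__main__":
    for source, size_gb, decode, prefill in MEASURED:
        seconds = estimate_seconds(prefill, decode)
        print(f"{source:<14} {size_gb:>4.1f} GB  ~{seconds:5.1f} s per trial")
```

Under these assumptions, decode time dominates for every measured row, because each trial generates 1,024 tokens at a rate far below the prefill rate.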

Decode / Prefill Speeds

24 devices
Gemma 4 E2B IT by Google | whatcani.run