whatcani.run
Home
Models
Device
Runs
Docs
GitHub
whatcani.run
Meta
Meta
Meta
/
Llama 3.2 3B Instruct
3B
September 25, 2024
Llama 3.2 Community License
Share
Overview
Runs
Runs
View all benchmark runs for this model family.
Device
Quant
Runtime
Decode
Prefill
Peak memory
Date
Actions
Device
Quant
Decode
Actions
1
whatcani.run
Home
Models
Device
Runs
Docs
GitHub
Login
whatcani.run
Device
Quant
Runtime
Decode
Prefill
Peak memory
Date
Actions
M4
10
10
32 GB
4bit
mlx_lm
0.31.2
30.8
tok/s
305.3
tok/s
3.38
GB
11%
11 hours ago
M5 Pro
15
16
48 GB
4bit
mlx_lm
0.31.2
106.3
tok/s
3,225.0
tok/s
3.75
GB
8%
4 days ago
M4 Max
16
40
128 GB
4bit
mlx_lm
0.31.2
160.1
tok/s
1,548.1
tok/s
3.64
GB
3%
2 weeks ago
M1 Max
10
32
64 GB
4bit
mlx_lm
0.31.0
68.4
tok/s
652.0
tok/s
3.43
GB
5%
4 weeks ago
Device
Quant
Decode
Actions
M4
10
10
32 GB
4bit
30.8
tok/s
M5 Pro
15
16
48 GB
4bit
106.3
tok/s
M4 Max
16
40
128 GB
4bit
160.1
tok/s
M1 Max
10
32
64 GB
4bit
68.4
tok/s
1