Asthenosphere
================================================================ ASTHENOSPHERE NPU INFERENCE METRICS Hardware: Device: AMD Phoenix XDNA gen1 (AIE2) Tiles: 12/12 (complete transformer pipeline) Devi...

Source: DEV Community
================================================================ ASTHENOSPHERE NPU INFERENCE METRICS Hardware: Device: AMD Phoenix XDNA gen1 (AIE2) Tiles: 12/12 (complete transformer pipeline) Device ID: /dev/accel/accel0 Status: ACTIVE Reliability: 100% Pipeline: PreScale > Q proj > RoPE > Attention > O proj > Attn ResAdd PreScale2 > Gate+SiLU+Up > EltMul > Down > FFN ResAdd > Score Head 14 ops, zero CPU/GPU during NPU compute SESSION AVERAGES (7 messages) Avg tokens/msg: 64.7 Avg elapsed/msg: 83ms Avg eff tok/s: 3866 Avg acceptance: 91.8% Avg cost/msg: 21.3 Motes ALL-TIME AVERAGES (7 messages) Avg tokens/msg: 64.7 Avg elapsed/msg: 83ms Avg eff tok/s: 3866 Avg acceptance: 91.8% Avg cost/msg: 21.3 Motes PER-DISPATCH LOG (7 entries) Time Tokens Dispatches Elapsed Eff tok/s Accept% Motes 16:31:41 65 12 5.4ms 11970 86% 6 16:31:38 65 12 134ms 485 87% 31 16:31:00 65 12 146.4ms 444 88% 33 16:30:48 65 12 147.6ms 440 90% 33 16:30:05 65 12 12.1ms 5356 93% 9 16:29:5