§ 07Benchmarks
The lines that diverge.
Throughput on dense FP16 matrix-multiply, normalised to a 2008-era data center accelerator (FP32 baseline projected to FP16 for comparability). Note the log scale: the CPU has roughly tripled, the accelerator has improved by three orders of magnitude.
2008 → 2026
22×
CPU improvement2008 → 2026
980×
accelerator improvement