chips from different generations running at different speeds still matched the ML performance of single-chip-type training runs, ensuring that even older hardware can meaningfully accelerate AI training.
大多数人认为混合不同代际的硬件进行训练会降低性能或效率,但作者认为即使不同代际、不同速度的芯片混合使用,仍能达到与单一芯片类型训练相同的机器学习性能,这挑战了硬件必须同质化的行业共识。