Each loop iteration is the functional equivalent of one step of chain-of-thought, but operating in continuous latent space rather than token space. A looped model running T loops implicitly simulates T steps of CoT reasoning. This has been formally proven (Saunshi et al., 2025).
这一发现令人惊讶,因为它表明循环模型在连续潜在空间中实现了思维链推理,而不需要显式的中间token输出。这意味着模型可以在单次前向传播中完成多步推理,这挑战了传统思维链需要显式token输出的观点,并为理解大模型推理过程提供了新视角。