this self-judgement does not need to be perfectly accurate, as we find ReasoningBank to be quite robust against judgment noise
大多数人认为智能体的自我评估需要高度准确才能有效学习,因为错误的判断会导致错误的记忆形成。但作者认为即使自我判断存在噪声,ReasoningBank仍然能够有效运作,这挑战了传统对评估精确性的严格要求,表明系统可能比预期更能容忍不完美的自我评估。