Hypothesis

1 Matching Annotations

May 2026
news.mit.edu news.mit.edu

https://news.mit.edu/2026/teaching-ai-models-to-say-im-not-sure-0422

1
1. fxp007 01 May 2026
  
  in Public
  
  Nothing in between. A model that arrives at the correct answer through careful reasoning receives the same reward as one that guesses correctly by chance.
  
  这一段落揭示了当前训练方法的问题：没有区分模型是通过深思熟虑还是偶然猜对答案，导致模型过度自信。
  
  training-methods overconfidence ai-rewards
Visit annotations in context

Tags

ai-rewards

overconfidence

training-methods

Annotators

fxp007

URL

news.mit.edu/2026/teaching-ai-models-to-say-im-not-sure-0422