i think we want more setup. I think we want to state: "As mentioned in the problem formulation, papers are transformed into reviews before reaching the decision. Given a paper (X), over decisions (D) and reviews (R) ( aka p(D,R|X)). While the flow indicates that reviews precede the decision, now that we have built a shortcut that predicts the decisionwithout the review, we question whether we can generate more \italics{decision-calibrated} reviews. This is valid by simply rearranging the joint distribution like so:
then the formula.
then the both factorizations are vlaid. then we can give a more conceptual understanding: if the conjecture is correct, that much of the decision can be determined by overall presentation quality, then perhaps reviews are simply a means-to-an-end to justify the reviewer's inital recaction.
heres more context about sources that we can cite for this behavior: The strongest terms to search are:
motivated reasoning, post hoc rationalization, biased assimilation, affect heuristic, halo effect, and in peer review specifically prestige bias, reviewer bias, or cognitive bias in peer review.
Here are the most relevant scientific articles.
- Kunda — “The Case for Motivated Reasoning”
Ziva Kunda, 1990, Psychological Bulletin
This is probably the canonical article. Kunda argues that motivation can bias the cognitive processes people use to access, construct, and evaluate beliefs. In your case: a reviewer’s initial evaluation of a paper can shape which criticisms feel salient or persuasive.
Best use: cite this for the general mechanism: people reason toward a conclusion they are already inclined to reach.
- Nisbett & Wilson — “Telling More Than We Can Know”
Richard Nisbett & Timothy Wilson, 1977, Psychological Review
This is highly relevant to the “reverse reasoning” part. They reviewed evidence that people often lack direct introspective access to the mental processes that caused their judgments, but still produce explanations for those judgments.
Best use: cite this for the idea that reviewers may sincerely report reasons for a judgment without accurately knowing what actually caused the judgment.
- Haidt — “The Emotional Dog and Its Rational Tail”
Jonathan Haidt, 2001, Psychological Review
Haidt’s social intuitionist model is about moral judgment, not peer review, but the mechanism maps well: quick intuitive judgment is followed by slower post hoc reasoning. The article explicitly frames reasoning as often coming after intuition rather than causing judgment.
Best use: cite this when you want the phrase “intuition first, reasoning second.”
- Lord, Ross & Lepper — “Biased Assimilation and Attitude Polarization”
Charles Lord, Lee Ross & Mark Lepper, 1979, Journal of Personality and Social Psychology
This is a classic study on how people evaluate evidence differently depending on their prior beliefs. People tend to accept congenial evidence more readily and scrutinize uncongenial evidence more harshly.
Best use: cite this for the reviewer behavior where evidence supporting the reviewer’s initial take gets treated as decisive, while contrary evidence gets nitpicked.
- Slovic et al. — “The Affect Heuristic”
Paul Slovic, Melissa Finucane, Ellen Peters & Donald MacGregor, 2007, European Journal of Operational Research
This article argues that feelings of “goodness” or “badness” can rapidly guide judgments and decisions. That is very close to what people informally mean by a “paper gestalt”: an overall positive or negative feeling that then colors assessment of specific features.
Best use: cite this for the gut reaction / affective global impression component.
- Tomkins, Zhang & Heavlin — “Reviewer Bias in Single- versus Double-Blind Peer Review”
Andrew Tomkins, Min Zhang & William D. Heavlin, 2017, PNAS
This is peer-review-specific. In a large conference-review setting, submissions were reviewed under single-blind and double-blind conditions. The study found that single-blind reviewing advantaged papers by famous authors and high-prestige institutions.
Best use: cite this to show that manuscript judgments are not purely about the paper’s intrinsic content; contextual cues can bias reviews.
- Blank — “The Effects of Double-Blind versus Single-Blind Reviewing”
Rebecca Blank, 1991, American Economic Review
This was a randomized experiment at the American Economic Review. It found that reviewers were more critical when author identity was blinded, and acceptance rates were lower under double-blind review.
Best use: cite this as older experimental evidence that review outcomes shift when identity/prestige cues are removed.
- Peters & Ceci — “Peer-Review Practices of Psychological Journals”
Douglas Peters & Stephen Ceci, 1982, Behavioral and Brain Sciences
Classic and brutal study: previously published psychology articles were resubmitted to the same journals with altered author/institution information. Most were not recognized, and many were rejected.
Best use: cite this for the unreliability/context-sensitivity of peer review.
- Teplitskiy et al. — “The Social Structure of Consensus in Scientific Review”
Misha Teplitskiy et al., 2018
This study analyzed reviews of 7,981 neuroscience manuscripts submitted to PLOS ONE and found that reviewers favored authors closer to them in the co-authorship network. The authors interpret this not just as simple nepotism but as partly reflecting “schools of thought” and substantive evaluative differences.
Best use: cite this if your point is that reviewers’ intellectual/social position can shape what they see as valid or flawed.
- Sordi et al. — “Halo Effect in Peer Review”
José Osvaldo De Sordi et al., 2020
This one is directly about the halo effect in peer review. It explores whether belonging to the same professional field or group can bias article evaluation during review.
Best use: cite this for the closest peer-review-specific phrase to “paper gestalt”: halo effect in peer review.
A good synthesis sentence would be:
The phenomenon can be described as an interaction of affective first impressions, motivated reasoning, and post hoc rationalization: reviewers may form an early global evaluation of a manuscript and then selectively construct or emphasize criticisms that make that evaluation appear analytically grounded.
For a paper or lit review, I’d probably cite Kunda 1990 + Nisbett & Wilson 1977 + Slovic et al. 2007 for the psychology mechanism, then Tomkins et al. 2017 + Peters & Ceci 1982 + Teplitskiy et al. 2018 for peer-review-specific evidence.
the key is that based off this realization, we sought to see whether conditioning frontier coding agents for reviews can 1) improve their decision-prediction and 2) improve their generated reviews' alignment to human reviews.