24 Matching Annotations
  1. May 2026
    1. A limitation of the present work is we assume that preferences are fixed, thought they are likely to be dynamically updated throughout the task. Future work should aim to characterize preferences update mechanisms, by dynamically updating both the reward and transitional probabilty structure of the task, either in a gradually or chunk-wise (i.e. having separate experimental blocks with different task structures).

      let's try and mkae the limitatons not too extensive. LLMs (when reviewing) actually pick up on these and present them as independent evidence that something's wrong with the paper.

    2. Future experiment: have participants conduct several versions of the same task in which different features of the task drive the return (i.e. make cost more relevant for example) and see how that influences the preferences weighting.

      nice idea! But keep for ourselves for now.

    3. Figure 5:

      well done. I assume energy has such a big impact bc the agent no longer can tell when it runs dry or is full of energy? so basically doing 'stupid' mistakes? It would be good to give a concrete reasoning like this, even if speculative.

    4. as the interaction between the two performs better than Ott et al. (2022a) original model (see Figure 4 A).

      it would be better ot compare against one of our own models (otherwise too confusing who said what and why)

    5. Importantly, these differences in RT patterns observed between intermediate and extreme offers might in fact be driven by differences in overall preference strength between both these conditions rather than these conditions being treated as separate contexts.

      maybe clear from my cokments above but no need ot argue here against Ott et al. This is a standalone paper. Rather derive directly from our motivation.

    6. . Figure 3 B depicts p

      before going in these results, we should motivate this, i.e. saying what we'll learn from this analysis. Otherise the following description will just appear confusing and will draw comments by reviewers.

    7. bias is more extreme for more extreme offers (1 and 4) than intermediate ones (2 and 3)

      I'm not partiiculary a friend of the terms 'extreme' and 'intermediate', so we might drop them if not required

    8. In line with Ott et al. (2022a), we observe that a model taking only planning component into account does not perform as well compared to taking state specific biases into account. Importantly, the preference model was found to significantly outperform the hybrid model,

      this will be too confusing for 1st time readers. I'd structure it like this: We just define our own models (even if they are the same in Ott et al), e.g. planning-only, preference-only, full model. Think of names that make sense for us. Then analyse all of them. And only mention then the hybrid model.

      This would be the cleanest way of presenting the results. And we also refer to Ott et al 2022 so no one will accuse us of not mentioning prior work.

    9. Task structure importance

      put this before 2.5 RTs small changes to the first paragraph: go quickly for the kill, no reference to Ott necessary here. As we will say in the intro that task structure may be used to form cheap priors, so ask there the question what are those in the present task. List them (briefly motivate each one of them why they give prior info). And then first say in a sentence or two what kind of analyses we're proposing and only then go through each analysis (as you do already now)

    10. his should be reflected in participants reaction time (RT), such that the stronger their preferences (i.e. the lower their preferences entropy), the lower their RT. Importantly, as it was previously observed that participants RT increases when decision values are close to 0, which was interepreted as evidence that participants are able to reach a decision quicker when the value asssociated with one decision is much higher than that associated with any other decisions Ott et al. (2022a). In this paper, they further observed that participant RT depended more on decision values in intermediate offers (2 and 3) compared to extreme ones (1 and 4), which they interpreted as evidence that participants consider these offer groups as separate contexts, with the latter requiring more forward planning than the former.

      there won't be the need to explain in detail what Ott found before. It should be in there but only 1 or 2 sentences and say that there was already some evidence found.

    11. Following the observation that modelling choice behaviour as a function of preferences, decisions values and the interaction between the two fitted the data best, we hypothesized that the degree to which participants engage in effortful foward planning in a given state depends on the strength of their preferences.

      The RT analysis would be a cornerstore, and we need to refer back briefly to our narrative of the intro to remind readers of the big picture

    12. In this work, we aimed to investigate whether instead of a binary contextualization of the task, participants behaviour reflects a weighted combination of a planning component with preferences reflecting the structure of the task.

      good. it'd be even better to have some prior motivation/hypothesis/conjecture that we can build on. And this motivation would be laid out in the introduction and only referred here to.

    13. which in the case of a binary decision problem simplifies to the logit function (see Lepauvre and Kiebel (2026) for proof):

      nice.... if this result has been used before though, it's better for a smooth review to say that we followed their analysis.

    14. , we incorporated one segment beyond the known horizon to ensure that future beyond known horizon is considered, otherwise the optimal solution would seek to reach minimal energy level by the end of the 2 segments, which would be detrimental to long term planning.

      the question is also why not model all 240 (?) trials, why we shortened (suboptimally) to 13 trials. Explain that it was our assumption that this is sufficient.

    15. Optimal planning

      Before going here, we also need to say how many trials there were and that the task was 'continuous' so doesnt end after 20 trials or so but goes much further.

    16. For this paper, we reused data from a previously published study in which 40 participants (22, mean age=24.4, SD=4.6)

      to make sure we both share the same understanding: A paper should usually be standalone so ideally you would put here all the info about the experiment and not assume that readers would read Ott et al 2022 to find out.

    17. from Ott et al. (2022a)

      no need to say this here again. But if the figure was taken from Florians paper, we must get approval of the journal and say that we did, usually at the end of the figure caption. if we changed somehting, the text at the end would be 'Figure adapted from Ott et al...' (or similar)

    18. (Ott et al. (2022a), Ott et al. (2022b) Ott, Florian, Eric Legler, and Stefan J Kiebel. 2022b. Forward Planning Driven by Context-Dependent Conflict Processing in Anterior Cingulate Cortex - Analysis Code and Datasets. V. 1.2. Zenodo, released. https://doi.org/10.5281/zenodo.6328296. ).

      we should talk about making Florian Ott a co-author. He provided the data, wrote the application with me, and one might even argue that the previous paper gave us the idea to think harder about what preferences subjects use. I'm neutral on this question of co-authorship but interested in an objective assessment of his contribution.