Hypothesis

24 Matching Annotations

May 2026
alexlepauvre.github.io alexlepauvre.github.io

Planning and preferences

24
1. stefan.kiebel 05 May 2026
  
  in Public
  
  A limitation of the present work is we assume that preferences are fixed, thought they are likely to be dynamically updated throughout the task. Future work should aim to characterize preferences update mechanisms, by dynamically updating both the reward and transitional probabilty structure of the task, either in a gradually or chunk-wise (i.e. having separate experimental blocks with different task structures).
  
  let's try and mkae the limitatons not too extensive. LLMs (when reviewing) actually pick up on these and present them as independent evidence that something's wrong with the paper.
2. stefan.kiebel 05 May 2026
  
  in Public
  
  Future experiment: have participants conduct several versions of the same task in which different features of the task drive the return (i.e. make cost more relevant for example) and see how that influences the preferences weighting.
  
  nice idea! But keep for ourselves for now.
3. stefan.kiebel 05 May 2026
  
  in Public
  
  Figure 5:
  
  well done. I assume energy has such a big impact bc the agent no longer can tell when it runs dry or is full of energy? so basically doing 'stupid' mistakes? It would be good to give a concrete reasoning like this, even if speculative.
4. stefan.kiebel 05 May 2026
  
  in Public
  
  as the interaction between the two performs better than Ott et al. (2022a) original model (see Figure 4 A).
  
  it would be better ot compare against one of our own models (otherwise too confusing who said what and why)
5. stefan.kiebel 05 May 2026
  
  in Public
  
  Importantly, these differences in RT patterns observed between intermediate and extreme offers might in fact be driven by differences in overall preference strength between both these conditions rather than these conditions being treated as separate contexts.
  
  maybe clear from my cokments above but no need ot argue here against Ott et al. This is a standalone paper. Rather derive directly from our motivation.
6. stefan.kiebel 05 May 2026
  
  in Public
  
  . Figure 3 B depicts p
  
  before going in these results, we should motivate this, i.e. saying what we'll learn from this analysis. Otherise the following description will just appear confusing and will draw comments by reviewers.
7. stefan.kiebel 05 May 2026
  
  in Public
  
  bias is more extreme for more extreme offers (1 and 4) than intermediate ones (2 and 3)
  
  I'm not partiiculary a friend of the terms 'extreme' and 'intermediate', so we might drop them if not required
8. stefan.kiebel 05 May 2026
  
  in Public
  
  In line with Ott et al. (2022a), we observe that a model taking only planning component into account does not perform as well compared to taking state specific biases into account. Importantly, the preference model was found to significantly outperform the hybrid model,
  
  this will be too confusing for 1st time readers. I'd structure it like this: We just define our own models (even if they are the same in Ott et al), e.g. planning-only, preference-only, full model. Think of names that make sense for us. Then analyse all of them. And only mention then the hybrid model.
  
  This would be the cleanest way of presenting the results. And we also refer to Ott et al 2022 so no one will accuse us of not mentioning prior work.
9. stefan.kiebel 05 May 2026
  
  in Public
  
  Task structure importance
  
  put this before 2.5 RTs small changes to the first paragraph: go quickly for the kill, no reference to Ott necessary here. As we will say in the intro that task structure may be used to form cheap priors, so ask there the question what are those in the present task. List them (briefly motivate each one of them why they give prior info). And then first say in a sentence or two what kind of analyses we're proposing and only then go through each analysis (as you do already now)
10. stefan.kiebel 05 May 2026
  
  in Public
  
  predicted
  
  better: formulate hypotheses in the intro, and refer back to them. ('Predictions' sound too grand. )
11. stefan.kiebel 05 May 2026
  
  in Public
  
  his should be reflected in participants reaction time (RT), such that the stronger their preferences (i.e. the lower their preferences entropy), the lower their RT. Importantly, as it was previously observed that participants RT increases when decision values are close to 0, which was interepreted as evidence that participants are able to reach a decision quicker when the value asssociated with one decision is much higher than that associated with any other decisions Ott et al. (2022a). In this paper, they further observed that participant RT depended more on decision values in intermediate offers (2 and 3) compared to extreme ones (1 and 4), which they interpreted as evidence that participants consider these offer groups as separate contexts, with the latter requiring more forward planning than the former.
  
  there won't be the need to explain in detail what Ott found before. It should be in there but only 1 or 2 sentences and say that there was already some evidence found.
12. stefan.kiebel 05 May 2026
  
  in Public
  
  Following the observation that modelling choice behaviour as a function of preferences, decisions values and the interaction between the two fitted the data best, we hypothesized that the degree to which participants engage in effortful foward planning in a given state depends on the strength of their preferences.
  
  The RT analysis would be a cornerstore, and we need to refer back briefly to our narrative of the intro to remind readers of the big picture
13. stefan.kiebel 05 May 2026
  
  in Public
  
  see Lepauvre and Kiebel (2026) for the exact models
  
  I assume this would be the appendices in the paper?
14. stefan.kiebel 05 May 2026
  
  in Public
  
  In addition to this model, we fitted
  
  add motivation: bc we wanted to compare models
15. stefan.kiebel 05 May 2026
  
  in Public
  
  In this work, we aimed to investigate whether instead of a binary contextualization of the task, participants behaviour reflects a weighted combination of a planning component with preferences reflecting the structure of the task.
  
  good. it'd be even better to have some prior motivation/hypothesis/conjecture that we can build on. And this motivation would be laid out in the introduction and only referred here to.
16. stefan.kiebel 05 May 2026
  
  in Public
  
  observed
  
  replace 'observed' by 'found' and now is a good time to cite Ott et al 2022
17. stefan.kiebel 05 May 2026
  
  in Public
  
  which in the case of a binary decision problem simplifies to the logit function (see Lepauvre and Kiebel (2026) for proof):
  
  nice.... if this result has been used before though, it's better for a smooth review to say that we followed their analysis.
18. stefan.kiebel 05 May 2026
  
  in Public
  
  , we incorporated one segment beyond the known horizon to ensure that future beyond known horizon is considered, otherwise the optimal solution would seek to reach minimal energy level by the end of the 2 segments, which would be detrimental to long term planning.
  
  the question is also why not model all 240 (?) trials, why we shortened (suboptimally) to 13 trials. Explain that it was our assumption that this is sufficient.
19. stefan.kiebel 05 May 2026
  
  in Public
  
  Optimal planning
  
  Before going here, we also need to say how many trials there were and that the task was 'continuous' so doesnt end after 20 trials or so but goes much further.
20. stefan.kiebel 05 May 2026
  
  in Public
  
  For this paper, we reused data from a previously published study in which 40 participants (22, mean age=24.4, SD=4.6)
  
  to make sure we both share the same understanding: A paper should usually be standalone so ideally you would put here all the info about the experiment and not assume that readers would read Ott et al 2022 to find out.
21. stefan.kiebel 05 May 2026
  
  in Public
  
  from Ott et al. (2022a)
  
  no need to say this here again. But if the figure was taken from Florians paper, we must get approval of the journal and say that we did, usually at the end of the figure caption. if we changed somehting, the text at the end would be 'Figure adapted from Ott et al...' (or similar)
22. stefan.kiebel 05 May 2026
  
  in Public
  
  (Ott et al. (2022a), Ott et al. (2022b) Ott, Florian, Eric Legler, and Stefan J Kiebel. 2022b. Forward Planning Driven by Context-Dependent Conflict Processing in Anterior Cingulate Cortex - Analysis Code and Datasets. V. 1.2. Zenodo, released. https://doi.org/10.5281/zenodo.6328296. ).
  
  we should talk about making Florian Ott a co-author. He provided the data, wrote the application with me, and one might even argue that the previous paper gave us the idea to think harder about what preferences subjects use. I'm neutral on this question of co-authorship but interested in an objective assessment of his contribution.
23. stefan.kiebel 05 May 2026
  
  in Public
  
  study was approved
  
  write ethics approval number
24. stefan.kiebel 05 May 2026
  
  in Public
  
  we reanalyzed the data from Ott et al. (2022a)
  
  We don't need to say this here. From the point of the reader, it's quite irrelevant where the data came from.
Visit annotations in context

Annotators

stefan.kiebel

URL

alexlepauvre.github.io/state_abstraction_paper/

Annotators

URL