Reviewer #2 (Public review):
Summary and overall evaluation:
The authors assessed how visual discrimination of stimuli in the foveola changes before, during, and after small instructed eye movements (in the "micro" range). Consistent with (and advancing) related prior work, their main finding regards a pre-saccadic modulation of visual performance at the saccade target vs. the opposite location. This pre-saccadic modulation in foveal vision peaks ~70 ms prior to the instructed small saccade.
Strengths:
The study uses an impressive, technically advanced set-up and zooms in on peri-saccadic modulations in visual acuity at the micro scale. The findings build on related prior findings from the literature on smaller and larger eye movements and add temporal granularity over prior work from the same lab. The writing is easy to follow, and the figures are clear.
Weaknesses:
At the same time, the findings remain relatively empirical in nature and do not profoundly advance theoretical understanding beyond adding valuable granularity to existing knowledge. Relevant prior literature could be better introduced and acknowledged. In addition, there remain concerns regarding potential cue-driven attentional influences that may confound the reported effects (leaving the possibility that the reported effects may be related to cue-driven attention, rather than saccade planning/execution per se). There are also some issues regarding specific statistical inferences. I detail these points below.
Major Points:
(1) Novelty framing and introduction of relevant prior literature
At times, this study is introduced as if no prior study explored the time course of changes in visual perception surrounding small (micro) saccades. Yet, it appears that a prior study from the same lab, using a very similar task, already showed a time course (Figure 5 in Shelchkova & Poletti, 2020). While this study is discussed in the introduction, it is not mentioned that at least some pre-saccade time course was already reported there, albeit a more crude one than the one in the current article. Moreover, the 2013 study by Hafed also specifically looked at "peri-microsaccade modulation in visual perception" and also already showed a temporal modulation that peaked ~50 ms before microsaccade onset. I appreciate how the current study differs in a number of ways (focusing on visual acuity in the foveola), but I was nevertheless surprised to see the first reference to this relevant prior finding in the discussion (and without any elaboration). Though more recent, the same could be argued for the 2025 study by Bouhnik et al. on pre-microsaccade modulations in visual processing in V1, which, like the Hafed study, is first mentioned only in the discussion. Perhaps these studies could be introduced in the paragraph starting at line 48, or in the next paragraph, to do better justice to the existing literature on this topic when motivating the study. This would likely also help to better point out the major advances provided by the current study.
Relatedly, in Shelchkova & Poletti (PNAS, 2020), an apparently similar congruency effect on performance was reported >200 ms milliseconds before saccade onset, as evident from Fig 5 in that article. How should readers rhyme this with the current findings? Ideally, the authors would not only acknowledge that such a time course was already reported previously, but also discuss the discrepancies between these findings further: why may the performance effects appear much earlier in this prior study compared to in the current study, where the congruency effect emerges only ~100 ms prior to the instructed small saccade?
(2) Saccade- or cue-driven? (assumption that attention is unaltered in failed saccade trials)
Because the authors used a cue to instruct saccade direction, it remains a possibility that the reported modulations in visual performance may be driven directly by the spatial cue (cue-related attentional allocation), rather than the instructed small saccade per se. While the authors are clearly aware of this potential confound, questions remain regarding the convincingness of the presented control analyses. In my view, a more compelling control would require an additional experiment.
The central argument against a cue-locked (purely attentional) modulation is the absence of a performance modulation in so-called "failed" saccade trials. However, a key assumption here is that putative cue-driven attention was unaltered in these trials. This is never verified and, in my opinion, highly unlikely. Rather, trials with failed microsaccades could very well be the result of failing to process the cue in the first place (indeed, if the task is to make a saccade to the cue, failure to make a saccade equates failure to perform the task). In such trials, any putative cue-driven influences over spatial attention would also be expected to be substantially reduced. Accordingly, just because failed saccade trials show little performance modulation does not rule out cue-driven attention effects, because attention may also have "failed" in these failed saccade trials. The control for potential cue-driven attention effects would be more convincing if the authors included a condition with the same cues, where participants are simply not instructed to make any saccades to the cues. Unfortunately, such an experimental condition appears not to have been included here. The author may still consider adding such a control experiment.
Another argument against a cue-driven effect is that the authors found no interaction with time in the cue-locked data, whereas they did find such an interaction in the saccade-locked data. However, the lack of significance in the cue-locked data but significance in the saccade-locked data is not strong evidence against a cue-driven influence. Statistically, there is no direct comparison here, and more importantly, with longer delays, the cue-locked data may also start to show a dip (this could potentially be tested by the authors if they have enough trials available to extend their cue-locked analysis further in time). Indeed, exogenous attention, that may have been automatically evoked by the spatial cue, is known to be transient and to eventually even reverse after a brief initial facilitation (see e.g., Klein TiCS, 2000).
Finally, the authors consistently refer to "endogenous" attention (starting at line 221) when addressing potential cue-driven attention confounds. However, because the cue is not predictive, but is a spatial cue that differs in a bottom-up manner between left and right cues, "exogenous" attention is a more likely confound here in my view. Specifically, the spatial cue may automatically trigger attention in the direction of the target location it points to (and such exogenous effects would be expected even for unpredictive cues).
(3) Benefit and cost, or just cost?
Line 151 states that no statistically significant benefit for the saccade target was found compared to the neutral baseline. Yet, the claim throughout the article is distinct, such as in line 159: "These results show that approximately 100 milliseconds before microsaccade onset, discrimination rapidly improved at the intended target location". I do not question the robustness of the congruency effect, but the authors should be more careful when inferring "improved" perception at the target location because, as far as I could tell (as well as in the authors' own writing in line 151), this is not substantiated statistically when compared to the neutral baseline.
Related to this point, in Figure 3B, it would be informative to also see the average performance in the neutral cue condition (for example, as a straight line as in some other figures). This would help to better appreciate the relative benefits and/or costs compared to the neutral condition, also in the time-resolved data.
(4) Statistical inference for the comparison between failed and non-failed trials
Currently, the lack of modulation in the failed saccade trials hinges on a null effect. It would be stronger to support the claims with a significant difference in the congruency effect between failed and non-failed trials. Indeed, lack of significance in failed saccade trials does by itself not constitute valid evidence that the congruency effect is larger in saccade compared to failed saccade trials. For this, a significant interaction between saccade-trial-type (failed/non-failed) and congruency (congruent/incongruent) should be established (see e.g., Nieuwenhuis et al., Nat Neurosci, 2011).
(5) Time window justification
While the authors nicely depict their data across the full time axis, all statistics are currently performed on data extracted from specific time windows. How exactly were these time windows determined and justified? Likewise, how were the specific times picked for visualizing and statistically quantifying the data in e.g., Figures 3D and E? It would be reassuring to add justification for these specific time windows and/or to verify (using follow-up analyses) that the presented results are robust when different time windows are chosen.
(6) Microsaccade definition
Microsaccades are explicitly defined as being below half a degree. This appears rather arbitrary and rigid. Does the size of saccades not ultimately depend on the task and stimulus (e.g., Otero-Millan et al., PNAS, 2013) rather than being a fixed biological property? Perhaps this could be stated less rigidly, such as by stating how microsaccades are often observed below 0.5 degrees.
(Relatedly, one may wonder whether the type of instructed saccades that the authors studied here involves the same type of eye movements as the type of fixational microsaccades that have been the focus of ample prior studies. However, I recognize that this specific reflection may open a debate that is beyond the scope of this article.