Hello, really nice work! The channel-adaptive variant is a neat way to get cross-dataset generalization. One thing I was curious about is how many (and which) landmark channels actually matter for prediction accuracy?
Figure S11 shows that it works qualitatively with a single channel, and OpenCell is quantitative but changes both channel count and imaging domain at once, so I couldn't tell how much is lost by dropping channels alone. Seems like it should cost something real given that each channel carries independent info, and the model only sees cell identity/state through landmark morphology, so fewer channels means less to condition on. The Vermeer-XL CA vs. fixed gap in Tables S2–S4 hints at this too. A quick within-HPA ablation (nucleus only vs. + microtubule vs. + ER, same metrics) would isolate it and tell people how much fidelity they give up when they've only got, say, a Hoechst stain. Thanks btw for sharing the code and weights!