Reviewer #3 (Public Review):
Summary:
In this study the authors tested for alterations in selection intensity across ~13,000 protein coding genes along the gorilla lineage in order to test the hypothesis that the evolution of a polygynous social system resulted in relaxed selective constraint through a reduction in sperm competition. Of these genes, 578 exhibited signatures of relaxed purifying selection that were enriched for functions in male germ cells including meiosis and sperm biology. These genes were also more likely expressed in male germ cells and to contain deleterious mutations. Functional analysis of genes not previously implicated in male reproduction identified 41 new genes essential to male fertility in a Drosophila model. Moreover, genes under relaxed selective constraint in the gorilla lineage were more likely to contain loss of function variants in a cohort of infertile men. The authors conclude their results support the hypothesis that the emergence of a polygynous social system may have reduced the degree of selective pressures exerted through sperm competition.
Strengths:
(1) The identification of novel genes involved in spermatogenesis using signatures of relaxed selective constraint coupled to in vivo RNAi in Drosophila is very exciting and offers a proof of principal as to the power of evolutionarily-informed functional genomics that has been largely underutilized.
Weaknesses:
(1) The analysis is restricted to protein-coding regions of genes that have single, orthologous sequences spanning 261 mammalian species, and as such is a non-random set of 13,310 genes that have higher evolutionary conservation. While this approach is necessary for the analyses being performed, it excludes non-coding regions, recently duplicated genes/gene families, and rapidly evolving genes, which are all likely subject to stronger selection as compared to evolutionarily conserved genes (and gene regions). Thus, the conclusions of relaxed selective constraint as being pervasive is likely missing a large number of the most strongly selected genes, among which have repeatedly been shown to include sex and reproduction related genes. Would the results be similar if the set of orthologous genes were restricted to the primate lineage, as it may include more rapidly evolving genes?
(2) The identification of genes showing relaxed selection along the gorilla lineage, which are overrepresented in male reproduction, supports the hypothesis that the emergency of polygyny resulted in relaxed sperm competition and is the driving force behind their observations. However, there is no control group to support that polygyny is the driving force. To more fully test this hypothesis the authors should consider contrasting their findings to observations for other species whereby polygyny did not evolve (or a gradation between). Ideally this could be integrated into RELAX-Scan comparisons, but even a semi-qualitative observation could be made for lineages more often having shared signatures of relaxed constraint across the 576 genes identified in gorilla.
(3) The comparisons of infertile human males to a large number of presumably healthy males from a separate cohort can lead to genetic differences related to population structure and/or differences in study recruitment as compared to infertility, and care must be taken to avoid confounding in any association study before drawing conclusions. Population structure is likely to occur in human cohorts and is more likely to affect patterns of rare variation, even when controls are ascertained using similar enrollment criteria, geographic regions, racial/ethnic and national identities. In this study, the MERGE cohort upon a quick search appears to be largely recruited from Germany, vs. the control cohort gnomeAD is a more cosmopolitan study including somewhat diverse ancestries. Thus, it is likely the infertile vs. control cohort has existing genetic differences unrelated to the phenotype.