Carnegie Mellon-University of Pittsburgh PhD Program in Computational Biology, Pittsburgh, United States.
Department of Computational Biology, University of Pittsburgh, Pittsburgh, United States.
Elife. 2022 Nov 7;11:e76911. doi: 10.7554/eLife.76911.
Body hair is a defining mammalian characteristic, but several mammals, such as whales, naked mole-rats, and humans, have notably less hair. To find the genetic basis of reduced hair quantity, we used our evolutionary-rates-based method, RERconverge, to identify coding and noncoding sequences that evolve at significantly different rates in so-called hairless mammals compared to hairy mammals. Using RERconverge, we performed a genome-wide scan over 62 mammal species using 19,149 genes and 343,598 conserved noncoding regions. In addition to detecting known and potential novel hair-related genes, we also discovered hundreds of putative hair-related regulatory elements. Computational investigation revealed that genes and their associated noncoding regions show different evolutionary patterns and influence different aspects of hair growth and development. Many genes under accelerated evolution are associated with the structure of the hair shaft itself, while evolutionary rate shifts in noncoding regions also included the dermal papilla and matrix regions of the hair follicle that contribute to hair growth and cycling. Genes that were top ranked for coding sequence acceleration included known hair and skin genes , , , and that surprisingly showed no signals of evolutionary rate shifts in nearby noncoding regions. Conversely, accelerated noncoding regions are most strongly enriched near regulatory hair-related genes and microRNAs, such as , , and , that themselves do not show rate shifts in their protein-coding sequences. Such dichotomy highlights the interplay between the evolution of protein sequence and regulatory sequence to contribute to the emergence of a convergent phenotype.
身体毛发是哺乳动物的显著特征之一,但有一些哺乳动物,如鲸鱼、裸鼹鼠和人类,毛发明显较少。为了找到毛发数量减少的遗传基础,我们使用基于进化速率的方法 RERconverge,来鉴定编码和非编码序列,这些序列在所谓的无毛哺乳动物和有毛哺乳动物之间的进化速率明显不同。使用 RERconverge,我们对 62 种哺乳动物物种进行了全基因组扫描,涉及 19149 个基因和 343598 个保守的非编码区。除了检测到已知和潜在的新的毛发相关基因外,我们还发现了数百个可能的毛发相关调控元件。计算分析表明,基因及其相关的非编码区表现出不同的进化模式,影响毛发生长和发育的不同方面。许多进化加速的基因与毛干本身的结构有关,而非编码区的进化速率变化也包括毛囊的真皮乳头和基质区域,这些区域有助于毛发生长和循环。编码序列加速排名最高的基因包括已知的毛发和皮肤基因 、 、 ,它们在附近的非编码区没有显示出进化速率变化的信号。相反,加速的非编码区最富集在与毛发相关的调控基因和 microRNAs 附近,如 、 、 ,它们自身的蛋白质编码序列没有显示出速率变化。这种二分法突出了蛋白质序列和调控序列进化之间的相互作用,有助于趋同表型的出现。