Department of Biological Sciences, George Washington University, Washington, DC, United States.
Front Immunol. 2022 Oct 18;13:1000177. doi: 10.3389/fimmu.2022.1000177. eCollection 2022.
The () gene family encodes a set of proteins that function in the sea urchin immune system. The gene sequences have a series of internal repeats in a mosaic pattern that is characteristic of this family. This mosaic pattern necessitates the insertion of large gaps, which has made alignments of the deduced protein sequences computationally difficult such that only manual alignments have been reported previously. Because manual alignments are time consuming for evaluating newly available SpTrf sequences, computational approaches were evaluated for the sequences reported previously. Furthermore, because two different manual alignments of the SpTrf sequences are feasible because of the multiple internal repeats, it is not known whether additional alternative alignments can be identified using different approaches. The bioinformatic program, PRANK, was used because it was designed to align sequences with large gaps and indels. The results from PRANK show that the alignments of the internal repeats are similar to those done manually, suggesting multiple feasible alignments for some regions. GUIDANCE based analysis of the alignments identified regions that were excellent and other regions that failed to align. This suggests that computational approaches have limits for aligning the SpTrf sequences that include multiple repeats and that require inserted gaps. Furthermore, it is unlikely that alternative alignments for the full-length SpTrf sequences will be identified.
SpTrf 序列的生物信息学分析显示,这些序列的比对存在局限性,这表明对于包含多个重复序列且需要插入间隙的 SpTrf 序列,计算方法的应用存在一定的局限性。此外,由于全长 SpTrf 序列可能存在多种可行的比对方式,因此不太可能确定替代的全长 SpTrf 序列比对方式。