Notredame C
Information Genetique et Structurale, CNRS-UMR 1889, 31 Ch. Joseph Aiguier, 13 402 Marseille, France.
Bioinformatics. 2001 Apr;17(4):373-4. doi: 10.1093/bioinformatics/17.4.373.
Multiple OCCurrences Analysis (Mocca) is a new method for repeat extraction. It is based on the T-Coffee package (Notredame et al., JMB, 302, 205-217, 2000). Given a sequence or a set of sequences, and a library of local alignments, Mocca extracts every segment of sequence homologous to a pre-specified master. The implementation is meant for domain hunting and makes it fast and easy to test for new boundaries or extend known repeats in an interactive manner. Mocca is designed to deal with highly divergent protein repeats (less than 30% amino acid identity) of more than 30 amino acids.
多次出现分析(Mocca)是一种用于重复序列提取的新方法。它基于T-Coffee软件包(Notredame等人,《分子生物学杂志》,302卷,205 - 217页,2000年)。给定一个序列或一组序列以及一个局部比对库,Mocca会提取与预先指定的主序列同源的每个序列片段。该工具旨在用于结构域搜索,能够快速且方便地以交互方式测试新的边界或扩展已知的重复序列。Mocca旨在处理长度超过30个氨基酸且氨基酸一致性低于30%的高度分化的蛋白质重复序列。