Navarro-Gomez Daniel, Leipzig Jeremy, Shen Lishuang, Lott Marie, Stassen Alphons P M, Wallace Douglas C, Wiggs Janey L, Falk Marni J, van Oven Mannis, Gai Xiaowu
Department of Ophthalmology, Harvard Medical School, Massachusetts Eye and Ear Infirmary, Boston, MA, USA, Center for Biomedical Informaticsand Center for Mitochondrial and Epigenomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA, Department of Clinical Genetics, Maastricht University Medical Centre, The Netherlands, Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA, Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA and Department of Forensic Molecular Biology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands.
Department of Ophthalmology, Harvard Medical School, Massachusetts Eye and Ear Infirmary, Boston, MA, USA, Center for Biomedical Informaticsand Center for Mitochondrial and Epigenomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA, Department of Clinical Genetics, Maastricht University Medical Centre, The Netherlands, Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA, Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA and Department of Forensic Molecular Biology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands Department of Ophthalmology, Harvard Medical School, Massachusetts Eye and Ear Infirmary, Boston, MA, USA, Center for Biomedical Informaticsand Center for Mitochondrial and Epigenomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA, Department of Clinical Genetics, Maastricht University Medical Centre, The Netherlands, Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA, Division of Human Genetics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA and Department of Forensic Molecular Biology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands.
Bioinformatics. 2015 Apr 15;31(8):1310-2. doi: 10.1093/bioinformatics/btu825. Epub 2014 Dec 12.
All current mitochondrial haplogroup classification tools require variants to be detected from an alignment with the reference sequence and to be properly named according to the canonical nomenclature standards for describing mitochondrial variants, before they can be compared with the haplogroup determining polymorphisms. With the emergence of high-throughput sequencing technologies and hence greater availability of mitochondrial genome sequences, there is a strong need for an automated haplogroup classification tool that is alignment-free and agnostic to reference sequence.
We have developed a novel mitochondrial genome haplogroup-defining algorithm using a k-mer approach namely Phy-Mer. Phy-Mer performs equally well as the leading haplogroup classifier, HaploGrep, while avoiding the errors that may occur when preparing variants to required formats and notations. We have further expanded Phy-Mer functionality such that next-generation sequencing data can be used directly as input.
Phy-Mer is publicly available under the GNU Affero General Public License v3.0 on GitHub (https://github.com/danielnavarrogomez/phy-mer).
Supplementary data are available at Bioinformatics online.
目前所有的线粒体单倍群分类工具都要求在与单倍群决定多态性进行比较之前,从与参考序列的比对中检测变体,并根据描述线粒体变体的标准命名法进行正确命名。随着高通量测序技术的出现以及线粒体基因组序列的更多可得性,迫切需要一种无需比对且不依赖参考序列的自动化单倍群分类工具。
我们开发了一种使用k-mer方法的新型线粒体基因组单倍群定义算法,即Phy-Mer。Phy-Mer的性能与领先的单倍群分类器HaploGrep相当,同时避免了在将变体准备成所需格式和符号时可能出现的错误。我们进一步扩展了Phy-Mer的功能,以便可以直接将下一代测序数据用作输入。
Phy-Mer在GNU Affero通用公共许可证v3.0下可在GitHub上公开获取(https://github.com/danielnavarrogomez/phy-mer)。
补充数据可在《生物信息学》在线获取。