• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

48 个经实验验证的 21kb 等位基因的系统发育及其在临床等位基因检测中的应用。

The phylogeny of 48 alleles, experimentally verified at 21 kb, and its application to clinical allele detection.

机构信息

Laboratory Services Section, Department of Transfusion Medicine, NIH Clinical Center, National Institutes of Health, Bethesda, MD, 20892, USA.

Bioinformatics and Computational Biosciences Branch, Office of Cyber Infrastructure and Computational Biology, National Institute of Allergy and Infectious Diseases, Bethesda, MD, USA.

出版信息

J Transl Med. 2019 Feb 11;17(1):43. doi: 10.1186/s12967-019-1791-9.

DOI:10.1186/s12967-019-1791-9
PMID:30744658
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6371619/
Abstract

BACKGROUND

Sequence information generated from next generation sequencing is often computationally phased using haplotype-phasing algorithms. Utilizing experimentally derived allele or haplotype information improves this prediction, as routinely used in HLA typing. We recently established a large dataset of long ERMAP alleles, which code for protein variants in the Scianna blood group system. We propose the phylogeny of this set of 48 alleles and identify evolutionary steps to derive the observed alleles.

METHODS

The nucleotide sequence of > 21 kb each was used for all physically confirmed 48 ERMAP alleles that we previously published. Full-length sequences were aligned and variant sites were extracted manually. The Bayesian coalescent algorithm implemented in BEAST v1.8.3 was used to estimate a coalescent phylogeny for these variants and the allelic ancestral states at the internal nodes of the phylogeny.

RESULTS

The phylogenetic analysis allowed us to identify the evolutionary relationships among the 48 ERMAP alleles, predict 4243 potential ancestral alleles and calculate a posterior probability for each of these unobserved alleles. Some of them coincide with observed alleles that are extant in the population.

CONCLUSIONS

Our proposed strategy places known alleles in a phylogenetic framework, allowing us to describe as-yet-undiscovered alleles. In this new approach, which relies heavily on the accuracy of the alleles used for the phylogenetic analysis, an expanded set of predicted alleles can be used to infer alleles when large genotype data are analyzed, as typically generated by high-throughput sequencing. The alleles identified by studies like ours may be utilized in designing of microarray technologies, imputing of genotypes and mapping of next generation sequencing data.

摘要

背景

下一代测序产生的序列信息通常使用单倍型相位算法进行计算相位。利用实验得出的等位基因或单倍型信息可以改善这种预测,这在 HLA 分型中经常使用。我们最近建立了一个包含大量长 ERMAP 等位基因的数据集,这些等位基因编码 Scianna 血型系统中的蛋白质变体。我们提出了这组 48 个等位基因的系统发育,并确定了推导观察到的等位基因的进化步骤。

方法

我们之前发表的所有经过物理确认的 48 个 ERMAP 等位基因,每个等位基因的核苷酸序列都超过 21kb。使用全序列进行比对,并手动提取变异位点。贝叶斯合并算法(在 BEAST v1.8.3 中实现)用于估计这些变体的合并系统发育,以及系统发育内部节点的等位基因祖先状态。

结果

系统发育分析使我们能够确定 48 个 ERMAP 等位基因之间的进化关系,预测 4243 个潜在的祖先等位基因,并为每个未观察到的等位基因计算后验概率。其中一些与现存于人群中的观察到的等位基因吻合。

结论

我们提出的策略将已知的等位基因置于系统发育框架中,使我们能够描述尚未发现的等位基因。在这种新方法中,严重依赖于用于系统发育分析的等位基因的准确性,可以在分析大型基因型数据时使用扩展的预测等位基因集来推断等位基因,如高通量测序通常产生的。像我们这样的研究中确定的等位基因可以用于设计微阵列技术、基因型推断和下一代测序数据的映射。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef10/6371619/d537c848db5a/12967_2019_1791_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef10/6371619/79bf190a996c/12967_2019_1791_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef10/6371619/d537c848db5a/12967_2019_1791_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef10/6371619/79bf190a996c/12967_2019_1791_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ef10/6371619/d537c848db5a/12967_2019_1791_Fig2_HTML.jpg

相似文献

1
The phylogeny of 48 alleles, experimentally verified at 21 kb, and its application to clinical allele detection.48 个经实验验证的 21kb 等位基因的系统发育及其在临床等位基因检测中的应用。
J Transl Med. 2019 Feb 11;17(1):43. doi: 10.1186/s12967-019-1791-9.
2
Full-length nucleotide sequence of ERMAP alleles encoding Scianna (SC) antigens.编码斯恰纳(SC)抗原的ERM基因座等位基因的全长核苷酸序列。
Transfusion. 2016 Dec;56(12):3047-3054. doi: 10.1111/trf.13801. Epub 2016 Sep 9.
3
Scianna antigens including Rd are expressed by ERMAP.包括Rd在内的Scianna抗原由ERMAP表达。
Blood. 2003 Jan 15;101(2):752-7. doi: 10.1182/blood-2002-07-2064. Epub 2002 Aug 22.
4
SCER and SCAN: two novel high-prevalence antigens in the Scianna blood group system.SCER和SCAN:斯恰纳血型系统中的两种新型高流行率抗原。
Transfusion. 2005 Dec;45(12):1940-4. doi: 10.1111/j.1537-2995.2005.00646.x.
5
SCAR: The high-prevalence antigen 013.008 in the Scianna blood group system.SCAR:Scianna 血型系统中的高流行抗原 013.008。
Transfusion. 2021 Jan;61(1):246-254. doi: 10.1111/trf.16152. Epub 2020 Oct 24.
6
Two new Scianna variants causing loss of high prevalence antigens: ERMAP model and 3D analysis of the antigens.两种新的 Scianna 变异体导致高流行抗原丢失:ERMAP 模型和抗原的 3D 分析。
Transfusion. 2023 Jan;63(1):230-238. doi: 10.1111/trf.17182. Epub 2022 Nov 8.
7
Genotype calling and phasing using next-generation sequencing reads and a haplotype scaffold.使用下一代测序reads 和单倍型支架进行基因型调用和相位分析。
Bioinformatics. 2013 Jan 1;29(1):84-91. doi: 10.1093/bioinformatics/bts632. Epub 2012 Oct 23.
8
DR2S: an integrated algorithm providing reference-grade haplotype sequences from heterozygous samples.DR2S:一种集成算法,可从杂合样本中提供参考级别的单倍型序列。
BMC Bioinformatics. 2021 May 10;22(1):236. doi: 10.1186/s12859-021-04153-0.
9
Reference Grade Characterization of Polymorphisms in Full-Length HLA Class I and II Genes With Short-Read Sequencing on the ION PGM System and Long-Reads Generated by Single Molecule, Real-Time Sequencing on the PacBio Platform.基于 ION PGM 系统的短读长测序和 PacBio 平台的单分子实时测序生成的长读长对全长 HLA I 类和 II 类基因中多态性的参考级特征描述。
Front Immunol. 2018 Oct 4;9:2294. doi: 10.3389/fimmu.2018.02294. eCollection 2018.
10
Allele phasing has minimal impact on phylogenetic reconstruction from targeted nuclear gene sequences in a case study of Artocarpus.在对花梨树进行的一项案例研究中,等位基因定相对靶向核基因序列的系统发育重建的影响极小。
Am J Bot. 2018 Mar;105(3):404-416. doi: 10.1002/ajb2.1068. Epub 2018 May 5.

引用本文的文献

1
SCAR: The high-prevalence antigen 013.008 in the Scianna blood group system.SCAR:Scianna 血型系统中的高流行抗原 013.008。
Transfusion. 2021 Jan;61(1):246-254. doi: 10.1111/trf.16152. Epub 2020 Oct 24.
2
ACKR1 Alleles at 5.6 kb in a Well-Characterized Renewable US Food and Drug Administration (FDA) Reference Panel for Standardization of Blood Group Genotyping.在一个经过充分特征描述的、可再生的美国食品和药物管理局(FDA)参考面板中,5.6kb 处的 ACKR1 等位基因用于血液组基因分型的标准化。
J Mol Diagn. 2020 Oct;22(10):1272-1279. doi: 10.1016/j.jmoldx.2020.06.014. Epub 2020 Jul 17.

本文引用的文献

1
Immunohaematological complications in patients with sickle cell disease after haemopoietic progenitor cell transplantation: a prospective, single-centre, observational study.造血祖细胞移植后镰状细胞病患者的免疫血液学并发症:一项前瞻性、单中心观察性研究
Lancet Haematol. 2017 Nov;4(11):e553-e561. doi: 10.1016/S2352-3026(17)30196-5.
2
Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly.对GRCh38和从头单倍体基因组组装的评估证明了参考组装的持久质量。
Genome Res. 2017 May;27(5):849-864. doi: 10.1101/gr.213611.116. Epub 2017 Apr 10.
3
Epithelia Use Butyrophilin-like Molecules to Shape Organ-Specific γδ T Cell Compartments.
上皮细胞利用类乳脂肪球膜蛋白分子塑造器官特异性γδ T细胞区室。
Cell. 2016 Sep 22;167(1):203-218.e17. doi: 10.1016/j.cell.2016.08.030. Epub 2016 Sep 15.
4
Full-length nucleotide sequence of ERMAP alleles encoding Scianna (SC) antigens.编码斯恰纳(SC)抗原的ERM基因座等位基因的全长核苷酸序列。
Transfusion. 2016 Dec;56(12):3047-3054. doi: 10.1111/trf.13801. Epub 2016 Sep 9.
5
Regulation of Immunity by Butyrophilins.黏蛋白家族蛋白对免疫的调节作用。
Annu Rev Immunol. 2016 May 20;34:151-72. doi: 10.1146/annurev-immunol-041015-055435. Epub 2016 Jan 11.
6
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
7
ClonalFrameML: efficient inference of recombination in whole bacterial genomes.ClonalFrameML:高效推断全细菌基因组中的重组。
PLoS Comput Biol. 2015 Feb 12;11(2):e1004041. doi: 10.1371/journal.pcbi.1004041. eCollection 2015 Feb.
8
The IPD and IMGT/HLA database: allele variant databases.国际参与者数据(IPD)和国际免疫遗传学信息系统/HLA数据库:等位基因变异数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D423-31. doi: 10.1093/nar/gku1161. Epub 2014 Nov 20.
9
Discovery of common sequences absent in the human reference genome using pooled samples from next generation sequencing.利用来自下一代测序的混合样本发现人类参考基因组中缺失的常见序列。
BMC Genomics. 2014 Aug 16;15(1):685. doi: 10.1186/1471-2164-15-685.
10
Genetic variation of the whole ICAM4 gene in Caucasians and African Americans.高加索人群和非裔美国人中整个人细胞间黏附分子 4(ICAM4)基因的遗传变异。
Transfusion. 2014 Sep;54(9):2315-24. doi: 10.1111/trf.12615. Epub 2014 Mar 28.