• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估和共享生物医学数据集的全球遗传血统。

Evaluating and sharing global genetic ancestry in biomedical datasets.

机构信息

Health Department of Biomedical Informatics, University of California, San Diego, La Jolla, California, USA.

Moores Cancer Center, University of California, San Diego, La Jolla, California, USA.

出版信息

J Am Med Inform Assoc. 2019 May 1;26(5):457-461. doi: 10.1093/jamia/ocy194.

DOI:10.1093/jamia/ocy194
PMID:30869786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6433181/
Abstract

Genetic ancestry is a critical co-factor to study phenotype-genotype associations using cohorts of human subjects. Most publicly available molecular datasets are, however, missing this information or only share self-reported race and ethnicity, representing a limitation to identify and repurpose datasets to investigate the contribution of ancestry to diseases and traits. We propose an analytical framework to enrich the metadata from publicly available cohorts with genetic ancestry information and a resulting diversity score at continental resolution, calculated directly from the data. We illustrate this framework using The Cancer Genome Atlas datasets searched through the DataMed Data Discovery Index. Data repositories and contributors can use this framework to provide genetic diversity measurements for controlled access datasets, minimizing the work involved in requesting a dataset that may ultimately prove inadequate for a researcher's purpose. With the increasing global scale of human genetics research, studies on disease risk and susceptibility would benefit greatly from the adequate estimation and sharing of genetic diversity in publicly available datasets following a framework such as the one presented.

摘要

遗传背景是研究人类研究对象表型-基因型关联的关键协同因素。然而,大多数公开可用的分子数据集都缺少这方面的信息,或者只共享自我报告的种族和民族信息,这限制了对数据集的识别和重新利用,以研究遗传背景对疾病和特征的贡献。我们提出了一个分析框架,用遗传背景信息和大陆分辨率的多样性评分来丰富公开队列的元数据,这些评分是直接从数据中计算出来的。我们使用通过 DataMed 数据发现索引搜索的癌症基因组图谱数据集来说明这个框架。数据存储库和贡献者可以使用这个框架为受控访问数据集提供遗传多样性测量,从而最大限度地减少请求数据集的工作,这些数据集最终可能不适合研究人员的目的。随着人类遗传学研究的全球范围不断扩大,疾病风险和易感性的研究将从在公开数据集的充分估计和共享遗传多样性中获益,而这正是所提出的框架的作用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9e/6433181/5ee4304d5780/ocy194f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9e/6433181/5ee4304d5780/ocy194f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a9e/6433181/5ee4304d5780/ocy194f1.jpg

相似文献

1
Evaluating and sharing global genetic ancestry in biomedical datasets.评估和共享生物医学数据集的全球遗传血统。
J Am Med Inform Assoc. 2019 May 1;26(5):457-461. doi: 10.1093/jamia/ocy194.
2
Harmonizing Genetic Ancestry and Self-identified Race/Ethnicity in Genome-wide Association Studies.全基因组关联研究中遗传祖源与自我认定的种族/族群的协调。
Am J Hum Genet. 2019 Oct 3;105(4):763-772. doi: 10.1016/j.ajhg.2019.08.012. Epub 2019 Sep 26.
3
Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort.在成人健康与衰老基因流行病学研究(GERA)队列中对10万名受试者的种族/民族和基因血统进行特征分析。
Genetics. 2015 Aug;200(4):1285-95. doi: 10.1534/genetics.115.178616. Epub 2015 Jun 19.
4
The Future of Genomic Studies Must Be Globally Representative: Perspectives from PAGE.基因组研究的未来必须具有全球代表性:PAGE 的观点。
Annu Rev Genomics Hum Genet. 2019 Aug 31;20:181-200. doi: 10.1146/annurev-genom-091416-035517. Epub 2019 Apr 12.
5
Genetics, epidemiology, and cancer disparities: is it black and white?遗传学、流行病学与癌症差异:非黑即白吗?
J Clin Oncol. 2006 May 10;24(14):2164-9. doi: 10.1200/JCO.2005.05.1656.
6
Translating Population Difference: The Use and Re-Use of Genetic Ancestry in Brazilian Cancer Genetics.解读人群差异:巴西癌症遗传学中遗传血统的应用与再利用
Med Anthropol. 2016;35(1):58-72. doi: 10.1080/01459740.2015.1091818. Epub 2015 Oct 9.
7
Examining How Race, Ethnicity, and Ancestry Data Are Used in Biomedical Research.审视种族、民族和血统数据在生物医学研究中的应用方式。
JAMA. 2018 Oct 16;320(15):1533-1534. doi: 10.1001/jama.2018.13609.
8
Evaluating genetic ancestry and self-reported ethnicity in the context of carrier screening.在携带者筛查背景下评估遗传血统和自我报告的种族。
BMC Genet. 2017 Nov 28;18(1):99. doi: 10.1186/s12863-017-0570-y.
9
Biogeographical ancestry and race.生物地理血统与种族。
Stud Hist Philos Biol Biomed Sci. 2014 Sep;47 Pt A:173-84. doi: 10.1016/j.shpsc.2014.05.017. Epub 2014 Jun 29.
10
The importance of race and ethnic background in biomedical research and clinical practice.种族和族裔背景在生物医学研究及临床实践中的重要性。
N Engl J Med. 2003 Mar 20;348(12):1170-5. doi: 10.1056/NEJMsb025007.

引用本文的文献

1
Genotype prediction of 336,463 samples from public expression data.基于公开表达数据对336,463个样本进行基因型预测。
bioRxiv. 2024 Mar 13:2023.10.21.562237. doi: 10.1101/2023.10.21.562237.
2
Uncovering the genomic basis of an extraordinary plant invasion.揭示一种非凡植物入侵的基因组基础。
Sci Adv. 2022 Aug 26;8(34):eabo5115. doi: 10.1126/sciadv.abo5115. Epub 2022 Aug 24.
3
Validation of an NGS Panel Designed for Detection of Actionable Mutations in Tumors Common in Latin America.用于检测拉丁美洲常见肿瘤中可操作突变的二代测序(NGS)检测板的验证

本文引用的文献

1
Finding useful data across multiple biomedical data repositories using DataMed.利用 DataMed 在多个生物医学数据存储库中查找有用数据。
Nat Genet. 2017 May 26;49(6):816-819. doi: 10.1038/ng.3864.
2
iPSCORE: A Resource of 222 iPSC Lines Enabling Functional Characterization of Genetic Variation across a Variety of Cell Types.iPSCORE:一个包含 222 个人诱导多能干细胞系的资源库,能够对多种细胞类型中的遗传变异进行功能特征分析。
Stem Cell Reports. 2017 Apr 11;8(4):1086-1100. doi: 10.1016/j.stemcr.2017.03.012.
3
Genomics, Health Disparities, and Missed Opportunities for the Nation's Research Agenda.
J Pers Med. 2021 Sep 8;11(9):899. doi: 10.3390/jpm11090899.
4
Prevalence of cancer susceptibility variants in patients with multiple Lynch syndrome related cancers.林奇综合征相关多种癌症患者的癌症易感性变异体的流行率。
Sci Rep. 2021 Jul 20;11(1):14807. doi: 10.1038/s41598-021-94292-4.
5
Global gene flow releases invasive plants from environmental constraints on genetic diversity.全球基因流使入侵植物摆脱了环境对遗传多样性的限制。
Proc Natl Acad Sci U S A. 2020 Feb 25;117(8):4218-4227. doi: 10.1073/pnas.1915848117. Epub 2020 Feb 7.
6
Not the medical informatics of our founding mothers and fathers, or is it?这不是我们先驱者的医学信息学,还是说它是呢?
J Am Med Inform Assoc. 2019 May 1;26(5):381-382. doi: 10.1093/jamia/ocz027.
基因组学、健康差异与国家研究议程中错失的机遇。
JAMA. 2017 May 9;317(18):1831-1832. doi: 10.1001/jama.2017.3096.
4
Genetically determined ancestry is more informative than self-reported race in HIV-infected and -exposed children.在感染艾滋病毒和接触过艾滋病毒的儿童中,基因决定的血统比自我报告的种族信息更丰富。
Medicine (Baltimore). 2016 Sep;95(36):e4733. doi: 10.1097/MD.0000000000004733.
5
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
6
Ancestry, admixture and fitness in Colombian genomes.哥伦比亚基因组中的祖先、混合与适应性
Sci Rep. 2015 Jul 21;5:12376. doi: 10.1038/srep12376.
7
Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations.利用多个群体的等位基因频率从DNA序列数据中快速推断个体祖先。
BMC Bioinformatics. 2015 Jan 16;16:4. doi: 10.1186/s12859-014-0418-7.
8
Self-reported race/ethnicity in the age of genomic research: its potential impact on understanding health disparities.基因组研究时代的自我报告种族/族裔:其对理解健康差异的潜在影响。
Hum Genomics. 2015 Jan 7;9(1):1. doi: 10.1186/s40246-014-0023-x.
9
Genetic ancestry of participants in the National Children's Study.国家儿童研究参与者的遗传血统。
Genome Biol. 2014 Feb 3;15(2):R22. doi: 10.1186/gb-2014-15-2-r22.
10
The Cancer Genome Atlas Pan-Cancer analysis project.癌症基因组图谱泛癌分析项目。
Nat Genet. 2013 Oct;45(10):1113-20. doi: 10.1038/ng.2764.