• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个可扩展的人工智能平台,可自动在期刊文章中发现拷贝数变异(CNV),并将其转换为数据库:CNV 提取、转换和加载人工智能(CNV-ETLAI)。

A scalable artificial intelligence platform that automatically finds copy number variations (CNVs) in journal articles and transforms them into a database: CNV extraction, transformation, and loading AI (CNV-ETLAI).

机构信息

Department of Radiology, Laboratory of Medical Imaging and Computation, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA; Department of Laboratory Medicine, Hanyang University College of Medicine, Seoul, South Korea; GC Genome, GC Laboratories, Yong-in, South Korea.

Department of Radiology, Laboratory of Medical Imaging and Computation, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA.

出版信息

Comput Biol Med. 2022 May;144:105332. doi: 10.1016/j.compbiomed.2022.105332. Epub 2022 Feb 24.

DOI:10.1016/j.compbiomed.2022.105332
PMID:35240378
Abstract

BACKGROUND

Although copy number variations (CNVs) are infrequent, each anomaly is unique, and multiple CNVs can appear simultaneously. Growing evidence suggests that CNVs contribute to a wide range of diseases. When CNVs are detected, assessment of their clinical significance requires a thorough literature review. This process can be extremely time-consuming and may delay disease diagnosis. Therefore, we have developed CNV Extraction, Transformation, and Loading Artificial Intelligence (CNV-ETLAI), an innovative tool that allows experts to classify and interpret CNVs accurately and efficiently.

METHODS

We combined text, table, and image processing algorithms to develop an artificial intelligence platform that automatically extracts, transforms, and organizes CNV information into a database. To validate CNV-ETLAI, we compared its performance to ground truth datasets labeled by a human expert. In addition, we analyzed the CNV data, which was collected using CNV-ETLAI via a crowdsourcing approach.

RESULTS

In comparison to a human expert, CNV-ETLAI improved CNV detection accuracy by 4% and performed the analysis 60 times faster. This performance can improve even further with upscaling of the CNV-ETLAI database as usage increases. 5,800 CNVs from 2,313 journal articles were collected. Total CNV frequency for the whole chromosome was highest for chromosome X, whereas CNV frequency per 1 Mb of genomic length was highest for chromosome 22.

CONCLUSIONS

We have developed, tested, and shared CNV-ETLAI for research and clinical purposes (https://lmic.mgh.harvard.edu/CNV-ETLAI). Use of CNV-ETLAI is expected to ease and accelerate diagnostic classification and interpretation of CNVs.

摘要

背景

尽管拷贝数变异(CNVs)很少见,但每个异常都是独特的,并且可以同时出现多个 CNVs。越来越多的证据表明,CNVs 导致了广泛的疾病。当检测到 CNVs 时,需要对其临床意义进行全面的文献回顾。这个过程非常耗时,可能会延迟疾病诊断。因此,我们开发了 CNV 提取、转换和加载人工智能(CNV-ETLAI),这是一种创新的工具,可以让专家准确高效地对 CNVs 进行分类和解释。

方法

我们结合文本、表格和图像处理算法,开发了一个人工智能平台,该平台可以自动提取、转换和组织 CNV 信息到数据库中。为了验证 CNV-ETLAI,我们将其性能与由人类专家标记的地面真实数据集进行了比较。此外,我们还分析了通过众包方式使用 CNV-ETLAI 收集的 CNV 数据。

结果

与人类专家相比,CNV-ETLAI 提高了 4%的 CNV 检测准确性,分析速度提高了 60 倍。随着 CNV-ETLAI 数据库的扩展和使用量的增加,性能还可以进一步提高。从 2313 篇期刊文章中收集了 5800 个 CNVs。整个染色体的总 CNV 频率最高的是 X 染色体,而每 1Mb 基因组长度的 CNV 频率最高的是 22 号染色体。

结论

我们已经开发、测试并共享了用于研究和临床目的的 CNV-ETLAI(https://lmic.mgh.harvard.edu/CNV-ETLAI)。预计使用 CNV-ETLAI 将简化和加速 CNV 的诊断分类和解释。

相似文献

1
A scalable artificial intelligence platform that automatically finds copy number variations (CNVs) in journal articles and transforms them into a database: CNV extraction, transformation, and loading AI (CNV-ETLAI).一个可扩展的人工智能平台,可自动在期刊文章中发现拷贝数变异(CNV),并将其转换为数据库:CNV 提取、转换和加载人工智能(CNV-ETLAI)。
Comput Biol Med. 2022 May;144:105332. doi: 10.1016/j.compbiomed.2022.105332. Epub 2022 Feb 24.
2
Noise cancellation using total variation for copy number variation detection.利用全变差降噪进行拷贝数变异检测。
BMC Bioinformatics. 2018 Oct 22;19(Suppl 11):361. doi: 10.1186/s12859-018-2332-x.
3
Exome copy number variant detection, analysis, and classification in a large cohort of families with undiagnosed rare genetic disease.在一大群未确诊罕见遗传病的家庭中进行外显子组拷贝数变异检测、分析和分类。
Am J Hum Genet. 2024 May 2;111(5):863-876. doi: 10.1016/j.ajhg.2024.03.008. Epub 2024 Apr 1.
4
Copy number variation in Thai population.泰国人群中的拷贝数变异
PLoS One. 2014 Aug 13;9(8):e104355. doi: 10.1371/journal.pone.0104355. eCollection 2014.
5
CrowdVariant: a crowdsourcing approach to classify copy number variants.群体变异:一种用于分类拷贝数变异的众包方法。
Pac Symp Biocomput. 2019;24:224-235.
6
AutoCNV: a semiautomatic CNV interpretation system based on the 2019 ACMG/ClinGen Technical Standards for CNVs.AutoCNV:一种基于 2019 年 ACMG/ClinGen 拷贝数变异技术标准的半自动拷贝数变异解释系统。
BMC Genomics. 2021 Oct 6;22(1):721. doi: 10.1186/s12864-021-08011-4.
7
Constructing a database for the relations between CNV and human genetic diseases via systematic text mining.通过系统文本挖掘构建 CNV 与人类遗传疾病关系数据库。
BMC Bioinformatics. 2018 Dec 31;19(Suppl 19):528. doi: 10.1186/s12859-018-2526-2.
8
Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans.用于人类全基因组拷贝数变异(CNV)分析的高分辨率阵列平台的综合性能比较
BMC Genomics. 2017 Apr 24;18(1):321. doi: 10.1186/s12864-017-3658-x.
9
Evaluation of copy number variation detection for a SNP array platform.SNP 芯片平台拷贝数变异检测评估。
BMC Bioinformatics. 2014 Feb 21;15:50. doi: 10.1186/1471-2105-15-50.
10
miRNA and miRNA target genes in copy number variations occurring in individuals with intellectual disability.智力障碍个体中发生的拷贝数变异中的 miRNA 和 miRNA 靶基因。
BMC Genomics. 2013 Aug 10;14:544. doi: 10.1186/1471-2164-14-544.

引用本文的文献

1
Internet-Based Abnormal Chromosomal Diagnosis During Pregnancy Using a Noninvasive Innovative Approach to Detecting Chromosomal Abnormalities in the Fetus: Scoping Review.基于互联网的孕期染色体异常诊断:使用无创创新方法检测胎儿染色体异常的范围综述
JMIR Bioinform Biotechnol. 2024 Oct 16;5:e58439. doi: 10.2196/58439.