• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

REDBot:用于产前和妊娠产物诊断中临床拷贝数变异报告的自然语言处理方法。

REDBot: Natural language process methods for clinical copy number variation reporting in prenatal and products of conception diagnosis.

机构信息

Berry Genomics Corporation, Beijing, China.

Department of Obstetrics and Gynecology, West China Second University Hospital, Sichuan University, Chengdu.

出版信息

Mol Genet Genomic Med. 2020 Nov;8(11):e1488. doi: 10.1002/mgg3.1488. Epub 2020 Sep 22.

DOI:10.1002/mgg3.1488
PMID:32961042
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7667294/
Abstract

BACKGROUND

Current copy number variation (CNV) identification methods have rapidly become mature. However, the postdetection processes such as variant interpretation or reporting are inefficient. To overcome this situation, we developed REDBot as an automated software package for accurate and direct generation of clinical diagnostic reports for prenatal and products of conception (POC) samples.

METHODS

We applied natural language process (NLP) methods for analyzing 30,235 in-house historical clinical reports through active learning, and then, developed clinical knowledge bases, evidence-based interpretation methods and reporting criteria to support the whole postdetection pipeline.

RESULTS

Of the 30,235 reports, we obtained 37,175 CNV-paragraph pairs. For these pairs, the active learning approaches achieved a 0.9466 average F1-score in sentence classification. The overall accuracy for variant classification was 95.7%, 95.2%, and 100.0% in retrospective, prospective, and clinical utility experiments, respectively.

CONCLUSION

By integrating NLP methods in CNVs postdetection pipeline, REDBot is a robust and rapid tool with clinical utility for prenatal and POC diagnosis.

摘要

背景

目前的拷贝数变异(CNV)识别方法已经迅速成熟。然而,变异检测后的处理,如变异解释或报告,效率不高。为了克服这种情况,我们开发了 REDBot,这是一个自动化软件包,用于准确和直接生成产前和妊娠产物(POC)样本的临床诊断报告。

方法

我们通过主动学习应用自然语言处理(NLP)方法分析了 30235 份内部历史临床报告,然后开发了临床知识库、基于证据的解释方法和报告标准,以支持整个检测后管道。

结果

在 30235 份报告中,我们获得了 37175 对 CNV-段落。对于这些对,主动学习方法在句子分类中达到了 0.9466 的平均 F1 分数。在回顾性、前瞻性和临床实用性实验中,变异分类的总体准确率分别为 95.7%、95.2%和 100.0%。

结论

通过在 CNV 检测后管道中集成 NLP 方法,REDBot 是一种具有临床实用性的强大而快速的产前和 POC 诊断工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/24277781d3d4/MGG3-8-e1488-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/98e78e70caf4/MGG3-8-e1488-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/1ac054c53d96/MGG3-8-e1488-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/45763e9cb305/MGG3-8-e1488-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/b17a91f77536/MGG3-8-e1488-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/24277781d3d4/MGG3-8-e1488-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/98e78e70caf4/MGG3-8-e1488-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/1ac054c53d96/MGG3-8-e1488-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/45763e9cb305/MGG3-8-e1488-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/b17a91f77536/MGG3-8-e1488-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/91d6/7667294/24277781d3d4/MGG3-8-e1488-g005.jpg

相似文献

1
REDBot: Natural language process methods for clinical copy number variation reporting in prenatal and products of conception diagnosis.REDBot:用于产前和妊娠产物诊断中临床拷贝数变异报告的自然语言处理方法。
Mol Genet Genomic Med. 2020 Nov;8(11):e1488. doi: 10.1002/mgg3.1488. Epub 2020 Sep 22.
2
The impact of human copy number variation on a new era of genetic testing.人类拷贝数变异对遗传检测新时代的影响。
BJOG. 2010 Mar;117(4):391-8. doi: 10.1111/j.1471-0528.2009.02470.x. Epub 2010 Jan 26.
3
Identification of copy number variations associated with congenital heart disease by chromosomal microarray analysis and next-generation sequencing.通过染色体微阵列分析和下一代测序鉴定与先天性心脏病相关的拷贝数变异
Prenat Diagn. 2016 Apr;36(4):321-7. doi: 10.1002/pd.4782. Epub 2016 Mar 8.
4
Pilot study of a novel multi-functional noninvasive prenatal test on fetus aneuploidy, copy number variation, and single-gene disorder screening.一项关于新型多功能无创产前检测胎儿非整倍体、拷贝数变异和单基因疾病筛查的试点研究。
Mol Genet Genomic Med. 2019 Apr;7(4):e00597. doi: 10.1002/mgg3.597. Epub 2019 Feb 14.
5
[A consensus recommendation for the interpretation and reporting of copy number variation and regions of homozygosity in prenatal genetic diagnosis].[关于产前基因诊断中拷贝数变异和纯合区域解释与报告的共识性建议]
Zhonghua Yi Xue Yi Chuan Xue Za Zhi. 2020 Jul 10;37(7):701-708. doi: 10.3760/cma.j.issn.1003-9406.2020.07.001.
6
Traditional karyotyping vs copy number variation sequencing for detection of chromosomal abnormalities associated with spontaneous miscarriage.传统核型分析与拷贝数变异测序在检测与自然流产相关的染色体异常中的应用比较
Ultrasound Obstet Gynecol. 2015 Oct;46(4):472-7. doi: 10.1002/uog.14849.
7
Natural Language Processing Accurately Calculates Adenoma and Sessile Serrated Polyp Detection Rates.自然语言处理准确计算腺瘤和无蒂锯齿状息肉的检出率。
Dig Dis Sci. 2018 Jul;63(7):1794-1800. doi: 10.1007/s10620-018-5078-4. Epub 2018 Apr 26.
8
BIOFILTER AS A FUNCTIONAL ANNOTATION PIPELINE FOR COMMON AND RARE COPY NUMBER BURDEN.生物过滤器作为常见和罕见拷贝数负担的功能注释管道。
Pac Symp Biocomput. 2016;21:357-68.
9
Natural language processing of radiology reports for identification of skeletal site-specific fractures.放射科报告的自然语言处理以识别骨骼部位特异性骨折。
BMC Med Inform Decis Mak. 2019 Apr 4;19(Suppl 3):73. doi: 10.1186/s12911-019-0780-5.
10
CNV-RF Is a Random Forest-Based Copy Number Variation Detection Method Using Next-Generation Sequencing.CNV-RF是一种基于随机森林的利用下一代测序技术进行拷贝数变异检测的方法。
J Mol Diagn. 2016 Nov;18(6):872-881. doi: 10.1016/j.jmoldx.2016.07.001. Epub 2016 Sep 3.

引用本文的文献

1
Clinical applications of large language models in medicine and surgery: A scoping review.大型语言模型在医学与外科中的临床应用:一项范围综述
J Int Med Res. 2025 Jul;53(7):3000605251347556. doi: 10.1177/03000605251347556. Epub 2025 Jul 4.
2
Whole exome sequencing is an alternative method in the diagnosis of mitochondrial DNA diseases.全外显子组测序是诊断线粒体 DNA 疾病的一种替代方法。
Mol Genet Genomic Med. 2022 Jun;10(6):e1943. doi: 10.1002/mgg3.1943. Epub 2022 Apr 7.
3
Cruxome: a powerful tool for annotating, interpreting and reporting genetic variants.

本文引用的文献

1
Technical standards for the interpretation and reporting of constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical Genome Resource (ClinGen).《常染色体拷贝数变异解释和报告的技术标准:美国医学遗传学与基因组学学会(ACMG)与临床基因组资源(ClinGen)的联合共识推荐》
Genet Med. 2020 Feb;22(2):245-257. doi: 10.1038/s41436-019-0686-8. Epub 2019 Nov 6.
2
Clinical utility of noninvasive prenatal screening for expanded chromosome disease syndromes.无创性产前筛查扩展染色体疾病综合征的临床应用
Genet Med. 2019 Sep;21(9):1998-2006. doi: 10.1038/s41436-019-0467-4. Epub 2019 Mar 4.
3
Cruxome:注释、解释和报告遗传变异的有力工具。
BMC Genomics. 2021 Jun 3;22(1):407. doi: 10.1186/s12864-021-07728-6.
ClinTAD: a tool for copy number variant interpretation in the context of topologically associated domains.
ClinTAD:拓扑关联域背景下拷贝数变异解释的工具。
J Hum Genet. 2019 May;64(5):437-443. doi: 10.1038/s10038-019-0573-9. Epub 2019 Feb 14.
4
The Belgian MicroArray Prenatal (BEMAPRE) database: A systematic nationwide repository of fetal genomic aberrations.比利时微阵列产前 (BEMAPRE) 数据库:一个系统的全国性胎儿基因组异常存储库。
Prenat Diagn. 2018 Dec;38(13):1120-1128. doi: 10.1002/pd.5373. Epub 2018 Nov 14.
5
Prospective chromosome analysis of 3429 amniocentesis samples in China using copy number variation sequencing.使用拷贝数变异测序对中国 3429 例羊水穿刺样本进行前瞻性染色体分析。
Am J Obstet Gynecol. 2018 Sep;219(3):287.e1-287.e18. doi: 10.1016/j.ajog.2018.05.030. Epub 2018 May 29.
6
Copy Number Variation Disorders.拷贝数变异疾病
Curr Genet Med Rep. 2017 Dec;5(4):183-190. doi: 10.1007/s40142-017-0129-2. Epub 2017 Oct 14.
7
Prenatal diagnosis by chromosomal microarray analysis.染色体微阵列分析的产前诊断。
Fertil Steril. 2018 Feb;109(2):201-212. doi: 10.1016/j.fertnstert.2018.01.005.
8
SeqCNV: a novel method for identification of copy number variations in targeted next-generation sequencing data.SeqCNV:一种用于在靶向新一代测序数据中识别拷贝数变异的新方法。
BMC Bioinformatics. 2017 Mar 3;18(1):147. doi: 10.1186/s12859-017-1566-3.
9
Microarrays in prenatal diagnosis.产前诊断中的微阵列技术。
Best Pract Res Clin Obstet Gynaecol. 2017 Jul;42:53-63. doi: 10.1016/j.bpobgyn.2017.01.003. Epub 2017 Jan 23.
10
Next generation sequencing: Coping with rare genetic diseases in China.下一代测序:应对中国的罕见遗传病
Intractable Rare Dis Res. 2016 Aug;5(3):140-4. doi: 10.5582/irdr.2016.01020.