• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

胚胎学家与人工智能算法之间的胚胎分级一致性。

Embryo ranking agreement between embryologists and artificial intelligence algorithms.

作者信息

Zaninovic Nikica, Sierra Jose T, Malmsten Jonas E, Rosenwaks Zev

机构信息

Weill Cornell Medicine, Ronald O. Perelman and Claudia Cohen Center for Reproductive Medicine, New York, New York.

QED Analytics, Princeton, New Jersey.

出版信息

F S Sci. 2024 Feb;5(1):50-57. doi: 10.1016/j.xfss.2023.10.002. Epub 2023 Oct 14.

DOI:10.1016/j.xfss.2023.10.002
PMID:37820865
Abstract

OBJECTIVE

To evaluate the degree of agreement of embryo ranking between embryologists and eight artificial intelligence (AI) algorithms.

DESIGN

Retrospective study.

PATIENT(S): A total of 100 cycles with at least eight embryos were selected from the Weill Cornell Medicine database. For each embryo, the full-length time-lapse (TL) videos, as well as a single embryo image at 120 hours, were given to five embryologists and eight AI algorithms for ranking.

INTERVENTION(S): None.

MAIN OUTCOME MEASURE(S): Kendall rank correlation coefficient (Kendall's τ).

RESULT(S): Embryologists had a high degree of agreement in the overall ranking of 100 cycles with an average Kendall's tau (K-τ) of 0.70, slightly lower than the interembryologist agreement when using a single image or video (average K-τ = 0.78). Overall agreement between embryologists and the AI algorithms was significantly lower (average K-τ = 0.53) and similar to the observed low inter-AI algorithm agreement (average K-τ = 0.47). Notably, two of the eight algorithms had a very low agreement with other ranking methodologies (average K-τ = 0.05) and between each other (K-τ = 0.01). The average agreement in selecting the best-quality embryo (1/8 in 100 cycles with an expected agreement by random chance of 12.5%; confidence interval [CI]95: 6%-19%) was 59.5% among embryologists and 40.3% for six AI algorithms. The incidence of the agreement for the two algorithms with the low overall agreement was 11.7%. Agreement on selecting the same top two embryos/cycle (expected agreement by random chance corresponds to 25.0%; CI95: 17%-32%) was 73.5% among embryologists and 56.0% among AI methods excluding two discordant algorithms, which had an average agreement of 24.4%, the expected range of agreement by random chance. Intraembryologist ranking agreement (single image vs. video) was 71.7% and 77.8% for single and top two embryos, respectively. Analysis of average raw scores indicated that cycles with low diversity of embryo quality generally resulted in a lower overall agreement between the methods (embryologists and AI models).

CONCLUSION(S): To our knowledge, this is the first study that evaluates the level of agreement in ranking embryo quality between different AI algorithms and embryologists. The different concordance methods were consistent and indicated that the highest agreement was intraembryologist agreement, followed by interembryologist agreement. In contrast, the agreement between some of the AI algorithms and embryologists was similar to the inter-AI algorithm agreement, which also showed a wide range of pairwise concordance. Specifically, two AI models showed intra- and interagreement at the level expected from random selection.

摘要

目的

评估胚胎学家与八种人工智能(AI)算法在胚胎排名上的一致程度。

设计

回顾性研究。

患者

从威尔康奈尔医学院数据库中选取了总共100个周期,每个周期至少有8个胚胎。对于每个胚胎,将全长延时(TL)视频以及120小时时的单个胚胎图像提供给五名胚胎学家和八种AI算法进行排名。

干预措施

无。

主要观察指标

肯德尔等级相关系数(肯德尔τ系数)。

结果

胚胎学家在100个周期的总体排名中具有高度一致性,平均肯德尔τ系数(K-τ)为0.70,略低于使用单个图像或视频时胚胎学家之间的一致性(平均K-τ = 0.78)。胚胎学家与AI算法之间的总体一致性显著较低(平均K-τ = 0.53),与观察到的AI算法之间较低的一致性相似(平均K-τ = 0.47)。值得注意的是,八种算法中的两种与其他排名方法的一致性非常低(平均K-τ = 0.05),且它们彼此之间的一致性(K-τ = 0.01)也很低。在选择最佳质量胚胎方面(100个周期中有1/8,随机选择的预期一致性为12.5%;95%置信区间[CI]:6%-19%),胚胎学家之间的平均一致性为59.5%,六种AI算法为40.3%。总体一致性较低的两种算法的一致性发生率为11.7%。在选择相同的前两个胚胎/周期方面(随机选择的预期一致性对应于25.0%;CI95:17%-32%),胚胎学家之间为73.5%,排除两种不一致算法后的AI方法为56.0%,这两种算法的平均一致性为24.4%,即随机选择的预期一致范围。胚胎学家内部的排名一致性(单个图像与视频)对于单个胚胎和前两个胚胎分别为71.7%和77.8%。对平均原始分数的分析表明,胚胎质量多样性低的周期通常导致方法之间(胚胎学家和AI模型)的总体一致性较低。

结论

据我们所知,这是第一项评估不同AI算法与胚胎学家在胚胎质量排名上的一致程度的研究。不同的一致性方法是一致的,表明最高的一致性是胚胎学家内部的一致性,其次是胚胎学家之间的一致性。相比之下,一些AI算法与胚胎学家之间的一致性与AI算法之间的一致性相似,后者也显示出广泛的成对一致性。具体而言,两个AI模型在随机选择预期的水平上显示出内部和相互之间的一致性。

相似文献

1
Embryo ranking agreement between embryologists and artificial intelligence algorithms.胚胎学家与人工智能算法之间的胚胎分级一致性。
F S Sci. 2024 Feb;5(1):50-57. doi: 10.1016/j.xfss.2023.10.002. Epub 2023 Oct 14.
2
Embryologist agreement when assessing blastocyst implantation probability: is data-driven prediction the solution to embryo assessment subjectivity?胚胎学家在评估囊胚着床概率时的意见一致:数据驱动的预测是否是解决胚胎评估主观性的方法?
Hum Reprod. 2022 Sep 30;37(10):2275-2290. doi: 10.1093/humrep/deac171.
3
A hybrid artificial intelligence model leverages multi-centric clinical data to improve fetal heart rate pregnancy prediction across time-lapse systems.一种混合人工智能模型利用多中心临床数据,改善跨时间 lapse 系统的胎儿心率妊娠预测。
Hum Reprod. 2023 Apr 3;38(4):596-608. doi: 10.1093/humrep/dead023.
4
Inter-observer and intra-observer agreement between embryologists during selection of a single Day 5 embryo for transfer: a multicenter study.胚胎学家在挑选单个第5天胚胎进行移植时的观察者间和观察者内一致性:一项多中心研究。
Hum Reprod. 2017 Feb;32(2):307-314. doi: 10.1093/humrep/dew330. Epub 2016 Dec 28.
5
Should there be an "AI" in TEAM? Embryologists selection of high implantation potential embryos improves with the aid of an artificial intelligence algorithm.是否应该在 TEAM 中加入“AI”?胚胎学家在人工智能算法的辅助下选择具有高着床潜力的胚胎的能力有所提高。
J Assist Reprod Genet. 2021 Oct;38(10):2663-2670. doi: 10.1007/s10815-021-02318-7. Epub 2021 Sep 17.
6
Development of an artificial intelligence-based assessment model for prediction of embryo viability using static images captured by optical light microscopy during IVF.开发一种基于人工智能的评估模型,用于通过体外受精期间光学显微镜拍摄的静态图像预测胚胎活力。
Hum Reprod. 2020 Apr 28;35(4):770-784. doi: 10.1093/humrep/deaa013.
7
Evaluation of the Clinical Efficacy and Trust in AI-Assisted Embryo Ranking: Survey-Based Prospective Study.评价 AI 辅助胚胎评分的临床疗效和信任度:基于调查的前瞻性研究。
J Med Internet Res. 2024 Jun 3;26:e52637. doi: 10.2196/52637.
8
Should we freeze it? Agreement on fate of borderline blastocysts is poor and does not improve with a modified blastocyst grading system.我们应该冷冻它吗?对于边缘性囊胚的命运,人们的意见分歧很大,而且使用改良的囊胚分级系统也无法改善这种情况。
Hum Reprod. 2020 May 1;35(5):1045-1053. doi: 10.1093/humrep/deaa060.
9
Embryo Ranking Intelligent Classification Algorithm (ERICA): artificial intelligence clinical assistant predicting embryo ploidy and implantation.胚胎分级智能分类算法(ERICA):人工智能临床助手预测胚胎倍性和着床。
Reprod Biomed Online. 2020 Oct;41(4):585-593. doi: 10.1016/j.rbmo.2020.07.003. Epub 2020 Jul 5.
10
Embryo selection through artificial intelligence versus embryologists: a systematic review.通过人工智能与胚胎学家进行胚胎选择:一项系统综述。
Hum Reprod Open. 2023 Aug 15;2023(3):hoad031. doi: 10.1093/hropen/hoad031. eCollection 2023.