• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用稀疏特征和术语-术语相互作用对果蝇基因表达模式进行注释

Drosophila Gene Expression Pattern Annotation Using Sparse Features and Term-Term Interactions.

作者信息

Ji Shuiwang, Yuan Lei, Li Ying-Xin, Zhou Zhi-Hua, Kumar Sudhir, Ye Jieping

机构信息

Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University, Tempe, AZ 85287.

出版信息

KDD. 2009 Jun 28;2009:407-415. doi: 10.1145/1557019.1557068.

DOI:10.1145/1557019.1557068
PMID:21614142
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3100181/
Abstract

The Drosophila gene expression pattern images document the spatial and temporal dynamics of gene expression and they are valuable tools for explicating the gene functions, interaction, and networks during Drosophila embryogenesis. To provide text-based pattern searching, the images in the Berkeley Drosophila Genome Project (BDGP) study are annotated with ontology terms manually by human curators. We present a systematic approach for automating this task, because the number of images needing text descriptions is now rapidly increasing. We consider both improved feature representation and novel learning formulation to boost the annotation performance. For feature representation, we adapt the bag-of-words scheme commonly used in visual recognition problems so that the image group information in the BDGP study is retained. Moreover, images from multiple views can be integrated naturally in this representation. To reduce the quantization error caused by the bag-of-words representation, we propose an improved feature representation scheme based on the sparse learning technique. In the design of learning formulation, we propose a local regularization framework that can incorporate the correlations among terms explicitly. We further show that the resulting optimization problem admits an analytical solution. Experimental results show that the representation based on sparse learning outperforms the bag-of-words representation significantly. Results also show that incorporation of the term-term correlations improves the annotation performance consistently.

摘要

果蝇基因表达模式图像记录了基因表达的时空动态,是阐明果蝇胚胎发育过程中基因功能、相互作用和网络的宝贵工具。为了提供基于文本的模式搜索,伯克利果蝇基因组计划(BDGP)研究中的图像由人工策展人手动用本体术语进行注释。由于需要文本描述的图像数量现在正在迅速增加,我们提出了一种自动化此任务的系统方法。我们考虑改进特征表示和新颖的学习公式来提高注释性能。对于特征表示,我们采用视觉识别问题中常用的词袋方案,以便保留BDGP研究中的图像组信息。此外,来自多个视图的图像可以自然地整合到这种表示中。为了减少词袋表示引起的量化误差,我们提出了一种基于稀疏学习技术的改进特征表示方案。在学习公式的设计中,我们提出了一个局部正则化框架,该框架可以明确纳入术语之间的相关性。我们进一步表明,由此产生的优化问题允许解析解。实验结果表明,基于稀疏学习的表示明显优于词袋表示。结果还表明,纳入术语间相关性可持续提高注释性能。

相似文献

1
Drosophila Gene Expression Pattern Annotation Using Sparse Features and Term-Term Interactions.利用稀疏特征和术语-术语相互作用对果蝇基因表达模式进行注释
KDD. 2009 Jun 28;2009:407-415. doi: 10.1145/1557019.1557068.
2
A bag-of-words approach for Drosophila gene expression pattern annotation.一种用于果蝇基因表达模式注释的词袋法。
BMC Bioinformatics. 2009 Apr 21;10:119. doi: 10.1186/1471-2105-10-119.
3
Image-level and group-level models for Drosophila gene expression pattern annotation.基于果蝇基因表达模式注释的图像级和组级模型。
BMC Bioinformatics. 2013 Dec 3;14:350. doi: 10.1186/1471-2105-14-350.
4
Joint stage recognition and anatomical annotation of Drosophila gene expression patterns.果蝇基因表达模式的联合阶段识别和解剖注释。
Bioinformatics. 2012 Jun 15;28(12):i16-24. doi: 10.1093/bioinformatics/bts220.
5
Drosophila Gene Expression Pattern Annotation through Multi-Instance Multi-Label Learning.通过多示例多标签学习进行果蝇基因表达模式注释。
IJCAI (U S). 2009 Jan 1;2009:1445-1450.
6
FlyIT: Drosophila Embryogenesis Image Annotation based on Image Tiling and Convolutional Neural Networks.FlyIT:基于图像分块和卷积神经网络的果蝇胚胎发育图像标注
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):194-204. doi: 10.1109/TCBB.2019.2935723. Epub 2021 Feb 3.
7
Drosophila gene expression pattern annotation through multi-instance multi-label learning.通过多实例多标签学习进行果蝇基因表达模式注释。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):98-112. doi: 10.1109/TCBB.2011.73. Epub 2011 Apr 15.
8
Learning sparse representations for fruit-fly gene expression pattern image annotation and retrieval.学习果蝇基因表达模式图像注释和检索的稀疏表示。
BMC Bioinformatics. 2012 May 23;13:107. doi: 10.1186/1471-2105-13-107.
9
GINI: from ISH images to gene interaction networks.GINI:从图像到基因交互网络。
PLoS Comput Biol. 2013;9(10):e1003227. doi: 10.1371/journal.pcbi.1003227. Epub 2013 Oct 10.
10
AnnoFly: annotating Drosophila embryonic images based on an attention-enhanced RNN model.AnnoFly:基于注意力增强 RNN 模型的 Drosophila 胚胎图像标注。
Bioinformatics. 2019 Aug 15;35(16):2834-2842. doi: 10.1093/bioinformatics/bty1064.

引用本文的文献

1
PPML-Omics: A privacy-preserving federated machine learning method protects patients' privacy in omic data.PPML-Omics:一种保护隐私的联邦机器学习方法,保护了组学数据中患者的隐私。
Sci Adv. 2024 Feb 2;10(5):eadh8601. doi: 10.1126/sciadv.adh8601. Epub 2024 Jan 31.
2
Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis.基于深度模型的生物图像分析迁移与多任务学习
IEEE Trans Big Data. 2020 Jun;6(2):322-333. doi: 10.1109/tbdata.2016.2573280. Epub 2016 May 30.
3
Predicting gene regulatory interactions based on spatial gene expression data and deep learning.基于空间基因表达数据和深度学习预测基因调控相互作用。
PLoS Comput Biol. 2019 Sep 17;15(9):e1007324. doi: 10.1371/journal.pcbi.1007324. eCollection 2019 Sep.
4
The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data.多任务学习在利用电子健康记录数据进行表型分析中的有效性。
Pac Symp Biocomput. 2019;24:18-29.
5
Image-level and group-level models for Drosophila gene expression pattern annotation.基于果蝇基因表达模式注释的图像级和组级模型。
BMC Bioinformatics. 2013 Dec 3;14:350. doi: 10.1186/1471-2105-14-350.
6
Learning Incoherent Sparse and Low-Rank Patterns from Multiple Tasks.从多个任务中学习非相干稀疏和低秩模式。
ACM Trans Knowl Discov Data. 2012 Feb 1;5(4):22. doi: 10.1145/2086737.2086742.
7
A convex formulation for learning a shared predictive structure from multiple tasks.从多个任务中学习共享预测结构的凸优化方法。
IEEE Trans Pattern Anal Mach Intell. 2013 May;35(5):1025-38. doi: 10.1109/TPAMI.2012.189.
8
Applying tensor-based morphometry to parametric surfaces can improve MRI-based disease diagnosis.基于张量的形态测量学应用于参数曲面可以提高基于 MRI 的疾病诊断。
Neuroimage. 2013 Jul 1;74:209-30. doi: 10.1016/j.neuroimage.2013.02.011. Epub 2013 Feb 20.
9
Joint stage recognition and anatomical annotation of Drosophila gene expression patterns.果蝇基因表达模式的联合阶段识别和解剖注释。
Bioinformatics. 2012 Jun 15;28(12):i16-24. doi: 10.1093/bioinformatics/bts220.
10
Learning sparse representations for fruit-fly gene expression pattern image annotation and retrieval.学习果蝇基因表达模式图像注释和检索的稀疏表示。
BMC Bioinformatics. 2012 May 23;13:107. doi: 10.1186/1471-2105-13-107.

本文引用的文献

1
Drosophila Gene Expression Pattern Annotation through Multi-Instance Multi-Label Learning.通过多示例多标签学习进行果蝇基因表达模式注释。
IJCAI (U S). 2009 Jan 1;2009:1445-1450.
2
A bag-of-words approach for Drosophila gene expression pattern annotation.一种用于果蝇基因表达模式注释的词袋法。
BMC Bioinformatics. 2009 Apr 21;10:119. doi: 10.1186/1471-2105-10-119.
3
Developmental Stage Annotation of Drosophila Gene Expression Pattern Images via an Entire Solution Path for LDA.通过潜在狄利克雷分配(LDA)的完整求解路径对果蝇基因表达模式图像进行发育阶段注释
ACM Trans Knowl Discov Data. 2008 Mar;2(1). doi: 10.1145/1342320.1342324.
4
Automated annotation of Drosophila gene expression patterns using a controlled vocabulary.使用受控词汇对果蝇基因表达模式进行自动注释。
Bioinformatics. 2008 Sep 1;24(17):1881-8. doi: 10.1093/bioinformatics/btn347. Epub 2008 Jul 16.
5
Randomized clustering forests for image classification.用于图像分类的随机聚类森林
IEEE Trans Pattern Anal Mach Intell. 2008 Sep;30(9):1632-46. doi: 10.1109/TPAMI.2007.70822.
6
A quantitative spatiotemporal atlas of gene expression in the Drosophila blastoderm.果蝇囊胚层基因表达的定量时空图谱。
Cell. 2008 Apr 18;133(2):364-74. doi: 10.1016/j.cell.2008.01.053.
7
Global analysis of mRNA localization reveals a prominent role in organizing cellular architecture and function.mRNA定位的全局分析揭示了其在构建细胞结构和功能方面的重要作用。
Cell. 2007 Oct 5;131(1):174-87. doi: 10.1016/j.cell.2007.08.003.
8
Global analysis of patterns of gene expression during Drosophila embryogenesis.果蝇胚胎发育过程中基因表达模式的全局分析。
Genome Biol. 2007;8(7):R145. doi: 10.1186/gb-2007-8-7-r145.
9
Classification of Drosophila embryonic developmental stage range based on gene expression pattern images.基于基因表达模式图像的果蝇胚胎发育阶段范围分类
Comput Syst Bioinformatics Conf. 2006:293-8.
10
Performance evaluation of local descriptors.局部描述符的性能评估
IEEE Trans Pattern Anal Mach Intell. 2005 Oct;27(10):1615-30. doi: 10.1109/TPAMI.2005.188.