Lin Yating, Li Haojun, Xiao Xu, Zhang Lei, Wang Kejia, Zhao Jingbo, Wang Minshu, Zheng Frank, Zhang Minwei, Yang Wenxian, Han Jiahuai, Yu Rongshan
School of Informatics, Xiamen University, Xiamen 361005, China.
National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen 361005, China.
Patterns (N Y). 2022 Feb 3;3(3):100440. doi: 10.1016/j.patter.2022.100440. eCollection 2022 Mar 11.
Understanding the immune cell abundance of cancer and other disease-related tissues has an important role in guiding disease treatments. Computational cell type proportion estimation methods have been previously developed to derive such information from bulk RNA sequencing data. Unfortunately, our results show that the performance of these methods can be seriously plagued by the mismatch between training data and real-world data. To tackle this issue, we propose the DAISM-DNN (XMBD: Xiamen Big Data, a biomedical open software initiative in the National Institute for Data Science in Health and Medicine, Xiamen University, China.) (denoted as DAISM-DNN) pipeline that trains a deep neural network (DNN) with dataset-specific training data populated from a certain amount of calibrated samples using DAISM, a novel data augmentation method with an mixing strategy. The evaluation results demonstrate that the DAISM-DNN pipeline outperforms other existing methods consistently and substantially for all the cell types under evaluation in real-world datasets.
STAR Protoc. 2022-9-16
Clin Chim Acta. 2018-11-15
Brief Bioinform. 2025-7-2
Comput Struct Biotechnol J. 2025-6-11
Brief Bioinform. 2025-5-1
PLoS Comput Biol. 2025-1-17
Proc Natl Acad Sci U S A. 2024-11-12
Bioinformatics. 2024-6-28
bioRxiv. 2024-4-4
BMC Bioinformatics. 2020-3-17
Nat Biotechnol. 2019-5-6
Nat Rev Genet. 2019-7
Nat Rev Immunol. 2019-6