Department of Gynecological Oncology, The First Hospital of Jilin University, Changchun, Jilin, People's Republic of China.
Department of Radiation Oncology, The First Hospital of Jilin University, Changchun, Jilin, People's Republic of China.
PLoS One. 2023 Aug 10;18(8):e0289971. doi: 10.1371/journal.pone.0289971. eCollection 2023.
Because breast cancer is a multistage disease that progresses through a sequence of genetic mutations, identifying the genes whose expression increases or decreases monotonically across pathologic stages can provide clues about how breast cancer initiates and advances. Using variational autoencoder (VAE) networks in conjunction with traditional statistical testing, we identified long non-coding RNAs (lncRNAs) that are monotonically differentially expressed in breast cancer. We then validated that the identified lncRNAs do show monotonic expression changes across stages. The proposed procedure identified 248 lncRNAs with monotonically decreasing expression and 115 with monotonically increasing expression. These correspond to 65 and 33 genes, respectively, that have unique known gene symbols. Some of them have been associated with breast cancer in previous studies. Furthermore, the pathways enriched among the target mRNAs of the identified lncRNAs include the Wnt signaling pathway, human papillomavirus (HPV) infection, and the Rap1 signaling pathway, which have been shown to play crucial roles in the initiation and development of breast cancer. Additionally, we trained a VAE model on the entire dataset and used a microarray dataset as the test set to assess the effectiveness of the identified lncRNAs. The results of this evaluation were deemed satisfactory. In conclusion, further experimental validation of these lncRNAs in a large study is warranted, and we highly recommend the proposed procedure.
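To make the idea of "monotonically differential expression" concrete, the following is a minimal sketch of one way to screen a single lncRNA for a monotonic trend across pathologic stages. It is not the authors' procedure: the abstract does not say which statistical test was used, and the VAE step is not reproduced here. The sketch assumes a Spearman rank correlation between stage and expression with a permutation p-value, and the function names (`average_ranks`, `spearman`, `monotonic_trend`) and the default `alpha` are hypothetical choices made for illustration.

```python
import random
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def spearman(stages, expression):
    """Spearman rho: Pearson correlation of the rank-transformed inputs."""
    return pearson(average_ranks(stages), average_ranks(expression))


def monotonic_trend(stages, expression, n_perm=2000, alpha=0.05, seed=0):
    """Label one gene 'increasing', 'decreasing', or 'none' across stages.

    Assumption: a two-sided permutation test on Spearman rho stands in for
    whatever statistical test the original study applied.
    """
    rng = random.Random(seed)
    rho = spearman(stages, expression)
    shuffled = list(expression)
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(spearman(stages, shuffled)) >= abs(rho) - 1e-12:
            extreme += 1
    p_value = (extreme + 1) / (n_perm + 1)
    if p_value >= alpha:
        label = "none"
    else:
        label = "increasing" if rho > 0 else "decreasing"
    return rho, p_value, label
```

In a genome-wide screen this function would be applied to each lncRNA separately, and the resulting p-values would need a multiple-testing correction before any list of increasing or decreasing lncRNAs is reported.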