

BiDaS: a web-based Monte Carlo BioData Simulator based on sequence/feature characteristics.

Author information

Biomedical Informatics Unit, Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou, 115 27 Athens, Greece.

Publication information

Nucleic Acids Res. 2013 Jul;41(Web Server issue):W582-6. doi: 10.1093/nar/gkt420. Epub 2013 May 28.

Abstract

BiDaS is a web application that generates massive Monte Carlo simulated sequence or numerical feature data sets (e.g. dinucleotide content, composition, transition, distribution properties) from small user-provided data sets. The BiDaS server lets users analyze their data and generate large amounts of: (i) simulated DNA/RNA and amino acid (AA) sequences whose sequence and/or extracted feature distributions are practically identical to those of the original data; (ii) simulated numerical features that present identical distributions while preserving the exact 2D or 3D between-feature correlations observed in the original data sets. The server can project the provided sequences onto multidimensional feature spaces based on: (i) 38 DNA/RNA features describing conformational and physicochemical nucleotide sequence features from the B-DNA-VIDEO database, (ii) 122 DNA/RNA features based on conformational and thermodynamic dinucleotide properties from the DiProDB database and (iii) the pseudo-amino acid composition of the initial sequences. To the best of our knowledge, this is the first available web server that allows users to generate vast numbers of biological data sets with realistic characteristics while keeping between-feature associations. These data sets can be used for a wide variety of current biological problems, such as the in-depth study of gene, transcript, peptide and protein groups/families, the creation of large data sets from just a few available members, and the strengthening of machine learning classifiers. All simulations use advanced Monte Carlo sampling techniques. The BiDaS web application is available at http://bioserver-3.bioacademy.gr/Bioserver/BiDaS/.
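
The abstract does not specify which sampling scheme BiDaS uses, so the following is only a minimal sketch of the general idea of expanding a few sequences into many simulated ones with similar dinucleotide transition statistics. It assumes a first-order Markov chain as a stand-in technique. The function names (`fit_markov`, `simulate`, `simulate_many`) and the toy input are hypothetical and are not part of BiDaS.

```python
import random
from collections import Counter, defaultdict


def fit_markov(seqs):
    """Count start symbols and first-order (dinucleotide) transitions."""
    seqs = [s for s in seqs if s]
    if not seqs:
        raise ValueError("need at least one non-empty sequence")
    starts = Counter(s[0] for s in seqs)
    trans = defaultdict(Counter)
    for s in seqs:
        for a, b in zip(s, s[1:]):
            trans[a][b] += 1
    return starts, trans


def simulate(starts, trans, length, rng):
    """Draw one sequence by sampling each symbol given the previous one."""
    symbols, weights = zip(*starts.items())
    seq = [rng.choices(symbols, weights=weights)[0]]
    while len(seq) < length:
        nxt = trans.get(seq[-1])
        if not nxt:
            # Assumption: if a symbol was never followed by anything in
            # the input, restart from the start-symbol distribution.
            nxt = starts
        syms, w = zip(*nxt.items())
        seq.append(rng.choices(syms, weights=w)[0])
    return "".join(seq)


def simulate_many(seqs, n, seed=0):
    """Generate n sequences; lengths are resampled from the input lengths."""
    rng = random.Random(seed)
    starts, trans = fit_markov(seqs)
    lengths = [len(s) for s in seqs if s]
    return [simulate(starts, trans, rng.choice(lengths), rng) for _ in range(n)]


if __name__ == "__main__":
    toy = ["ACGTACGT", "ACGGACGT", "TTGACGTA"]  # hypothetical input
    for s in simulate_many(toy, 5, seed=1):
        print(s)
```

A chain of this kind reproduces only nearest-neighbour statistics. Preserving the between-feature correlations described in the abstract would need a joint sampling scheme over the extracted features, which this sketch does not attempt.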


