Suppr超能文献

理解生物统计学解释

Understanding Biostatistics Interpretation

作者信息

Cash Elizabeth, Boktor Sameh W.

机构信息

Univ of Louisville School of Medicine

Penn State College of Medicine

Abstract

A basic understanding of statistical concepts is necessary to evaluate existing literature effectively. Statistical results do not, however, allow one to determine the clinical applicability of published findings. Statistical results can be used to make inferences about the probability of an event among a given population. Careful interpretation by the clinician is required to determine the value of the data as it applies to an individual or group of patients. Good research studies provide a clear, testable hypothesis, or prediction, about what they expect to find in the relationships being tested. The hypothesis is grounded in the empirical literature and based on clinical observations or expertise. It should be innovative in testing a novel relationship or confirming a prior study. There are at minimum 2 hypotheses in any study: The null hypothesis assumes there is no difference or no effect, and (2) the experimental or alternative hypothesis predicts an event or outcome. Often, the null hypothesis is not stated or is assumed. Hypotheses are tested by examining relationships between independent variables, or those thought to have some effect, and dependent variables, or those thought to be moved or affected by the independent variable. These are also called predictor and outcome variables, respectively. Statistics are used to test a study’s alternative or experimental hypothesis. Statistical models are fitted based on the dataset's nature, type, and other characteristics. Data typically involves measurement levels, which determine the type of statistical models that can be applied to test a hypothesis. Nominal data are those variables containing 2 or more categories without underlying order or value. Examples of nominal data include indicators of group membership, such as male or female. Ordinal data is nominal data that includes an order or rank but has undefined spacing between groups or levels, such as faculty ranking or educational level. Interval data is ordinal data with clearly defined spacing between the intervals and no absolute zero points. An example of interval data is the temperature scale, as the magnitude of the difference between intervals is consistent and measurable (one degree). Ratio data are interval data that include an absolute zero, such as the amount of student loan debt. Nominal and ordinal data are categorical, where entities are divided into distinct groups, whereas interval and ratio data are considered continuous, giving each observation a distinct score. It is up to the researcher to appropriately apply statistical models when testing hypotheses. Several approaches can be used to analyze the same dataset, and how this is accomplished depends heavily on the nature of the wording in a researcher’s hypothesis. Various statistical software packages exist, some available for free while others charge annual license fees that can be used to analyze data. Nearly all packages require the user to understand the types of data and the appropriate application of statistical models for each type. More sophisticated packages require the user to use the program’s proprietary coding language to perform hypothesis tests. These can require much time to learn, and errors can easily slip past the untrained eye. It is strongly recommended that unfamiliar users consult a statistical analyst when designing and running statistical models. Biostatistician consultations can occur at any time during a study, but earlier consultations are wise to prevent the introduction of accidental bias into study data and to help ensure accuracy and collection methods that are adequate to allow for tests of hypotheses.

摘要

要有效地评估现有文献,必须对统计概念有基本的了解。然而,统计结果并不能让人确定已发表研究结果的临床适用性。统计结果可用于推断给定人群中某一事件发生的概率。临床医生需要仔细解读,以确定这些数据对个体或一组患者的价值。优秀的研究提供了一个关于在被测试关系中预期发现的清晰、可检验的假设或预测。该假设基于实证文献,并基于临床观察或专业知识。它应该在测试新关系或证实先前研究方面具有创新性。任何研究至少有两个假设:零假设假定没有差异或没有影响;(2)实验性或备择假设预测一个事件或结果。通常,零假设没有被陈述或被假定。通过检查自变量(即那些被认为有某种影响的变量)和因变量(即那些被认为受自变量影响或变动的变量)之间的关系来检验假设。它们也分别被称为预测变量和结果变量。统计数据用于检验研究的备择假设或实验性假设。统计模型是根据数据集的性质、类型和其他特征来拟合的。数据通常涉及测量水平,这决定了可用于检验假设的统计模型的类型。名义数据是那些包含两个或更多类别且没有潜在顺序或值的变量。名义数据的例子包括组成员身份指标,如男性或女性。顺序数据是包含顺序或等级但组或级别之间间距未定义的名义数据,如教师排名或教育水平。区间数据是顺序数据,区间之间有明确界定的间距且没有绝对零点。区间数据的一个例子是温度刻度,因为区间之间的差异大小是一致且可测量的(一度)。比率数据是包含绝对零点的区间数据,如学生贷款债务金额。名义和顺序数据是分类数据,其中实体被分为不同的组,而区间和比率数据被认为是连续的,给每个观测值一个独特的分数。在检验假设时,由研究人员来适当地应用统计模型。可以使用几种方法来分析同一个数据集,而这如何完成在很大程度上取决于研究人员假设中的措辞性质。有各种统计软件包,有些是免费的,而有些则收取年度许可费,可用于分析数据。几乎所有的软件包都要求用户了解数据类型以及每种类型统计模型的适当应用。更复杂的软件包要求用户使用该程序的专有编码语言来进行假设检验。这些可能需要花费大量时间来学习,而且未经训练的人很容易忽略错误。强烈建议不熟悉的用户在设计和运行统计模型时咨询统计分析师。生物统计学家的咨询可以在研究的任何阶段进行,但尽早咨询是明智的,以防止将意外偏差引入研究数据,并有助于确保准确性以及足以进行假设检验的收集方法。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验