June 25, 2015

Defining clinical vs. statistical significance

by Sean Sinden, Public Library of Science

The practice of null hypothesis testing has traditionally been used to interpret the results of studies in a wide variety of scientific fields. Briefly, significance testing involves the calculation of an outcome statistic, known as the P value. The P value represents the probability of finding a difference, by chance, between two sets of values larger than that which was observed, assuming no difference between the two sets of values. Conventionally, if that probability is less than 0.05 the outcome is deemed "statistically significant". If this sounds confusing, it's because it is!

P values are commonly misinterpreted and misused to answer research questions, but in actuality they fail to provide much information to the reader (1). This method of statistical analysis has been met with criticism throughout its history and increasingly so in the past few decades. The shortcomings of significance testing have been identified in both the academic and non-academic literature. I encourage anyone interested in the mechanistic limitations of significance testing to seek out such resources.

The P value doesn't begin to explain the importance of a study's outcome or the amount of the effect observed, though many researchers mistakenly believe it to.

Null hypothesis significance testing does not explain how much better – or worse – a group of individuals did compared with another group, just that there was a difference between the two groups (2). In short, significance testing only gives us statistical significance and says nothing about a study's practical significance or clinical applicability. There are numerous examples of a statistically significance result having no practical significance and vice-versa, two of which I have included below:

A primary HIV prevention medication known as Truvada – a combination of tenofovir and emtricitabine – was approved by the US Food and Drug Administration (FDA) in 2012. The two major side effects of this drug, taken as a daily oral dose, are a mild, non-progressive decrease in kidney function and a small decrease in bone mineral density (3). The effect on kidney function was found to be statistically significant (P = 0.02), but was considered to be "sub-clinical" by the authors (4). In a separate study, the same drug caused a small, but statistically significant decrease in bone mineral density from baseline, the clinical significance of which was unknown but was not associated with an increased occurrence of bone fracture during the study (5). Hence, Truvada does not appear to have any significant clinical side effects (see the Table below). However, in a cohort of 1603 at-risk individuals in the US, 185 people – roughly 12% of respondents – cited concern about side effects as their reason for not taking the medication (6). This is an alarming misconception for a disease that infects approximately 50,000 Americans and 2 million people globally per year (12,13), highlighting the need for better knowledge translation to improve public understanding of research outcomes.

On the other hand, research results that are not found to be statistically significant may have clinical applications. In the field of exercise physiology, a 1% change in performance might be considered clinically applicable.

To put this in perspective, a difference of 1% in the 100m sprint is the difference between Donovan Bailey's 1996 Olympic record of 9.84 seconds, and Asafa Powell's 2007 world record of 9.74 seconds.

Such a difference might not be deemed statistically significant in a typical research study, due to the difficulty of obtaining precision around such a small difference as a result of limited sample size or large variability in measurement (7). Another good example is from a study where researchers found a 2.9-minute improvement in a cycling time trial lasting 160 minutes with a supplement intervention. This difference was not statistically significant (8). However, if the researchers were to look at the results from an application standpoint, their conclusion may have been different: the 2.9 minute time improvement translated into a 1.8% increase in performance, suggesting that competitive athletes would probably benefit from the supplement intervention and that further investigation is warranted (7).

In recent years there has been a move away from using null hypothesis significance testing alone, or at all, in many scientific fields. Making inferences using magnitude-based measures such as confidence intervals, which is becoming increasingly popular, allows researchers to estimate the size of an effect in relation to clinical and practical importance (1,9,10). P-values may still have a place in our statistical assessment of study outcomes, but not should be the defining value for accepting or rejecting an outcome (11). Researchers, scientific writers, and the public need to look at the outcomes of research from a practical standpoint, and not only use the outdated and dichotomous view of statistical significance. Do not make conclusions about research outcomes solely based on significance testing without considering the practical significance and applicability of the observed effect.

Provided by Public Library of Science

This story is republished courtesy of PLOS Blogs: blogs.plos.org.

Citation: Defining clinical vs. statistical significance (2015, June 25) retrieved 17 July 2024 from https://medicalxpress.com/news/2015-06-clinical-statistical-significance.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Botulinum toxin-A beneficial in painful diabetic neuropathy

6 shares

Feedback to editors

Irritable bowel syndrome following gastroenteritis may last 4+ years in around half of those affected

9 hours ago

Study suggests reviewing current recommendations that discourage exercise before bed

9 hours ago

Children with conduct disorder show widespread brain structural differences, finds new international study

9 hours ago

World-first international guidelines weeds-out potentially critical scientific fraud

9 hours ago

Active commuting linked to lower risks of mental and physical ill health: Strongest benefits seen for cyclists

9 hours ago

Proof-of-principle study shows protein isoform inhibitors may hold the key to making opioids safer

10 hours ago

Automated appointment scheduling, reminder messages may improve postpartum health for those with chronic conditions

11 hours ago

Scientists find small regions of the brain can take micro-naps while the rest of the brain is awake and vice versa

11 hours ago

First health care device powered by body heat made possible by liquid based metals

12 hours ago

Large study confirms siblings of autistic children have 20% chance of autism

12 hours ago

Load comments (0)

Defining clinical vs. statistical significance

Irritable bowel syndrome following gastroenteritis may last 4+ years in around half of those affected

Study suggests reviewing current recommendations that discourage exercise before bed

Children with conduct disorder show widespread brain structural differences, finds new international study

World-first international guidelines weeds-out potentially critical scientific fraud

Active commuting linked to lower risks of mental and physical ill health: Strongest benefits seen for cyclists

Proof-of-principle study shows protein isoform inhibitors may hold the key to making opioids safer

Automated appointment scheduling, reminder messages may improve postpartum health for those with chronic conditions

Scientists find small regions of the brain can take micro-naps while the rest of the brain is awake and vice versa

First health care device powered by body heat made possible by liquid based metals

Large study confirms siblings of autistic children have 20% chance of autism

Botulinum toxin-A beneficial in painful diabetic neuropathy

Statins show promise to reduce major complications following lung surgery

Bendavia does not reduce scarring from angioplasty after heart attack

No real cost difference between types of tests that look for heart disease

Investigational therapy could attack cause of sickle cell crises

Bias pervades the scientific reporting of animal studies

World-first international guidelines weeds-out potentially critical scientific fraud

A treatment for metastasis? Using ferroptosis to attack migrating cancer cells

New study finds early detection of miRNAs in maternal blood may offer potential for predicting preeclampsia

Protein droplets likely don't cause Parkinson's, study suggests

All stem cell therapies are not created equally: Some don't have many stem cells at all, researchers find

New study used 3D-printed sensors to measure spinal cord malformations in embryos

Phys.org

Tech Xplore

Science X

Defining clinical vs. statistical significance

Irritable bowel syndrome following gastroenteritis may last 4+ years in around half of those affected

Study suggests reviewing current recommendations that discourage exercise before bed

Children with conduct disorder show widespread brain structural differences, finds new international study

World-first international guidelines weeds-out potentially critical scientific fraud

Active commuting linked to lower risks of mental and physical ill health: Strongest benefits seen for cyclists

Proof-of-principle study shows protein isoform inhibitors may hold the key to making opioids safer

Automated appointment scheduling, reminder messages may improve postpartum health for those with chronic conditions

Scientists find small regions of the brain can take micro-naps while the rest of the brain is awake and vice versa

First health care device powered by body heat made possible by liquid based metals

Large study confirms siblings of autistic children have 20% chance of autism

Related Stories

Botulinum toxin-A beneficial in painful diabetic neuropathy

Statins show promise to reduce major complications following lung surgery

Bendavia does not reduce scarring from angioplasty after heart attack

No real cost difference between types of tests that look for heart disease

Investigational therapy could attack cause of sickle cell crises

Bias pervades the scientific reporting of animal studies

Recommended for you

World-first international guidelines weeds-out potentially critical scientific fraud

A treatment for metastasis? Using ferroptosis to attack migrating cancer cells

New study finds early detection of miRNAs in maternal blood may offer potential for predicting preeclampsia

Protein droplets likely don't cause Parkinson's, study suggests

All stem cell therapies are not created equally: Some don't have many stem cells at all, researchers find

New study used 3D-printed sensors to measure spinal cord malformations in embryos

Newsletter sign up

Donate and enjoy an ad-free experience