In statistical computations, intuition can be very misleading

A Bad Penny Always Shows Up

To simulate what would happen with a bad (deformed, bent) penny that shows tails more often than heads, you could add more sides to the coin in line 5 of Listing 1:

my @sides  = qw( H H H T T T T );

Out of seven tosses, the coin would then come up heads three times and tails four times on average; the script would correspondingly compute the p-value (with some random deviation) and produce output such as:

Rounds:   1000
Tails:    565
p-value:  0.0351

The p-value is approximately 0.035, or about 3.5 percent (i.e., below the specified 5 percent threshold for the significance value). This seriously threatens the null hypothesis that the coin should land on both sides with the same likelihood.
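The bent-penny simulation described above can be sketched in a few lines of Perl. This is only an illustration under stated assumptions, not the article's Listing 1: the function name `count_tails` and the round count are hypothetical, and the sketch only counts tails without computing a p-value.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Toss a "coin" whose faces are listed in @$sides and count the tails.
# Listing a face several times skews the odds, as with the bent penny.
# (count_tails is a hypothetical helper, not part of Listing 1.)
sub count_tails {
    my ( $rounds, $sides ) = @_;
    my $tails = 0;
    for ( 1 .. $rounds ) {
        # rand(@$sides) yields a float in [0, number of faces); int() truncates it
        my $face = $sides->[ int rand @$sides ];
        $tails++ if $face eq 'T';
    }
    return $tails;
}

my @sides  = qw( H H H T T T T );    # three heads, four tails
my $rounds = 1000;                   # assumed number of tosses
my $tails  = count_tails( $rounds, \@sides );

printf "Rounds: %d\n", $rounds;
printf "Tails:  %d (expected about %.0f)\n", $tails, $rounds * 4 / 7;
```

Because each face is picked with equal probability, four T entries out of seven give tails a chance of 4/7 per toss, so roughly 571 tails are expected in 1,000 rounds.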

Careful with Your Diagnosis

Experiments that test new medications or treatment procedures for their efficacy define the null hypothesis as "The medication has no effect," set the significance value to around 5 percent, and then raise the alarm if the p-value drops below this threshold in the experiment, that is, if there are suddenly good reasons for assuming that the null hypothesis is incorrect. In this case, the miracle cure tested really has shown some positive treatment effects with a high degree of probability.

According to Alex Reinhart's recently published book on statistical blunders [3], however, studies commonly misinterpret the significance value after the fact, thus giving patients false hope or causing them to panic needlessly. These base rate fallacies [5] occur in the context of conditional probabilities: A given result already has a certain a priori probability, and the computation needs to take it into account.
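In symbols, the a priori probability enters through Bayes' theorem. The notation is an assumption made for illustration and is not taken from Reinhart's book: $D$ stands for "has the condition" and $+$ for "tests positive."

```latex
P(D \mid +) = \frac{P(+ \mid D)\, P(D)}{P(+ \mid D)\, P(D) + P(+ \mid \neg D)\, P(\neg D)}
```

The base rate $P(D)$ appears in both the numerator and the denominator; if it is small, the false positive term $P(+ \mid \neg D)\, P(\neg D)$ can dominate the denominator, even for a fairly accurate test.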

The following experiment from Reinhart's book shows what for many people is an amazing deviation between popular opinion and precise science: A mammography returns the correct diagnosis for patients with breast cancer with a 90 percent probability. However, the test comes up with a diagnosis of breast cancer for approximately 7 percent of healthy patients, so that – in the case of positive findings – further diagnostic procedures are necessary for clarification. The question is now: Is this test suitable for effectively screening the population? If the mammography detects breast cancer, how great is the probability that a randomly selected woman really needs treatment?

Most people will think about this for a while, subtract the 7 percent false positive rate from 100 percent in their heads, and end up with a result of around 93 percent. However, this assumption is totally wrong. What is the correct result? Maybe 70 percent? Or even 50 percent? The amazing truth is that when a mammography performed on a randomly selected woman indicates breast cancer, the diagnosis is correct in only around 9 percent of the cases.

Amazing Statistics

If your thought experiment led you to believe that the test accuracy was higher than it actually is, you probably fell into the typical base rate fallacy trap and forgot to consider in your calculations that, on average, only 0.8 percent of women in a given population have breast cancer.

Of 1,000 women, 992 thus do not have breast cancer, and in 7 percent of these cases, mammography will return the wrong diagnosis; that is, approximately 70 of the women tested will be given incorrect results. Of the eight women with breast cancer, the test correctly diagnoses about seven (90 percent), which means that, of the 77 breast cancer findings after mammography, only seven are correct (i.e., approximately 9 percent). Given this low accuracy rate, it is inadvisable to perform across-the-board tests; instead, only certain high-risk groups should be tested, where an imprecise test is still far better than no test at all.
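The same arithmetic can be checked with a short Perl sketch. The function name `positive_predictive_value` and its parameter names are chosen here for illustration; the three input figures are the ones quoted in the text.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Probability that a positive finding is a true positive (Bayes' theorem).
# (Hypothetical helper; names are illustrative.)
sub positive_predictive_value {
    my ( $base_rate, $sensitivity, $false_positive_rate ) = @_;
    my $true_pos  = $base_rate * $sensitivity;                    # sick and detected
    my $false_pos = ( 1 - $base_rate ) * $false_positive_rate;    # healthy but flagged
    return $true_pos / ( $true_pos + $false_pos );
}

# Figures from the text: 0.8% base rate, 90% detection rate, 7% false positives
my $ppv = positive_predictive_value( 0.008, 0.90, 0.07 );
printf "Positive finding is correct in %.1f%% of cases\n", 100 * $ppv;
```

Without rounding to whole patients, the sketch prints 9.4 percent rather than the 7 in 77 (9.1 percent) derived above; both agree with the "approximately 9 percent" figure.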
