False Positive Paradox Calculator - Bayes' Theorem
Calculate the true probability of having a condition after a positive test. Enter prevalence, sensitivity, and specificity to reveal how low base rates create false positive paradoxes.
Enter the condition's prevalence in the population and the test's sensitivity and specificity, then click to see the real probability that a positive result is genuine.
False Positive Paradox Calculator
Find the true probability of a condition given a positive test result
About the False Positive Paradox Calculator
The false positive paradox — also known as the base-rate fallacy — is a counterintuitive statistical phenomenon where the number of false positive results from a test exceeds the number of true positive results, even when the test itself is highly accurate. The paradox arises whenever the condition being tested for is rare: the small pool of genuinely affected individuals generates far fewer true positives than the much larger pool of healthy individuals generates false alarms, even at a low false-positive rate.
The mathematics behind the paradox is Bayes' theorem. Given the prevalence P(D) of the condition, the test's sensitivity (true positive rate) Se, and its specificity (true negative rate) Sp, the positive predictive value (PPV) — the probability that a person who tested positive actually has the condition — is: PPV = (Se × P(D)) / (Se × P(D) + (1 − Sp) × (1 − P(D))). When P(D) is very small, the denominator is dominated by the false-positive term (1 − Sp) × (1 − P(D)), and even a minuscule false-positive rate applied to the large healthy population overwhelms the small true-positive count.
A classic illustration: suppose a disease affects 0.1% of the population (1 in 1,000), and a test has 99% sensitivity and 99% specificity. Out of 100,000 people, roughly 100 have the disease (0.1%) and 99,900 do not. The test correctly identifies 99 of the 100 sick people (true positives), but also incorrectly flags 1% of the 99,900 healthy people — about 999 false positives. So among all 1,098 people who test positive, only 99 actually have the disease: a PPV of just 9%. A 99% accurate test produces 91% false alarms because prevalence is so low.
This principle has profound implications in medicine, security, and technology. In medical screening programmes, mass testing for rare diseases — even with highly accurate tests — can produce large numbers of anxious patients who are actually healthy, wasting resources and causing harm. Public health guidelines therefore often restrict mass screening to higher-prevalence subpopulations where the PPV is clinically acceptable. In security screening, facial recognition systems with very high accuracy can still produce thousands of false positives when scanning millions of innocent travellers for a handful of suspects. In spam filtering, high false-positive rates erode trust even when the system catches nearly all spam.
The solution, both in medicine and in policy, is to apply Bayes' theorem and factor in the prior probability (prevalence) before interpreting any test result. A positive result on a screening test does not mean you have the disease — it means your probability of having the disease has increased from the population prevalence to the PPV, which may still be very low. Follow-up with a second, more specific confirmatory test then updates the probability again, typically to a much higher value that justifies clinical action. This calculator makes that reasoning transparent and interactive, allowing clinicians, researchers, and policy-makers to explore how changes in prevalence, sensitivity, and specificity shift the PPV and the population breakdown.
False Positive Paradox — Examples
Four scenarios illustrating how prevalence dominates test accuracy in determining the true predictive value.
| Input | PPV | Context |
|---|---|---|
| Prevalence 0.1%, Sensitivity 99%, Specificity 99% | PPV ≈ 9.0% | Rare disease screening. Despite a 99% accurate test, 91 of every 100 positive results are false alarms — the classic false positive paradox. |
| Prevalence 10%, Sensitivity 95%, Specificity 90% | PPV ≈ 51.4% | A more common condition. Higher prevalence dramatically improves the PPV; about half of positive results are genuine. |
| Prevalence 1%, Sensitivity 99.9%, Specificity 98% | PPV ≈ 33.5% | Spam filter analogy. Even an excellent filter still generates many false positives when spam is only 1% of all email. |
| Prevalence 0.01%, Sensitivity 99.5%, Specificity 99% | PPV ≈ 0.99% | Airport security scanner for a very rare threat. 99 of 100 alerts are false alarms, illustrating the needle-in-a-haystack problem. |
How to use the False Positive Paradox Calculator
- Enter the prevalence of the condition — the percentage of the population that has it. For example, if 1 in 200 people are affected, enter 0.5.
- Enter the test's sensitivity (true positive rate): the percentage of genuinely affected people the test correctly identifies as positive.
- Enter the test's specificity (true negative rate): the percentage of genuinely healthy people the test correctly identifies as negative.
- Click 'Calculate Probability'. The calculator applies Bayes' theorem to output the PPV (probability of having the condition given a positive test) and the NPV (probability of not having it given a negative test), plus a breakdown of true/false positives and negatives per 100,000 people.
- Adjust the values to explore how changes in prevalence, sensitivity, and specificity shift the PPV — you will see that prevalence has the biggest impact on whether a positive test is meaningful.
False Positive Paradox — FAQ
What is the false positive paradox?
The false positive paradox occurs when the number of false positive results from a test exceeds the number of true positives, even though the test is accurate. It happens because a low disease prevalence means the large healthy population generates many false alarms — more than the small diseased population generates true alarms — even at a low false-positive rate.
What is sensitivity and specificity?
Sensitivity (the true positive rate) is the probability that a person who has the condition will test positive. Specificity (the true negative rate) is the probability that a person who does not have the condition will test negative. A 95% sensitive test catches 95 out of 100 cases; a 90% specific test correctly clears 90 out of 100 healthy people.
What is PPV and why is it different from accuracy?
PPV (positive predictive value) is the probability that a positive test result reflects a true positive — that the person actually has the condition. Accuracy measures how often the test is correct overall. PPV depends heavily on prevalence, whereas accuracy does not. A test can be 99% accurate yet have a PPV of less than 10% when the condition is rare.
How can I improve the PPV of a test?
The most effective ways to increase PPV are to increase the test's specificity (reducing the false-positive rate), to restrict testing to higher-prevalence subpopulations where the prior probability is already elevated, and to apply sequential confirmatory testing. In sequential testing, a positive first-screen result becomes the new 'prevalence' input for a second, more specific confirmatory test, which updates the probability to a much higher and more clinically actionable value.
What does NPV tell me?
NPV (negative predictive value) is the probability that a person who tests negative truly does not have the condition. For rare diseases, NPV is typically very high: when prevalence is 0.1%, nearly every negative result is a true negative. A high NPV means a negative test result is very reassuring. NPV falls as prevalence increases.
Why does prevalence matter more than test accuracy?
At very low prevalences, even a tiny false-positive rate applied to the enormous healthy population dwarfs the true positives from the small diseased population. Doubling specificity from 95% to 97.5% only halves the false positives per person, but doubling the prevalence doubles the number of true positives — so prevalence drives PPV much more strongly than accuracy does.