Free A/B Test Significance Calculator

You ran an A/B test, but can you trust the result? This free calculator checks in seconds whether the difference between variant A and variant B is statistically significant. Enter the visitor counts and conversions for both variants, choose your confidence level, and the calculator instantly tells you whether the result is reliable or whether you need more data.

What does "statistically significant" mean?

Statistically significant means the observed difference between variant A and variant B is unlikely to be due to chance alone and instead reflects a real effect.

The confidence level tells you how certain you want to be. At 95% confidence, this means: if there were truly no difference between the variants and you ran this test 100 times under identical conditions, you would expect to see a difference this large in only about 5 of them. A significant result is therefore unlikely to be pure chance.

The most common mistake: declaring a winner as soon as variant B looks better without checking for significance. That leads to decisions based on noise instead of signal.

What this calculator computes: The two-tailed Z-test for two proportions. It compares the conversion rates of both variants and checks whether the difference exceeds your chosen significance threshold.
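To make that concrete, here is a minimal sketch of a pooled two-proportion z-test in Python (an illustration using the standard library, not the calculator's actual code):

```python
from math import sqrt
from statistics import NormalDist

def two_proportion_z_test(visitors_a, conversions_a, visitors_b, conversions_b):
    """Two-tailed z-test for two proportions with a pooled standard error."""
    rate_a = conversions_a / visitors_a
    rate_b = conversions_b / visitors_b
    # Pooled conversion rate under the null hypothesis "no difference"
    pooled = (conversions_a + conversions_b) / (visitors_a + visitors_b)
    se = sqrt(pooled * (1 - pooled) * (1 / visitors_a + 1 / visitors_b))
    z = (rate_b - rate_a) / se
    # Two-tailed: probability of a |z| at least this extreme under the null
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

# Hypothetical example: 1,000 visitors per variant, 100 vs. 130 conversions
z, p = two_proportion_z_test(1000, 100, 1000, 130)
print(f"z = {z:.2f}, p = {p:.4f}")  # significant at 95% confidence if p < 0.05
```

A p-value below your chosen threshold (0.05 for 95% confidence) means the difference clears the bar for significance.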

How do I read the result?

The calculator gives you three values:

Relative uplift: By how much does variant B outperform or underperform variant A? A relative uplift of +12% means variant B converts 12% better relative to A, not 12 percentage points better.
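The distinction is easy to see in a small hypothetical calculation (the rates below are made up for illustration):

```python
rate_a = 0.10   # variant A converts at 10%
rate_b = 0.112  # variant B converts at 11.2%

relative_uplift = (rate_b - rate_a) / rate_a  # +12% relative to A
absolute_uplift = rate_b - rate_a             # +1.2 percentage points

print(f"relative: {relative_uplift:+.1%}, absolute: {absolute_uplift:+.1%} points")
```

So a +12% relative uplift on a 10% baseline puts variant B at 11.2%, not at 22%.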

p-value: The probability that you would see this result (or one more extreme) by chance if there were truly no difference between the variants. A p-value below 0.05 corresponds to 95% confidence.

Result: Significant or not significant, stated in plain language, with a clear recommendation on what to do next.

Important: "Not significant" does not mean variant B is worse. It means you don't have enough data yet to be sure.

When do I need more data?

If your test is not yet significant, there are two possible reasons:

  1. True null effect: There genuinely is no meaningful difference between the variants.
  2. Insufficient data: The effect exists, but you haven't seen enough visitors to measure it reliably.

As a rule of thumb: to reliably detect a 10% relative improvement in conversion rate at 95% confidence, you need roughly 4,000 visitors per variant (the exact number depends heavily on your baseline conversion rate). With 500 visitors per side, even real effects often won't show up yet.

Keep the test running until you either reach significance or hit your pre-defined minimum visitor count. Resist peeking at the dashboard early and declaring a winner: every early look increases the chance of mistaking noise for a real effect.

Document your experiments and learn from them

This calculator tells you whether your A/B test is significant. Blazeway tells you what to learn from it and makes sure that insight doesn't get lost. Hypothesis → test → result → insight → next hypothesis.

No cookie banner. No credit card. No overhead.