Confirmation Bias

Metricuno
May 18, 2026
4 min read
Quick answer

Confirmation bias is the tendency to seek out evidence that supports what you already believe and discount the rest. In CRO, it quietly distorts how teams read reviews, interpret test results, and choose which experiments to ship.

Definition
Behavioural science

Confirmation Bias

The tendency to seek and weight information that confirms existing beliefs, while ignoring or downplaying evidence that contradicts them.

Confirmation bias is one of the most studied cognitive biases — a systematic preference for evidence that matches what you already think is true. It shapes how people read product reviews, how teams interpret experiment results, and why two analysts looking at the same dashboard can walk away with opposite conclusions.

In an e-commerce context the bias is rarely loud. It shows up as a product manager skimming the 5-star reviews and skipping the 2-star ones, or a CRO lead calling a test "directionally positive" at 80% confidence because the variant matched their hypothesis. The cost is real: bad features ship, good ones get killed, and the team's belief that they're "data-driven" makes the bias harder to spot.

Also known as
Myside bias
Confirmatory bias

On the shopper side, confirmation bias drives most of what we call review behaviour. A visitor who already wants the product reads the positive reviews and skims past the negative ones; a sceptical visitor does the opposite. Both leave the page feeling more certain than the evidence justifies — and the conversion rate reflects which group your traffic mix favours, not which group is right.

On the team side it's worse, because the stakes scale. Confirmation bias is one of the most expensive cognitive biases in experimentation: it leads to peeking at results, stopping tests early when the curve looks right, and rationalising flat tests as "validating the hypothesis qualitatively". The fix isn't to be smarter — it's to commit to the protocol before the data comes in.

Formula

Perceived_Evidence_Weight = Actual_Evidence_Weight × (1 + Bias_Multiplier × Alignment_With_Prior)

Variables

Actual_Evidence_Weight

Actual evidence weight

Objective strength of a piece of information (e.g. a review's rating, a test's p-value).

Bias_Multiplier

Bias multiplier

How strongly the reader weights confirming vs disconfirming evidence. Empirically 0.3-0.8 in everyday contexts.

Alignment_With_Prior

Alignment with prior belief

+1 if the evidence supports the existing belief, -1 if it contradicts it.

Worked example

A shopper already wants to buy a €120 wireless headphone. They read two reviews of equal length and detail: one 5-star (aligned, +1) and one 2-star (contradictory, -1). Assume Bias_Multiplier = 0.5.

Actual weight of 5-star review: 1.0

Actual weight of 2-star review: 1.0

Bias multiplier: 0.5

Perceived weight: 5-star = 1.5, 2-star = 0.5. The positive review feels 3× more influential than the negative one — even though they carry equal objective weight.

This is why a single contradictory review rarely kills a purchase intent, but a single confirming review often closes it. It's also why "add a testimonial" tests usually win regardless of testimonial quality: you're handing aligned visitors more ammunition.

Confirmation bias also distorts how teams choose which experiments to run. Hypotheses that fit the existing narrative ("our checkout is too long") get prioritised; results that fit the narrative get celebrated; results that don't get re-segmented until they do. The table below shows where it most commonly leaks into a CRO workflow.

Benchmark

Where confirmation bias shows up in a typical CRO workflow

Workflow stageHow the bias appearsTypical costMitigation
Hypothesis generationOnly hypotheses matching team's existing beliefs reach the backlog60-80% of ideas come from the same 2-3 mental modelsPull hypotheses from drop-off data, not opinion
Test designSuccess metric chosen after seeing early dataInflates false-positive rate by 2-4×Pre-register primary metric before launch
Mid-test monitoringPeeking and stopping when results align with hypothesis30-50% of "wins" don't replicateFix sample size; ignore the dashboard until end-of-test
Result interpretationFlat tests reframed as "directional" if they match priorsBad variants ship, good ones get re-tested foreverPre-commit to ship/kill thresholds in the test doc
Post-mortemOnly winning tests get analysed; losers get archivedTeam loses 70% of available learningReview losers and flat tests with equal rigour

Confirmation bias sits inside the broader family of cognitive biases that shape e-commerce decisions — alongside anchoring, loss aversion, and social proof effects. What makes it particularly dangerous in CRO is that it disguises itself as judgement: the team feels more confident, not less, as the bias gets stronger. Process-level fixes (pre-registration, fixed sample sizes, blind result reviews) work; willpower doesn't.

Frequently asked

Confirmation bias FAQ

It's the habit of paying more attention to information that agrees with what you already believe and less attention to information that disagrees. Everyone does it, including data-driven teams — the bias operates below conscious awareness.

It mostly shows up as peeking at results, stopping tests early when the variant is winning, and rationalising flat outcomes as "directional wins". The downstream effect is a false-positive rate two to four times higher than the headline p-value suggests, so a lot of shipped "winners" don't replicate.

Most cognitive biases distort how you weigh a single piece of information (e.g. anchoring on the first number you see). Confirmation bias distorts which information you collect in the first place — it filters your inputs before any reasoning happens, which makes it harder to catch.

Yes, strongly. Visitors who already want a product read positive reviews more carefully and discount negative ones, and sceptical visitors do the reverse. This is why review-summary widgets and verified-purchase filters often move conversion more than adding raw review volume.

Pre-register your primary metric and sample size before launching, write down a kill threshold alongside your success threshold, and have someone who didn't design the test read the result first. Process beats willpower — assume the bias is present and design around it.

Cherry-picking is a deliberate, conscious act of selecting favourable evidence. Confirmation bias is the unconscious version of the same behaviour — you genuinely don't notice the contradictory evidence, or you remember it as weaker than it was. Cherry-picking is dishonest; confirmation bias is human.

It rarely kills a programme outright, but it caps the win rate. Teams with strong confirmation bias tend to plateau at a 15-20% test-win rate that doesn't translate to revenue, because the "wins" are noise dressed up as signal. Programmes that fix the bias usually see win rates settle lower but revenue impact rise.

Confirmation bias drives teams to stop tests early when they're "already winning", which means most peeked-at tests end below the required sample size. Fixing the sample size up front and refusing to look at results until you hit it neutralises most of the practical damage.

Watching session replays is high-bandwidth and emotionally vivid, so a single "aha" recording can override a much larger quantitative signal. Replays are useful for generating hypotheses, but a hypothesis built from three videos is still a hypothesis — it needs a quantitative test before you act on it.

Pre-commit to your decision rule in writing before the test launches: which metric, what threshold, what sample size, and what action you'll take in each outcome (ship, kill, iterate). Once those four lines exist in a doc with a timestamp, most confirmation-bias damage disappears.

Test ideas before you ship them

Run unlimited A/B tests, attach hypotheses to outcomes, and build a searchable archive of what works — and what doesn't.