🧪 Conversion Optimisation

How to A/B Test a Landing Page: What to Test First (and What to Ignore)

📅 July 3, 2025 ⏱ 11 min read 🏷 A/B Testing · CRO · Landing Pages

Most people A/B test in the wrong order: they spend weeks tweaking button colors while the headline is costing them 80% of visitors. This guide gives you a testing priority order based on impact, the minimum traffic you need before testing, which free tools work, and a 30-day roadmap to find your first real conversion win.

Table of Contents

What A/B Testing Actually Is (and What It Isn't)
The Testing Priority Order: High Impact First
How Much Traffic You Actually Need
Statistical Significance — Plain English
The Best A/B Testing Tools (Including Free)
The 30-Day A/B Testing Roadmap
What NOT to Test (Common Wasted Effort)
How to Read Your Results Without Getting Fooled

1. What A/B Testing Actually Is (and What It Isn't)

An A/B test shows two versions of a page to two randomly split groups of visitors at the same time. Group A sees the current page (the control). Group B sees the variant with one change. After enough visitors, you compare conversion rates and declare a winner.

That's it. The math is simple. What's hard is discipline:

Only change one element per test. If you change the headline, CTA, and hero image at once and conversion goes up, you don't know which change caused it.
Run both versions simultaneously. Sequential testing (A this week, B next week) is corrupted by traffic quality differences, seasonality, and external events.
Reach statistical significance before calling a winner. Ending the test after 200 visits because "B looks like it's winning" is how you get fooled by noise.
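One way to get a random, simultaneous split is to hash a stable visitor identifier into a bucket. The sketch below is a minimal illustration, not the method of any particular tool. The function name `assign_variant`, the experiment label, and the `visitor-N` IDs are all hypothetical, and it assumes you already have some stable ID per visitor (for example, a first-party cookie value).

```python
import hashlib

def assign_variant(visitor_id: str, experiment: str = "headline-test") -> str:
    """Deterministically assign a visitor to 'A' or 'B' with a roughly 50/50 split."""
    # Hash the (hypothetical) experiment name together with the visitor ID so the
    # same visitor always sees the same version within one experiment.
    digest = hashlib.sha256(f"{experiment}:{visitor_id}".encode("utf-8")).hexdigest()
    # Map the hash to a bucket from 0 to 99: buckets 0-49 get A, 50-99 get B.
    bucket = int(digest, 16) % 100
    return "A" if bucket < 50 else "B"

if __name__ == "__main__":
    counts = {"A": 0, "B": 0}
    for i in range(10_000):
        counts[assign_variant(f"visitor-{i}")] += 1
    print(counts)  # both counts should land close to 5,000
```

Because the assignment depends only on the ID and the experiment name, both versions run at the same time and a returning visitor never flips between them.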

⚠️ The Most Common Mistake

Running a test for 3 days, seeing version B is "winning," and implementing it without reaching significance. Short tests are dominated by random variance, not real signal. A page with 50 visitors per day needs to run a test for at least 2–4 weeks, and often much longer, before the result is trustworthy.

2. The Testing Priority Order: High Impact First

Not all elements are equal. Test these in order — the top items have the highest potential lift and the clearest hypotheses:

Rank	What to Test	Potential Lift	Why It's High Priority
1	Headline: the single most read element on the page	50–400%	80% of visitors read the headline and nothing else. A headline test touches every person who hits the page.
2	CTA Button Copy: "Submit" vs "Start my free trial" vs "Get instant access"	15–90%	High leverage: every visitor sees it at the decision point. Copy changes here are free and fast to implement.
3	Hero Image / Video: product screenshot vs lifestyle vs explainer GIF	10–60%	Visuals dominate first impressions. A product screenshot often outperforms stock photography by a significant margin.
4	Lead Form Length: 3 fields vs 1 field vs no form (CTA only)	10–50%	Each extra form field costs 5–10% of leads. Removing phone number alone can double form completion rates.
5	Social Proof Placement: testimonials above vs below the fold	8–35%	Moving 1 strong testimonial above the fold has produced significant lifts in cold-traffic campaigns.
6	Pricing Display: show price early vs reveal after features vs pricing page only	5–25%	For high-ticket products, showing price early qualifies visitors and reduces wasted leads. For lower-ticket, anchoring features first then price can lift conversions.

💡 The Rule of Testing Order

Always test the element that's seen by the most visitors first. The headline is seen by 100% of visitors. The FAQ section is seen by maybe 30%. A 10% lift on the headline is worth roughly 3× as much as a 10% lift on the FAQ, because the headline touches every single visitor.

3. How Much Traffic You Actually Need

This is where most people get confused. The minimum traffic depends on your current conversion rate and how big a lift you're trying to detect:

📊 Minimum Visitors Per Variant to Detect a Lift

Current CVR	Visitors per Variant	To Detect a 20% Relative Lift
1%	~43,000	1% → 1.2%
2%	~21,000	2% → 2.4%
5%	~8,200	5% → 6%
10%	~3,800	10% → 12%

These are per variant, so double these numbers for your total test traffic. The figures assume 95% confidence (two-sided) and 80% statistical power, the usual defaults. Relaxing either assumption lowers the numbers but makes false winners and missed winners more likely. Use a free A/B test sample size calculator to work out your specific numbers.
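If you would rather compute the requirement yourself, the standard two-proportion approximation fits in a few lines. This is a sketch under stated assumptions: a two-sided test, with `confidence` and `power` defaulting to 95% and 80%. The function names are illustrative, and a given calculator may use different defaults and report somewhat different figures.

```python
from math import ceil
from statistics import NormalDist

def visitors_per_variant(baseline_cvr: float, relative_lift: float,
                         confidence: float = 0.95, power: float = 0.80) -> int:
    """Approximate visitors needed per variant to detect a relative lift."""
    p1 = baseline_cvr
    p2 = baseline_cvr * (1 + relative_lift)
    # z-scores for the two-sided confidence level and for the desired power.
    z_alpha = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    z_beta = NormalDist().inv_cdf(power)
    # Sum of the binomial variances of the two conversion rates.
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)

def days_to_finish(per_variant: int, daily_visitors: int, variants: int = 2) -> int:
    """Days of traffic needed when visitors are split evenly across variants."""
    return ceil(per_variant * variants / daily_visitors)

if __name__ == "__main__":
    for cvr in (0.01, 0.02, 0.05, 0.10):
        n = visitors_per_variant(cvr, 0.20)
        print(f"CVR {cvr:.0%}: {n:,} per variant, "
              f"{days_to_finish(n, 500):,} days at 500 visitors/day")
```

The required sample shrinks quickly as the baseline conversion rate or the lift you are hunting for gets larger, which is one reason big, bold changes are easier to test on small sites.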

What If You Have Low Traffic?

If your page gets fewer than 100 visitors a day, traditional A/B testing is unreliable. Instead:

Qualitative testing first — use Hotjar or Microsoft Clarity (free) to watch session recordings and find where visitors drop off or hesitate
5-second test — show your hero section to 5–10 people for exactly 5 seconds, then ask what the page does and who it's for. Confusion reveals headline problems instantly
Implement the known winners — many copy changes (benefit headlines, specific CTA copy, risk removers) are so well-tested across thousands of sites that implementing them without A/B testing is still a net positive
Test one big change at a time — run for a full month to accumulate enough data, and accept a wider confidence interval

4. Statistical Significance — Plain English

Statistical significance answers the question: "How likely is it that version B's results are real — not just random luck?"

80% Confidence: 1 in 5 chance the result is noise. Don't act on this.
90% Confidence: acceptable for small, easy-to-reverse changes.
95% Confidence: standard threshold. Implement at this level.

In practice: You run a test, you get a result, your testing tool says "95% confidence." Loosely, that means a gap this large would show up only about 5% of the time if the two versions actually performed the same. For a button copy change that takes 10 minutes to implement, 90% is enough. For a full page redesign with 40 hours of work behind it, wait for 95%.
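To see roughly where a "confidence" figure comes from, here is a minimal two-proportion z-test. It is a sketch, not the calculation any specific tool performs: it assumes a two-sided test with a pooled standard error and reports confidence as one minus the p-value, which is the loose convention many dashboards use. The function name is illustrative.

```python
from math import sqrt
from statistics import NormalDist

def confidence_b_differs(visitors_a: int, conversions_a: int,
                         visitors_b: int, conversions_b: int) -> float:
    """Return 1 - p-value for a two-sided, pooled two-proportion z-test."""
    rate_a = conversions_a / visitors_a
    rate_b = conversions_b / visitors_b
    # Pooled rate: the best single estimate if A and B truly perform the same.
    pooled = (conversions_a + conversions_b) / (visitors_a + visitors_b)
    se = sqrt(pooled * (1 - pooled) * (1 / visitors_a + 1 / visitors_b))
    if se == 0:
        return 0.0  # no conversions (or all conversions): nothing to compare
    z = (rate_b - rate_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return 1 - p_value

if __name__ == "__main__":
    # Hypothetical counts: 50 of 1,000 convert on A, 75 of 1,000 on B.
    print(f"{confidence_b_differs(1000, 50, 1000, 75):.1%}")
```

The same 5% vs 7.5% gap on a tenth of the traffic would come out far less convincing, which is why the sample size matters as much as the gap itself.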

⚠️ The Peeking Problem

Checking your test every day and stopping when it "looks like a winner" is called peeking bias — it dramatically inflates false positive rates. Set your minimum run time before starting the test, and don't end it early regardless of interim results. Run at minimum until you hit your required sample size AND at least 2 full weeks (to account for day-of-week traffic variation).
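A small simulation makes the peeking problem concrete. The sketch below runs A/A tests, where both versions are identical and any "winner" is a false positive. It then compares checking once at the end with checking every day and stopping at the first significant reading. All parameters (14 days, 100 visitors per variant per day, a 5% conversion rate, 500 simulated tests) are illustrative assumptions.

```python
import random
from math import sqrt

Z_CRIT = 1.96  # two-sided 95% threshold

def is_significant(n_a: int, conv_a: int, n_b: int, conv_b: int) -> bool:
    """Pooled two-proportion z-test at the 95% level."""
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        return False
    return abs(conv_b / n_b - conv_a / n_a) / se > Z_CRIT

def simulate_aa_tests(runs: int = 500, days: int = 14,
                      daily_per_variant: int = 100, cvr: float = 0.05,
                      seed: int = 1) -> tuple[float, float]:
    """Return (false positive rate with daily peeking, rate with one final check)."""
    rng = random.Random(seed)
    peeking_fp = 0
    fixed_fp = 0
    for _ in range(runs):
        n = conv_a = conv_b = 0
        ever_significant = False
        for _day in range(days):
            n += daily_per_variant
            # Both arms share the same true conversion rate.
            conv_a += sum(rng.random() < cvr for _ in range(daily_per_variant))
            conv_b += sum(rng.random() < cvr for _ in range(daily_per_variant))
            if is_significant(n, conv_a, n, conv_b):
                ever_significant = True  # a daily peeker would have stopped here
        peeking_fp += ever_significant
        fixed_fp += is_significant(n, conv_a, n, conv_b)
    return peeking_fp / runs, fixed_fp / runs

if __name__ == "__main__":
    peeking, fixed = simulate_aa_tests()
    print(f"False winners with daily peeking: {peeking:.1%}")
    print(f"False winners with one final check: {fixed:.1%}")
```

The single final check should stay near the nominal 5%, while the daily peeker declares a false winner noticeably more often, even though nothing about the page changed.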

5. The Best A/B Testing Tools (Including Free)

Tool	Price	Best For	Limitation
Google Optimize	Free*	HTML/CSS/JS pages with Google Analytics. Easy setup.	*Discontinued by Google in September 2023. Use one of the alternatives below
Microsoft Clarity (Best Free)	Free	Session recordings, heatmaps: qualitative data before testing	No A/B test runner; pairs with other tools
Hotjar	Free tier	Heatmaps, recordings, and on-page surveys to pair with an A/B testing tool	Free tier limited to 35 sessions/day
VWO	From $199/mo	Full-stack A/B testing for high-traffic sites	Overkill for sub-10K monthly visitors
Optimizely	Enterprise	Large teams, multi-page experiments, personalisation	Pricing requires sales call — not for solo use
AB Tasty	From ~$99/mo	Mid-market, visual editor, good for no-code testing	Steeper learning curve than basic tools
Manual Split via Cloudflare (DIY)	Free	Deploy two HTML files, split traffic with CF Workers	Requires developer setup; no built-in stats

For most freelancers and small teams: start with Microsoft Clarity (free) to identify where visitors are dropping off, then implement changes manually and track the result in GA4. A before/after comparison like this is weaker evidence than a simultaneous split, so treat it as a directional signal rather than proof. You don't need a dedicated A/B testing tool to see your conversion rate change; you need consistent measurement.

6. The 30-Day A/B Testing Roadmap

Week 1

Diagnose Before You Test

Install Microsoft Clarity (free) on your landing page
Watch 20 session recordings — look for where people stop scrolling
Check your GA4 scroll depth — what % of visitors reach the CTA?
Run a 5-second test on your headline with 5 real people
Write your hypothesis: "Changing [X] to [Y] will increase conversions because [Z]"

Week 2

Test #1: Headline

Write 3 headline variants using frameworks from your diagnosis
Pick the strongest 1 challenger against your current headline
Set up the A/B test in your chosen tool
Let it run — do not check results daily. Set calendar reminder for Day 14
Document your hypothesis and expected lift in a simple Google Sheet

Week 3

Read Headline Results + Launch Test #2

Check significance — if ≥90% and sample size met, implement the winner
If not significant yet, let it run another week
Write hypothesis for Test #2: CTA button copy
Create 2–3 CTA variants (first-person, specific outcome, risk-remover variants)
Launch Test #2 with the winning headline now live

Week 4

Review + Plan Next Sprint

Read CTA test results — implement winner
Calculate your cumulative conversion rate improvement
Identify the next highest-impact element to test (hero image or form length)
Review Clarity recordings again with the new version live
Document learnings and carry insights forward to your next test

7. What NOT to Test (Common Wasted Effort)

These are the tests that busy teams run for weeks and get nothing useful from:

Button color — Unless your current button has no contrast with the background, button color tests rarely produce meaningful lifts. The copy on the button moves conversion rates; the hex code doesn't.
Font choices — Between two readable, web-safe font pairs, conversion differences are statistically negligible in virtually all tests.
Stock photo A vs stock photo B — Both are stock photos. Neither is your product. Test "stock photo vs product screenshot" — that's a meaningful variable.
Number of bullet points (4 vs 5) — Micro-copy structure tests rarely reach significance without enormous traffic volumes. Test what those bullets say, not how many there are.
Footer content — Footer A/B tests require massive traffic volumes to reach significance because so few visitors read it. The effort is rarely justified.
Testing on mobile-only traffic — If you're separating desktop and mobile audiences to test separately, make sure each segment has enough traffic independently. Most small sites don't.

8. How to Read Your Results Without Getting Fooled

Novelty Effect

When you change an element on a page your existing audience has seen before, they engage with the new version simply because it's new. This creates an artificial short-term lift that fades. For returning visitor traffic, always run tests for at least 2 weeks to let novelty wear off.

Sample Ratio Mismatch

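If your test was supposed to split 50/50 but your tool shows 48/52 or worse, investigate before reading the results. On a few hundred visitors a gap like that can be pure chance; on several thousand it usually points to a problem. An unequal split can corrupt your data if the discrepancy is caused by a bug, caching issue, or bot traffic.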
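Whether a given imbalance is worth worrying about depends on how many visitors are behind it, and a chi-square goodness-of-fit test puts a number on that. The sketch below assumes a planned 50/50 split by default. The function name is illustrative, and the cutoff you act on (many teams use a strict one such as 0.01) is a judgment call rather than a rule from this guide.

```python
from math import erfc, sqrt

def srm_p_value(visitors_a: int, visitors_b: int,
                expected_share_a: float = 0.5) -> float:
    """p-value that the observed split is consistent with the planned split."""
    total = visitors_a + visitors_b
    expected_a = total * expected_share_a
    expected_b = total - expected_a
    chi2 = ((visitors_a - expected_a) ** 2 / expected_a
            + (visitors_b - expected_b) ** 2 / expected_b)
    # With 1 degree of freedom, the chi-square tail probability is erfc(sqrt(chi2 / 2)).
    return erfc(sqrt(chi2 / 2))

if __name__ == "__main__":
    # Same 48/52 ratio, very different amounts of evidence.
    print(f"480 vs 520:     p = {srm_p_value(480, 520):.3f}")
    print(f"4,800 vs 5,200: p = {srm_p_value(4800, 5200):.6f}")
```

A small p-value says the split is unlikely to be chance, so look for a bug, a caching issue, or bot traffic before trusting the conversion numbers.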

Segment Pollution

If your page gets traffic from wildly different sources (paid ads, organic, email, direct), a meaningful change for one audience might be noise for another. If your email audience already knows your brand and converts at 15% while your paid traffic converts at 1.5% — they shouldn't be in the same test.
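A quick way to spot segment pollution is to break the result down by traffic source before reading the blended number. The sketch below uses made-up counts chosen to match the 15% and 1.5% baselines mentioned above. The segment names, the data layout, and the function name are all hypothetical.

```python
def lift_by_segment(results: dict) -> dict:
    """Compute conversion rates and relative lift of B over A for each segment.

    results maps segment name -> {"A": (visitors, conversions),
                                  "B": (visitors, conversions)}.
    """
    report = {}
    for segment, arms in results.items():
        rate_a = arms["A"][1] / arms["A"][0]
        rate_b = arms["B"][1] / arms["B"][0]
        report[segment] = {
            "cvr_a": rate_a,
            "cvr_b": rate_b,
            # Relative lift is undefined when the control has no conversions.
            "relative_lift": (rate_b / rate_a - 1) if rate_a > 0 else None,
        }
    return report

if __name__ == "__main__":
    # Hypothetical test data, not real results.
    data = {
        "email": {"A": (400, 60), "B": (400, 61)},
        "paid":  {"A": (2000, 30), "B": (2000, 42)},
    }
    for segment, row in lift_by_segment(data).items():
        print(segment, f"A={row['cvr_a']:.2%}", f"B={row['cvr_b']:.2%}",
              f"lift={row['relative_lift']:+.1%}")
```

Each segment still needs enough traffic of its own before its lift means anything, so treat a breakdown like this as a prompt for a follow-up test, not as a result.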

External Events

A competitor going viral, a mention in a newsletter, a Product Hunt launch, a Twitter storm — any external event that changes your traffic quality mid-test corrupts the results. If something significant happened during your test window, note it and consider re-running if the event affected your traffic mix.

📊 The Compound Effect

A 30-day roadmap that produces a 15% lift on the headline and a 12% lift on CTA copy doesn't add up to 27%. It compounds: 1.15 × 1.12 = 1.288, a 28.8% total lift. Land 4 winning tests in a year (one per quarter) at an average of 12% lift each and your conversion rate ends the year at 1.12⁴ = 1.57× where it started: 57% more conversions from the same traffic, with zero ad spend increase.
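The compounding arithmetic is easy to check or reuse for your own sequence of wins. This is a small sketch with an illustrative function name. It assumes each lift applies on top of the previous winner and holds over time, which real tests do not always do.

```python
from math import prod

def compound_lift(lifts: list[float]) -> float:
    """Total relative lift from applying each lift on top of the previous one."""
    return prod(1 + lift for lift in lifts) - 1

if __name__ == "__main__":
    print(f"{compound_lift([0.15, 0.12]):.1%}")   # headline win, then CTA win
    print(f"{compound_lift([0.12] * 4):.1%}")     # four 12% wins in a row
```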
