Sizing Tools and Fit Finders
A side-by-side comparison of five sizing interventions — from static charts to AR try-on — ranked by implementation cost and measured impact on size-related refunds.
Sizing Tools and Fit Finders
On-product interventions — charts, quizzes, model fit notes, review signals, AR try-on — that reduce size-related returns in apparel and footwear.
Sizing tools and fit finders are the category of pre-purchase interventions designed to help shoppers pick the right size the first time. They range from a static size chart in a modal to a body-measurement quiz that recommends a SKU, model-fit annotations ("model is 178cm wearing M"), aggregated review signals ("runs small"), and AR try-on overlays.
All five attack the same problem from different angles: a checkout that is happy to sell the wrong size. The right choice depends on category, average order value, and how much of your refund volume is genuinely sizing-driven versus styling regret. For most apparel stores in the €1M-€15M band, the answer is a stack of two or three — not the most expensive single tool.
Apparel and footwear return rates sit between 20% and 40%, and roughly half of those returns cite size or fit as the reason. That makes sizing the single largest controllable lever on refund rate — bigger than shipping speed, bigger than product photography, often bigger than copy.
The trade-off across tools is straightforward: the more accurate the recommendation, the more friction it adds and the more it costs to install and maintain. A static chart costs nothing and helps a little; AR try-on costs five figures a year and helps a lot — for the categories where it works. The table below is the shortlist most Shopify and WooCommerce stores actually evaluate.
Sizing interventions compared by setup cost, ongoing effort, and impact on size-related refund rate
| Tool | Setup cost | Ongoing effort | Typical refund-rate lift | Best fit |
|---|---|---|---|---|
| Static size chart (modal) | €0-€200 | Low | 5-10% reduction | Any apparel store, table stakes |
| Model-fit annotations | €0-€500 per shoot | Medium | 8-15% reduction | Premium apparel, lookbook brands |
| Review-driven fit signals | €20-€80/mo (Okendo, Yotpo) | Low | 10-20% reduction | Stores with >50 reviews per SKU |
| Body-measurement quiz | €50-€300/mo (Kiwi, Fit Analytics) | Medium | 20-35% reduction | Mid-tier apparel, broad size range |
| AR try-on | €500-€2,000/mo + 3D assets | High | 25-40% reduction | Eyewear, footwear, premium denim |
The lift numbers are reductions in the sizing-attributed slice of returns, not total returns. A store with a 30% total return rate where 50% of returns are sizing-related has a 15-point sizing problem; a quiz that delivers a 25% reduction takes that to roughly 11 points — about four points off total returns. That maths matters when you're sizing the business case.
Which tool to pick by category and stage
If you sell standard-cut apparel with consistent sizing across SKUs, a review-driven fit signal layered on top of a clean size chart will close most of the gap. Shoppers trust other shoppers more than they trust your chart, and the "runs small / true to size / runs large" badge is the single highest-ROI addition for stores already on Okendo or Yotpo.
If your range spans XS-3XL, or you sell across regions with different sizing conventions, a body-measurement quiz pays for itself fast. Fit Analytics, Kiwi Sizing, and the native Shopify size-recommender apps all hit similar accuracy once you've fed them a few hundred completed purchases. Categories where fit is genuinely three-dimensional — bras, denim, eyewear, footwear — are where AR try-on stops being a gimmick and becomes the highest-impact lever.
The over-engineering trap
Installing AR try-on before you've fixed the size chart is the most common mistake in this category. Diagnose your sizing mismatch first — pull the last 200 returns, tag them by reason, and look at which SKUs drive the volume. If three products generate 40% of size returns, photograph them on three body types before you buy a quiz.
Stack the tools and measure the lift
The highest-performing stacks combine a passive signal (chart + review badge) with an active recommendation (quiz or AR). Passive elements catch shoppers who would have guessed; active elements catch shoppers who would have abandoned. Run them in sequence, not in parallel — a chart, a fit badge, and a quiz CTA all competing on the same PDP creates choice paralysis.
Measure each addition with a clean A/B test on size-attributed return rate, not overall conversion. Sizing tools sometimes nudge conversion down slightly — a more cautious shopper is a better shopper — and the win shows up in the refund line two to four weeks later. If you're not splitting traffic, at minimum compare cohorts pre- and post-launch with a 30-day window on each side.
Reduction in size-attributed refund rate by tool (apparel, midpoint estimates)
Frequently asked questions
For broad apparel, Kiwi Sizing and Fit Analytics are the most-installed measurement quizzes and integrate cleanly with Shopify themes. For footwear and eyewear, look at AR-first vendors like Vyking or FittingBox. Review-based fit signals are typically delivered through your existing reviews app (Okendo, Yotpo, Judge.me) rather than a standalone tool.
They reduce them. Multiple published case studies and our own diagnostic work on the underlying sizing mismatch problem show net reductions of 15-30% in size-attributed returns after a quiz or AR tool launches — the returns don't reappear as styling or quality complaints. The shoppers who would have ordered two sizes simply order one.
Slightly downward to flat, in most cases. A quiz that takes 30 seconds will lose some impatient shoppers but qualify the rest. The net economic impact is positive once you net out the refund savings, but expect to defend a small conversion dip to whoever owns top-line revenue.
Usually not for general apparel at that revenue band. AR pays off in categories where fit is geometric — eyewear, watches, footwear, and increasingly denim. For tops and dresses, model annotations and a measurement quiz deliver 80% of the impact at 20% of the cost.
When a shopper leaves a review, the reviews platform asks a one-tap question: did this item run small, true to size, or large? Aggregated answers appear as a badge on the PDP. Accuracy improves with volume; you need roughly 50 reviews per SKU before the signal becomes reliable.
Yes. Kiwi Sizing, Fit Analytics, and most Shopify-native sizing apps install via theme block or app embed with no code. The work that does require effort is curating the per-SKU measurement data the quiz reads from — that's a merchandising task, not a dev task.
A size chart is a static lookup table the shopper interprets themselves. A fit finder takes inputs — height, weight, usual size in a reference brand, or body measurements — and returns a single SKU recommendation. The fit finder removes the interpretation step, which is where most sizing mistakes are made.
Allow one full return window plus two weeks. For most stores that's 6-8 weeks before the refund-rate change is statistically clean. Conversion impact is visible inside a week; refund impact lags because returns trickle in over the 30-day policy.
The quiz logic can be shared, but the size mapping has to be localised. EU 38 is US 8 in some brands and US 6 in others; a good fit finder lets you maintain a per-region size table. If you sell across regions on Shopify Markets, confirm the tool reads your market-specific size variants.
They're the biggest single category of refund reduction levers for apparel, but they don't solve quality complaints, late shipping, or styling regret. Diagnose your return reasons first — if sizing is under 30% of your refund volume, a fit finder won't move the needle and your effort is better spent elsewhere.
Get an AI expert review of your site
Paste your URL — Metricuno's AI runs the same heuristic checks a senior CRO consultant would, scoring your page and prioritising the fixes that'll move conversion fastest.