Every issue, ranked by the traffic it affects
CrawlX runs 65+ technical checks across 13 categories, then sorts every finding by estimated traffic impact and groups it by root cause — so the top of the list is always the highest-leverage work, not the longest list of warnings.
The highest-leverage fix is always at the top
Most crawlers sort by raw severity, so a thousand cosmetic warnings bury the one change that matters. CrawlX scores every issue by the traffic it actually affects — weighted with real demand from Search Console and GA4 — so triage starts with the work that moves rankings.
Issues by impact
7,839 pages affectedOne template behind 4,000 errors? Fix it once
CrawlX reads the full crawl graph and collapses thousands of identical errors into the single template, directory, or rule that caused them — so you fix the root, not the symptoms. Underneath sits the full check suite: 65+ checks across 13 categories, every page, every crawl.
Check coverage
13 categories · 65+ checksReal performance data, scored where it matters
CrawlX measures Largest Contentful Paint, Interaction to Next Paint, and Cumulative Layout Shift per template against Google's good and needs-work thresholds, pulled from PageSpeed Insights. Then crawl-budget analysis separates the low-value URLs wasting bot time from the high-impact pages that deserve it.
Core Web Vitals
mobile · field dataFrom a wall of warnings to a ranked plan
Six capabilities turn a raw crawl into a prioritised list your team can actually work through — fewer rows, more leverage.
Traffic-impact score
Every issue is sorted by the traffic it actually affects — weighted with Search Console and GA4 demand — not by raw severity, so the top of the list is the work that moves rankings.
Root-cause grouping
One broken template behind thousands of errors collapses into a single fix. Group by template, directory, or rule and resolve the cause once instead of chasing symptoms.
Core Web Vitals
LCP, INP, and CLS measured per template against Google's good / needs-work thresholds, pulled from PageSpeed Insights so field and lab data line up.
Duplicate detection (SimHash)
Near-duplicate and thin pages are clustered with SimHash fingerprints — catching boilerplate and templated content that exact-match checks miss.
Crawl-budget analysis
See where bots burn budget. Low-value URLs — faceted dupes, infinite params, redirect chains — are separated from high-impact pages that deserve the crawl.
Indexability & canonicals
Robots directives, canonical targets, and noindex conflicts resolved per URL, so the pages you want in the index are indexable — and the rest aren't wasting equity.
Detect, score, group
Three passes turn 40,000 raw findings into a short, ranked list of root causes.
Detect
Every crawled URL is run through 65+ checks across 13 categories — status, meta, headings, content, images, links, indexability, schema, hreflang, pagination, security, performance, and resources.
Score
Each finding is scored by estimated traffic impact, weighted with Search Console and GA4 demand and the number of pages affected — so severity becomes a tiebreaker, not the sort order.
Group
Identical issues collapse into the one template, directory, or rule that caused them. Near-duplicate pages are clustered with SimHash, so you fix the root cause once and clear thousands of rows.
A ranked list is the start — CrawlX opens the PR
Triage tells you exactly what to fix first. For high-impact issues, CrawlX can draft the change and open it as a GitHub pull request you review and merge — it never auto-merges and never pushes to your default branch. AI is bring-your-own-key (Anthropic Claude or OpenAI GPT-4o).
Top fix this crawl
#1 by impacttemplates/product.liquid — one template behind 4,118 affected pages.The rest of the CrawlX loop
Triage is one step. See how crawling, AI fixes, the toolkit, integrations, and reports fit together.
Cloud crawl engine
Spider whole sites in the cloud — up to 500,000 URLs per crawl, with JS rendering and render diff.
AI: fixes, content & schema
Bring your own key to draft fixes, score content, tune titles, and generate JSON-LD schema.
Technical-SEO toolkit
Link Explorer, Schema Inspector, Robots Tester, crawl compare, and more — all in one place.
Integrations & API
Search Console, GA4, PageSpeed, GitHub, plus a REST API and signed webhooks.
Reports & collaboration
White-label PDF reports, shareable links, and team roles from Owner to Viewer.
All features
The full overview — crawl, diagnose, and fix, end to end, on one page.
Stop reading reports.
Start with the fix that matters.
Run a crawl and get a ranked, root-grouped plan in the time it takes to read this.