Integrations & a developer-grade API
Pull real search and performance data from Google, push fixes to GitHub, and automate the whole crawl loop over a REST API with scoped keys and signed webhooks.
Already have an account? Log in.
Connected services
liveKeys are environment-scoped: cx_live_ for production, cx_test_ for sandbox.
Your Google data decides what gets fixed first
Connect Search Console and GA4 and CrawlX stops guessing. Impressions, clicks, CTR, and average position flow onto every URL, and sessions overlay the crawl — so a warning on a high-traffic page outranks the same warning on a page nobody visits. PageSpeed Insights adds live Lighthouse scores on top.
Search Console · demand overlay
last 28 daysDrive every crawl from your own stack
Generate environment-scoped keys — cx_live_ for production, cx_test_ for sandboxed runs — each carrying a read or crawl-trigger scope. Trigger crawls, poll their status, list issues and pages ranked by traffic impact, and export reports, all over a clean JSON API.
API keys
2 active404, never confirming what your key can't reach.Crawl events streamed to your endpoint — and verifiably yours
Subscribe to crawl.completed, crawl.failed, and issue.found, and CrawlX POSTs each event to your URL as it happens. Every delivery is signed with HMAC-SHA256, so a single signature check proves it came from CrawlX and wasn't tampered with in transit.
Pull data in, push fixes out
Native Google integrations, GitHub for fixes, and a REST API plus webhooks for everything else.
Search Console
Pull impressions, clicks, CTR, and average position per URL — CrawlX uses real demand to weight every issue, so the top of the list is the work that actually earns traffic.
Google Analytics 4
Overlay sessions and traffic on the crawl so a thin-content warning on a 40k-session page outranks the same warning on a page nobody visits.
PageSpeed Insights
Lighthouse performance, accessibility, and best-practice scores pulled straight onto your pages, alongside lab Core Web Vitals from the crawl.
GitHub
High-impact fixes land as pull requests on a branch. A human reviews and merges — CrawlX never auto-merges and never pushes to your default branch.
REST API
Environment-scoped keys (cx_live_ / cx_test_) with read and crawl-trigger scopes. Trigger crawls, poll status, list issues and pages, and export from your own stack.
Webhooks
Subscribe to crawl.completed, crawl.failed, and issue.found. Every delivery is signed with HMAC-SHA256 so you can verify it really came from CrawlX.
Plus exports in CSV, JSON, PDF, and XML sitemap. Read the full API reference
Connect, call, react
Wire CrawlX into your pipeline once, then let crawls and events flow both ways.
Connect your sources
Authorize Search Console, GA4, PageSpeed Insights, and GitHub in a few clicks. Real demand and performance data start weighting your crawl immediately.
Call the API
Mint a cx_live_ or cx_test_ key, trigger a crawl, then poll the returned crawl_id for status and pull issues, pages, and exports.
React to events
Get a signed POST the moment a crawl finishes, fails, or surfaces a new issue. Verify the HMAC-SHA256 signature and kick off your own automation.
Explore more features
Integrations are one piece of the loop. Here's the rest of what CrawlX does.
Cloud crawl engine
Spider entire sites in the cloud — up to 500,000 URLs per crawl, with JS rendering.
Impact triage & 65+ checks
Every issue ranked by traffic impact and grouped by root cause across 13 categories.
AI: fixes, content & schema
Bring your own key for fix drafting, content scoring, SERP tuning, and JSON-LD.
Technical-SEO toolkit
Link explorer, schema inspector, robots tester, crawl compare, and more.
Reports & collaboration
White-label PDF reports, shareable links, and team roles from Owner to Viewer.
All features
See the full picture — crawl, diagnose, and ship the fix, in one tool.
Wire CrawlX into your stack.
Automate the whole loop.
Connect Google, push fixes to GitHub, and drive crawls from your own code in minutes.