Connected

Integrations & a developer-grade API

Pull real search and performance data from Google, push fixes to GitHub, and automate the whole crawl loop over a REST API with scoped keys and signed webhooks.

Start crawling free Read the API docs


Connected services

live

Search Console: impressions · clicks · CTR
Google Analytics 4: sessions · traffic
PageSpeed Insights: Lighthouse scores
GitHub: opens fix PRs

Exports: CSV · JSON · PDF · XML sitemap

POST /v1/projects/p_1a2b3c4d/crawls · 202 Accepted

curl -X POST https://api.crawlx.ai/v1/projects/p_1a2b3c4d/crawls \
  -H "Authorization: Bearer cx_live_8f2c…a91d" \
  -H "Content-Type: application/json" \
  -d '{ "mode": "spider", "max_urls": 100000 }'

{
  "crawl_id": "c_2f7a9e1b4d6c8a0e",
  "status": "queued",
  "poll": "/v1/crawls/c_2f7a9e1b4d6c8a0e"
}
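The same call can be sketched in Python with only the standard library. The endpoint, headers, and body come from the curl example above; the function names build_crawl_request and trigger_crawl are hypothetical, and the response is assumed to be the JSON object shown above.

```python
import json
import urllib.request

API_BASE = "https://api.crawlx.ai"  # host taken from the curl example


def build_crawl_request(project_id: str, api_key: str,
                        max_urls: int = 100_000) -> urllib.request.Request:
    """Build (but do not send) the POST that queues a crawl."""
    body = json.dumps({"mode": "spider", "max_urls": max_urls}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/v1/projects/{project_id}/crawls",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def trigger_crawl(project_id: str, api_key: str,
                  max_urls: int = 100_000) -> dict:
    """Send the request and return the parsed JSON body.

    Assumes the body matches the example response: crawl_id, status, poll.
    """
    request = build_crawl_request(project_id, api_key, max_urls)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes the call easy to inspect or unit-test without touching the network.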

Keys are environment-scoped: cx_live_ for production, cx_test_ for sandbox.

Weighted by real demand

Your Google data decides what gets fixed first

Connect Search Console and GA4 and CrawlX stops guessing. Impressions, clicks, CTR, and average position flow onto every URL, and sessions overlay the crawl — so a warning on a high-traffic page outranks the same warning on a page nobody visits. PageSpeed Insights adds live Lighthouse scores on top.

Impressions & clicks · CTR & avg position · GA4 sessions · Lighthouse scores

Search Console · demand overlay

Last 28 days

Impressions: 1.94M
Clicks: 82.1k
CTR: 4.2%
Avg. position: 9.7

/collections/new-arrivals · 61.2k clicks · Thin content
/products/trail-runner-gtx · 14.8k clicks · Slow LCP
/blog/how-to-lace-boots · 3.1k clicks · Healthy
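To make the idea of demand weighting concrete, here is a minimal sketch that ranks issues by severity multiplied by clicks. This is an illustration only, not CrawlX's actual scoring: the severity weights, the PageIssue type, the "warning" severities, and the zero-traffic page are all assumptions; the click counts for the first two URLs come from the overlay above.

```python
from dataclasses import dataclass
from typing import Iterable, List

# Hypothetical weights; the real scoring model is not published on this page.
SEVERITY_WEIGHT = {"error": 3.0, "warning": 2.0, "notice": 1.0}


@dataclass(frozen=True)
class PageIssue:
    url: str
    issue: str
    severity: str  # assumed label: "error", "warning", or "notice"
    clicks: int    # Search Console clicks over the last 28 days


def priority(item: PageIssue) -> float:
    """Score an issue so the same severity matters more on busier pages."""
    return SEVERITY_WEIGHT[item.severity] * item.clicks


def rank(items: Iterable[PageIssue]) -> List[PageIssue]:
    """Return issues ordered from highest to lowest priority."""
    return sorted(items, key=priority, reverse=True)


issues = [
    PageIssue("/products/trail-runner-gtx", "Slow LCP", "warning", 14_800),
    PageIssue("/collections/new-arrivals", "Thin content", "warning", 61_200),
    # Hypothetical page with no traffic, added to show the contrast.
    PageIssue("/archive/old-page", "Thin content", "warning", 0),
]
ranked = rank(issues)
```

With these inputs, the thin-content warning on /collections/new-arrivals sorts first and the identical warning on the zero-traffic page sorts last.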

A developer-grade REST API

Drive every crawl from your own stack

Generate environment-scoped keys — cx_live_ for production, cx_test_ for sandboxed runs — each carrying a read or crawl-trigger scope. Trigger crawls, poll their status, list issues and pages ranked by traffic impact, and export reports, all over a clean JSON API.

Trigger crawls · Poll status · List issues & pages · Export reports

Full API reference

API keys

2 active

cx_live_8f2c…a91d · crawl-trigger · production

cx_test_3b71…0e4f · read · sandbox

Resources your key can't reach return 404, the same as resources that don't exist, so a response never confirms what is there.
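Because the environment is encoded in the key prefix, client code can guard against using a production key where only sandbox traffic is intended. The sketch below relies solely on the cx_live_ and cx_test_ prefixes described above; the helper names key_environment and require_sandbox are hypothetical.

```python
def key_environment(api_key: str) -> str:
    """Map a key to its environment using the documented prefixes."""
    if api_key.startswith("cx_live_"):
        return "production"
    if api_key.startswith("cx_test_"):
        return "sandbox"
    raise ValueError("unrecognised CrawlX key prefix")


def require_sandbox(api_key: str) -> str:
    """Return the key unchanged, or raise if it is not a sandbox key."""
    if key_environment(api_key) != "sandbox":
        raise RuntimeError("refusing to use a non-sandbox key here")
    return api_key
```

A check like this could run once at startup in CI or staging, before any request is sent.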

Webhooks you can trust

Crawl events streamed to your endpoint — and verifiably yours

Subscribe to crawl.completed, crawl.failed, and issue.found, and CrawlX POSTs each event to your URL as it happens. Every delivery is signed with HMAC-SHA256, so a single signature check proves it came from CrawlX and wasn't tampered with in transit.

crawl.completed · crawl.failed · issue.found · HMAC-SHA256 signed

Event deliveries · streaming

crawl.completed · c_2f7a9e1b · 84,213 URLs · 12:04:51
issue.found · Redirect chains · 1,204 pages · 12:04:50
issue.found · Self-ref canonical · 4,118 pages · 12:03:18
crawl.completed · c_9d31a07c · 11,902 URLs · 09:41:02
crawl.failed · c_5b88f240 · robots blocked · 08:15:37

X-CrawlX-Signature: sha256=9c1f4e7b2a8d… (verified)
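A signature check along these lines can be sketched with Python's standard hmac module. The header name and the sha256= prefix come from the example above. Two things are assumptions: that the value is the hex-encoded HMAC-SHA256 of the raw request body, and that it is keyed with a shared webhook secret. The API reference should confirm both.

```python
import hashlib
import hmac


def verify_signature(secret: bytes, raw_body: bytes, header_value: str) -> bool:
    """Check an X-CrawlX-Signature value against the raw request body.

    Assumes the format "sha256=<hex digest>" computed over the unparsed body.
    """
    scheme, _, received = header_value.partition("=")
    if scheme != "sha256" or not received:
        return False
    expected = hmac.new(secret, raw_body, hashlib.sha256).hexdigest()
    # compare_digest runs in constant time, which avoids leaking how many
    # leading characters of the signature matched.
    return hmac.compare_digest(
        expected.encode("ascii"),
        received.strip().lower().encode("utf-8"),
    )
```

Verify against the bytes exactly as received, before any JSON parsing, since re-serializing the payload can change whitespace or key order and invalidate the digest.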

Everything that connects

Pull data in, push fixes out

Native Google integrations, GitHub for fixes, and a REST API plus webhooks for everything else.

Search Console

Pull impressions, clicks, CTR, and average position per URL — CrawlX uses real demand to weight every issue, so the top of the list is the work that actually earns traffic.

Google Analytics 4

Overlay sessions and traffic on the crawl so a thin-content warning on a 40k-session page outranks the same warning on a page nobody visits.

PageSpeed Insights

Lighthouse performance, accessibility, and best-practice scores pulled straight onto your pages, alongside lab Core Web Vitals from the crawl.

GitHub

High-impact fixes land as pull requests on a branch. A human reviews and merges — CrawlX never auto-merges and never pushes to your default branch.

REST API

Environment-scoped keys (cx_live_ / cx_test_) with read and crawl-trigger scopes. Trigger crawls, poll status, list issues and pages, and export from your own stack.

Webhooks

Subscribe to crawl.completed, crawl.failed, and issue.found. Every delivery is signed with HMAC-SHA256 so you can verify it really came from CrawlX.

Plus exports in CSV, JSON, PDF, and XML sitemap. Read the full API reference

How it fits together

Connect, call, react

Wire CrawlX into your pipeline once, then let crawls and events flow both ways.

Connect your sources

Authorize Search Console, GA4, PageSpeed Insights, and GitHub in a few clicks. Real demand and performance data start weighting your crawl immediately.

OAuth in clicks · Demand-weighted · Per-project

Call the API

Mint a cx_live_ or cx_test_ key, trigger a crawl, then poll the returned crawl_id for status and pull issues, pages, and exports.

Scoped keys · JSON responses · Idempotent
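The poll step can be sketched as a loop with exponential backoff. The fetch_status callable is a hypothetical stand-in for a GET on the poll path returned when the crawl was queued. The terminal status names "completed" and "failed" are assumptions inferred from the webhook event names; only "queued" appears in the example response.

```python
import time
from typing import Callable

# Assumed terminal states, inferred from crawl.completed / crawl.failed.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_crawl(fetch_status: Callable[[], dict],
                   interval_s: float = 5.0,
                   max_interval_s: float = 60.0,
                   timeout_s: float = 3600.0,
                   sleep: Callable[[float], None] = time.sleep) -> dict:
    """Poll until the crawl reaches a terminal status or the timeout passes."""
    waited = 0.0
    delay = interval_s
    while True:
        crawl = fetch_status()
        if crawl.get("status") in TERMINAL_STATUSES:
            return crawl
        if waited >= timeout_s:
            raise TimeoutError("crawl did not finish before the timeout")
        sleep(delay)
        waited += delay
        # Double the wait after each attempt, capped at max_interval_s.
        delay = min(delay * 2, max_interval_s)
```

Where webhooks are available, subscribing to crawl.completed and crawl.failed avoids polling altogether; a loop like this is mainly useful in scripts and CI jobs that cannot receive inbound requests.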

React to events

Get a signed POST the moment a crawl finishes, fails, or surfaces a new issue. Verify the HMAC-SHA256 signature and kick off your own automation.

Real-time · HMAC-signed · Retried
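Once a delivery is verified, routing it to your own automation can be as simple as a lookup table keyed by event name. The three event names come from this page; the payload fields "type" and "crawl_id" and the handler functions are hypothetical, since the payload schema is not shown here.

```python
import json
from typing import Callable, Dict


def on_crawl_completed(event: dict) -> str:
    return f"completed:{event.get('crawl_id')}"


def on_crawl_failed(event: dict) -> str:
    return f"failed:{event.get('crawl_id')}"


def on_issue_found(event: dict) -> str:
    return f"issue:{event.get('crawl_id')}"


HANDLERS: Dict[str, Callable[[dict], str]] = {
    "crawl.completed": on_crawl_completed,
    "crawl.failed": on_crawl_failed,
    "issue.found": on_issue_found,
}


def dispatch(raw_body: bytes) -> str:
    """Parse a verified webhook body and route it by its assumed "type" field."""
    event = json.loads(raw_body)
    handler = HANDLERS.get(event.get("type"))
    if handler is None:
        # Unknown event types are ignored rather than treated as errors,
        # so new event types do not break the endpoint.
        return "ignored"
    return handler(event)
```

Since deliveries are described as retried, handlers may see the same event more than once, so it is safer to write them so that repeating an action is harmless.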

Keep exploring

Explore more features

Integrations are one piece of the loop. Here's the rest of what CrawlX does.

Cloud crawl engine

Spider entire sites in the cloud — up to 500,000 URLs per crawl, with JS rendering.

Impact triage & 65+ checks

Every issue ranked by traffic impact and grouped by root cause across 13 categories.

AI: fixes, content & schema

Bring your own key for fix drafting, content scoring, SERP tuning, and JSON-LD.

Technical-SEO toolkit

Link explorer, schema inspector, robots tester, crawl compare, and more.

Reports & collaboration

White-label PDF reports, shareable links, and team roles from Owner to Viewer.

All features

See the full picture — crawl, diagnose, and ship the fix, in one tool.

Wire CrawlX into your stack.
Automate the whole loop.

Connect Google, push fixes to GitHub, and drive crawls from your own code in minutes.

Start crawling free Read the API docs
