Find logo

Find Sitemap URLs (Utility)

FindFind Sitemap URLs (Utility)

POST/v1/find-sitemap-urls-texau
texau__find-sitemap-urls-texauenrichment
cURL · api.texau.app
84ms
POST/v1/find-sitemap-urls-texau
{
  "domain": "https://acme.com"
}
200 Accepted
{
  "ok": true,
  "data": {
    "domain": "https://acme.com",
    "filtered_url_count": "https://acme.com",
    "sitemap_url": "https://acme.com",
    "total_urls_count": "https://acme.com",
    "url": "https://acme.com"
  }
}

Extracts all URLs from a website's sitemap.xml for SEO analysis and site auditing.

Install

Add find sitemap urls (utility) to your MCP client.

Drop this into claude_desktop_config.json (or your client's equivalent) and the tool shows up in any chat.

claude_desktop_config.json
{
  "mcpServers": {
    "texau": {
      "command": "npx",
      "args": ["-y", "@texau/mcp-server"],
      "env": { "TEXAU_API_KEY": "..." }
    }
  }
}

Tool name: texau__find-sitemap-urls-texau

When to use this.

The "Find Sitemap URLs" action is designed to extract all URLs from a website's sitemap.xml, facilitating SEO analysis and site auditing. By inputting the root domain (e.g., https://example.com), users can efficiently gather essential data for optimizing their online presence. The action outputs key information, including the domain (companyDomain), the total count of discovered URLs (total_urls_count), the count of filtered URLs (filtered_urls_count), and the actual sitemap URLs (sitemap_urls) along with a list of all discovered URLs (urls). This action is particularly useful for digital marketers, SEO professionals, and web developers seeking to enhance website performance and visibility in search engines, making it a vital tool in the enrichment category.

Try it

Run a sample request.

The response is a deterministic, cached example. No live call, no credits used.

Find Sitemap URLs (Utility)

Try it

The root domain to scan for sitemap and URLs (e.g., https://example.com).

Response is cached — no live API call.

Response

Output schema.

Every field returned in `data`. Click rows to expand nested objects.

FieldType
  • domain
    Domain
    text
  • filtered_url_count
    Filtered Url Count
    nullablenumber
  • sitemap_url
    Sitemap Url
    array
  • total_urls_count
    Total Urls Count
    number
  • url
    Url
    nullablearray

Integrate

Copy-pasteable snippets.

Real endpoint: https://v3-api.texau.com/api/v1/find-sitemap-urls-texau. Auth: x-api-key.

cURL · api.texau.app
84ms
POST/v1/find-sitemap-urls-texau
curl -X POST 'https://v3-api.texau.com/api/v1/find-sitemap-urls-texau' \
  -H 'x-api-key: $TEXAU_API_KEY' \
  -H 'content-type: application/json' \
  -d '{"domain":"https://acme.com"}'
200 Accepted
{
  "ok": true,
  "data": {
    "domain": "https://acme.com",
    "filtered_url_count": "https://acme.com",
    "sitemap_url": "https://acme.com",
    "total_urls_count": "https://acme.com",
    "url": "https://acme.com"
  }
}

Compose

How this fits a workflow.

The next 2 actions most operators chain after this one.

Find Sitemap URLs (Utility)EnrichPush to CRM

Output

Results land in a TexAu table.

Sample rows below.

Real result preview coming soon.

InputStatusScore
[email protected]valid96
[email protected]risky54
[email protected]invalid12

Workflow

A real example.

Trigger → find sitemap urls (utility) → enrich → push to your CRM. ~80 ms operator effort, the rest runs in the background.

Step 1TriggerNew row, webhook, or schedule.
Step 2Find Sitemap URLs (Utility)Find action runs.
Step 3EnrichWaterfall fills missing fields.
Step 4Push to CRMHubSpot / Salesforce / Pipedrive.

Built for

Who runs this.

GTM EngineerRevOpsAgency

Reliability

Rate limits & reliability.

  • Per-minute limit300 / min
  • Per-day limit10,000 / day
  • RetriesAutomatic w/ backoff
  • ModeSync

Errors

HTTP status codes.

What each response means and what to do about it.

CodeCauseFix
200 OKAction ran. Data in `data`.Read response.
400 Bad RequestMissing or malformed input.Validate against the input schema.
401 UnauthorizedMissing or invalid `x-api-key`.Re-issue from /api-platform.
403 ForbiddenWorkspace lacks plan tier.Upgrade or contact sales.
404 Not FoundAction key not recognized.Verify the slug.
429 Rate LimitedPer-minute or per-day cap hit.Backoff; reduce concurrency.
500 Server ErrorUnexpected TexAu issue.Retry with backoff.
502 Bad GatewayUpstream provider 5xx.Retry; we surface root cause.
504 TimeoutUpstream slower than maxLatency.Switch to `isAsync` polling.

Pricing

What it costs to run.

Light tier

Pricing tier on /pricing. Per-action credit cost is private.

FAQ.

  • Is this real-time?

    Yes. Synchronous actions return in ~1–4 s. Long-running work uses async polling (see status 504 → switch to async).

  • Do I get charged on failure?

    No. Verified failures cost zero credits. Provider miss / 5xx / timeout cascade to the next provider in the waterfall when applicable.

  • Does it work with Claude / Cursor via MCP?

    Yes. Add the texau MCP server to your client config, then call `texau__find-...` directly.

  • What CRMs can I push results to?

    HubSpot, Salesforce, Pipedrive, Zoho, and GoHighLevel are bidirectional. Smartlead, Instantly, Lemlist, HeyReach, Apollo Sequences, and Reply.io for outbound.

Run Find Sitemap URLs (Utility) in 60 seconds.

Pull your API key, paste the cURL, ship to your CRM.