Apify logo

Scrape Website to Markdown (Apify)

ApifyScrape Website to Markdown (Apify)

POST/v1/apify-website-to-markdown
texau__apify-website-to-markdownenrichment
cURL · api.texau.app
84ms
POST/v1/apify-website-to-markdown
{
  "url": "https://acme.com"
}
200 Accepted
{
  "ok": true,
  "data": {
    "markdown": "sample",
    "title": "sample",
    "metadata_description": "sample",
    "metadata_language": "sample",
    "metadata_keyword": "sample",
    "metadata_robot": "sample",
    "metadata_og_title": "sample",
    "metadata_og_description": "sample",
    "metadata_og_url": "https://acme.com",
    "metadata_og_image": "sample",
    "metadata_source_url": "https://acme.com",
    "metadata_canonical": "sample"
  }
}

Scrapes website content and converts it to structured markdown format using Apify.

Install

Add scrape website to markdown (apify) to your MCP client.

Drop this into claude_desktop_config.json (or your client's equivalent) and the tool shows up in any chat.

claude_desktop_config.json
{
  "mcpServers": {
    "texau": {
      "command": "npx",
      "args": ["-y", "@texau/mcp-server"],
      "env": { "TEXAU_API_KEY": "..." }
    }
  }
}

Tool name: texau__apify-website-to-markdown

When to use this.

The "Scrape Website to Markdown (Apify)" action efficiently extracts content from a specified website and converts it into a structured markdown format, facilitating content enrichment and management. By providing the target website URL as an input parameter, users can seamlessly retrieve essential information. The output includes various fields such as markdown content, title, and comprehensive metadata, including descriptions, keywords, and Open Graph properties, all categorized under enrichment. This action is ideal for developers and content creators looking to repurpose web content for documentation, blogging, or data analysis, ensuring that the extracted information is both accessible and well-structured for further use.

Try it

Run a sample request.

The response is a deterministic, cached example. No live call, no credits used.

Scrape Website to Markdown (Apify)

Try it

Target website URL whose content will be converted to markdown.

Response is cached — no live API call.

Response

Output schema.

Every field returned in `data`. Click rows to expand nested objects.

FieldType
  • markdown
    Markdown
    text
  • title
    Title
    text
  • metadata_description
    Metadata Description
    text
  • metadata_language
    Metadata Language
    nullabletext
  • metadata_keyword
    Metadata Keyword
    nullabletext
  • metadata_robot
    Metadata Robot
    nullabletext
  • metadata_og_title
    Metadata Og Title
    nullabletext
  • metadata_og_description
    Metadata Og Description
    nullabletext
  • metadata_og_url
    Metadata Og Url
    nullabletext
  • metadata_og_image
    Metadata Og Image
    nullabletext
  • metadata_source_url
    Metadata Source Url
    text
  • metadata_canonical
    Metadata Canonical
    nullabletext

Integrate

Copy-pasteable snippets.

Real endpoint: https://v3-api.texau.com/api/v1/apify-website-to-markdown. Auth: x-api-key.

cURL · api.texau.app
84ms
POST/v1/apify-website-to-markdown
curl -X POST 'https://v3-api.texau.com/api/v1/apify-website-to-markdown' \
  -H 'x-api-key: $TEXAU_API_KEY' \
  -H 'content-type: application/json' \
  -d '{"url":"https://acme.com"}'
200 Accepted
{
  "ok": true,
  "data": {
    "markdown": "sample",
    "title": "sample",
    "metadata_description": "sample",
    "metadata_language": "sample",
    "metadata_keyword": "sample",
    "metadata_robot": "sample",
    "metadata_og_title": "sample",
    "metadata_og_description": "sample",
    "metadata_og_url": "https://acme.com",
    "metadata_og_image": "sample",
    "metadata_source_url": "https://acme.com",
    "metadata_canonical": "sample"
  }
}

Output

Results land in a TexAu table.

Sample rows below.

Real result preview coming soon.

InputStatusScore
[email protected]valid96
[email protected]risky54
[email protected]invalid12

Workflow

A real example.

Trigger → scrape website to markdown (apify) → enrich → push to your CRM. ~80 ms operator effort, the rest runs in the background.

Step 1TriggerNew row, webhook, or schedule.
Step 2Scrape Website to Markdown (Apify)Apify action runs.
Step 3EnrichWaterfall fills missing fields.
Step 4Push to CRMHubSpot / Salesforce / Pipedrive.

Built for

Who runs this.

GTM EngineerRevOpsAgency

Reliability

Rate limits & reliability.

  • Per-minute limit60 / min
  • Per-day limit10,000 / day
  • RetriesAutomatic w/ backoff
  • ModeSync

Errors

HTTP status codes.

What each response means and what to do about it.

CodeCauseFix
200 OKAction ran. Data in `data`.Read response.
400 Bad RequestMissing or malformed input.Validate against the input schema.
401 UnauthorizedMissing or invalid `x-api-key`.Re-issue from /api-platform.
403 ForbiddenWorkspace lacks plan tier.Upgrade or contact sales.
404 Not FoundAction key not recognized.Verify the slug.
429 Rate LimitedPer-minute or per-day cap hit.Backoff; reduce concurrency.
500 Server ErrorUnexpected TexAu issue.Retry with backoff.
502 Bad GatewayUpstream provider 5xx.Retry; we surface root cause.
504 TimeoutUpstream slower than maxLatency.Switch to `isAsync` polling.

Pricing

What it costs to run.

Standard tier

Pricing tier on /pricing. Per-action credit cost is private.

FAQ.

  • Is this real-time?

    Yes. Synchronous actions return in ~1–4 s. Long-running work uses async polling (see status 504 → switch to async).

  • Do I get charged on failure?

    No. Verified failures cost zero credits. Provider miss / 5xx / timeout cascade to the next provider in the waterfall when applicable.

  • Does it work with Claude / Cursor via MCP?

    Yes. Add the texau MCP server to your client config, then call `texau__apify-...` directly.

  • What CRMs can I push results to?

    HubSpot, Salesforce, Pipedrive, Zoho, and GoHighLevel are bidirectional. Smartlead, Instantly, Lemlist, HeyReach, Apollo Sequences, and Reply.io for outbound.

Run Scrape Website to Markdown (Apify) in 60 seconds.

Pull your API key, paste the cURL, ship to your CRM.