Scrape Website to Markdown (Apify)
Apify → Scrape Website to Markdown (Apify)
/v1/apify-website-to-markdown{ "url": "https://acme.com" }
{ "ok": true, "data": { "markdown": "sample", "title": "sample", "metadata_description": "sample", "metadata_language": "sample", "metadata_keyword": "sample", "metadata_robot": "sample", "metadata_og_title": "sample", "metadata_og_description": "sample", "metadata_og_url": "https://acme.com", "metadata_og_image": "sample", "metadata_source_url": "https://acme.com", "metadata_canonical": "sample" } }
Scrapes website content and converts it to structured markdown format using Apify.
Install
Add scrape website to markdown (apify) to your MCP client.
Drop this into claude_desktop_config.json (or your client's equivalent) and the tool shows up in any chat.
{ "mcpServers": { "texau": { "command": "npx", "args": ["-y", "@texau/mcp-server"], "env": { "TEXAU_API_KEY": "..." } } } }
Tool name: texau__apify-website-to-markdown
When to use this.
The "Scrape Website to Markdown (Apify)" action efficiently extracts content from a specified website and converts it into a structured markdown format, facilitating content enrichment and management. By providing the target website URL as an input parameter, users can seamlessly retrieve essential information. The output includes various fields such as markdown content, title, and comprehensive metadata, including descriptions, keywords, and Open Graph properties, all categorized under enrichment. This action is ideal for developers and content creators looking to repurpose web content for documentation, blogging, or data analysis, ensuring that the extracted information is both accessible and well-structured for further use.
Try it
Run a sample request.
The response is a deterministic, cached example. No live call, no credits used.
Scrape Website to Markdown (Apify)
Response
Output schema.
Every field returned in `data`. Click rows to expand nested objects.
markdownMarkdowntexttitleTitletextmetadata_descriptionMetadata Descriptiontextmetadata_languageMetadata Languagenullabletextmetadata_keywordMetadata Keywordnullabletextmetadata_robotMetadata Robotnullabletextmetadata_og_titleMetadata Og Titlenullabletextmetadata_og_descriptionMetadata Og Descriptionnullabletextmetadata_og_urlMetadata Og Urlnullabletextmetadata_og_imageMetadata Og Imagenullabletextmetadata_source_urlMetadata Source Urltextmetadata_canonicalMetadata Canonicalnullabletext
Integrate
Copy-pasteable snippets.
Real endpoint: https://v3-api.texau.com/api/v1/apify-website-to-markdown. Auth: x-api-key.
/v1/apify-website-to-markdowncurl -X POST 'https://v3-api.texau.com/api/v1/apify-website-to-markdown' \ -H 'x-api-key: $TEXAU_API_KEY' \ -H 'content-type: application/json' \ -d '{"url":"https://acme.com"}'
{ "ok": true, "data": { "markdown": "sample", "title": "sample", "metadata_description": "sample", "metadata_language": "sample", "metadata_keyword": "sample", "metadata_robot": "sample", "metadata_og_title": "sample", "metadata_og_description": "sample", "metadata_og_url": "https://acme.com", "metadata_og_image": "sample", "metadata_source_url": "https://acme.com", "metadata_canonical": "sample" } }
Compose
How this fits a workflow.
The next 2 actions most operators chain after this one.
enrichment
Search LinkedIn Sales Navigator People (Apify)
Scrape LinkedIn Sales Navigator search results via Apify. Returns a list of profiles including Name, Title, Company, and Location.
enrichment
Search LinkedIn Profiles (Apify)
Search for LinkedIn profiles using filters (Current Company, Job Title, Location, Past Company).
enrichment
Get LinkedIn Posts from Profile (Apify)
Retrieves all LinkedIn posts from a specified user profile using Apify.
Output
Results land in a TexAu table.
Sample rows below.
Real result preview coming soon.
| Input | Status | Score |
|---|---|---|
| [email protected] | valid | 96 |
| [email protected] | risky | 54 |
| [email protected] | invalid | 12 |
Workflow
A real example.
Trigger → scrape website to markdown (apify) → enrich → push to your CRM. ~80 ms operator effort, the rest runs in the background.
Built for
Who runs this.
Reliability
Rate limits & reliability.
- Per-minute limit60 / min
- Per-day limit10,000 / day
- RetriesAutomatic w/ backoff
- ModeSync
Errors
HTTP status codes.
What each response means and what to do about it.
| Code | Cause | Fix |
|---|---|---|
| 200 OK | Action ran. Data in `data`. | Read response. |
| 400 Bad Request | Missing or malformed input. | Validate against the input schema. |
| 401 Unauthorized | Missing or invalid `x-api-key`. | Re-issue from /api-platform. |
| 403 Forbidden | Workspace lacks plan tier. | Upgrade or contact sales. |
| 404 Not Found | Action key not recognized. | Verify the slug. |
| 429 Rate Limited | Per-minute or per-day cap hit. | Backoff; reduce concurrency. |
| 500 Server Error | Unexpected TexAu issue. | Retry with backoff. |
| 502 Bad Gateway | Upstream provider 5xx. | Retry; we surface root cause. |
| 504 Timeout | Upstream slower than maxLatency. | Switch to `isAsync` polling. |
Pricing
What it costs to run.
Pricing tier on /pricing. Per-action credit cost is private.
Related
More Apify actions.
enrichment
Search LinkedIn Sales Navigator People (Apify)
Scrape LinkedIn Sales Navigator search results via Apify. Returns a list of profiles including Name, Title, Company, and Location.
enrichment
Search LinkedIn Profiles (Apify)
Search for LinkedIn profiles using filters (Current Company, Job Title, Location, Past Company).
enrichment
Get LinkedIn Posts from Profile (Apify)
Retrieves all LinkedIn posts from a specified user profile using Apify.
enrichment
Search LinkedIn Companies (Apify)
Retrieve enriched LinkedIn company search results returns detailed company profiles including name, industries, employee counts, locations, LinkedIn URL, websit
FAQ.
Is this real-time?
Yes. Synchronous actions return in ~1–4 s. Long-running work uses async polling (see status 504 → switch to async).
Do I get charged on failure?
No. Verified failures cost zero credits. Provider miss / 5xx / timeout cascade to the next provider in the waterfall when applicable.
Does it work with Claude / Cursor via MCP?
Yes. Add the texau MCP server to your client config, then call `texau__apify-...` directly.
What CRMs can I push results to?
HubSpot, Salesforce, Pipedrive, Zoho, and GoHighLevel are bidirectional. Smartlead, Instantly, Lemlist, HeyReach, Apollo Sequences, and Reply.io for outbound.
Run Scrape Website to Markdown (Apify) in 60 seconds.
Pull your API key, paste the cURL, ship to your CRM.