Webpage to Markdown & Link Extraction Workflow

somdn_product_page

This n8n workflow automates the process of retrieving web pages, converting their content into markdown, and extracting links for storage or analysis. It is designed to handle batch processing efficiently while respecting API rate limits. The workflow begins with a manual trigger, allowing users to initiate the process on demand. It then retrieves URLs from a data source, splits them into batches of 10 for API rate limit compliance, and processes each URL through the Firecrawl.dev API. The API converts page HTML to markdown and extracts links, providing clean, structured content suitable for AI analysis or content management. Users can customize the source of URLs and the output data destination, making it ideal for content scraping, SEO auditing, or data enrichment scenarios.

Node Count

11 – 20 Nodes

Nodes Used

httpRequest, limit, manualTrigger, noOp, set, splitInBatches, splitOut, stickyNote, wait

Reviews

There are no reviews yet.

Be the first to review “Webpage to Markdown & Link Extraction Workflow”

Your email address will not be published. Required fields are marked *