Automate Sitemap URL Processing and PDF URL Filtering

somdn_product_page

This n8n workflow automates the process of fetching, parsing, and filtering sitemap XML files. It is designed to extract URLs, convert them into JSON, and filter out specific file types, such as PDFs. The workflow starts with a manual trigger, then sets a sitemap URL (default is DuckDuckGo’s sitemap). It fetches the sitemap via an HTTP request, converts the XML data into JSON format, and finally filters URLs based on criteria (e.g., URLs ending with ‘.pdf’). This setup is particularly useful for web developers and SEO specialists who need to monitor or analyze specific links within a sitemap, automate content audits, or gather specific resources like PDF files for further processing.

Node Count

6 – 10 Nodes

Nodes Used

filter, httpRequest, manualTrigger, set, splitOut, stickyNote, xml

Reviews

There are no reviews yet.

Be the first to review “Automate Sitemap URL Processing and PDF URL Filtering”

Your email address will not be published. Required fields are marked *