This n8n workflow automates the process of fetching, parsing, and filtering sitemap XML files. It is designed to extract URLs, convert them into JSON, and filter out specific file types, such as PDFs. The workflow starts with a manual trigger, then sets a sitemap URL (default is DuckDuckGo’s sitemap). It fetches the sitemap via an HTTP request, converts the XML data into JSON format, and finally filters URLs based on criteria (e.g., URLs ending with ‘.pdf’). This setup is particularly useful for web developers and SEO specialists who need to monitor or analyze specific links within a sitemap, automate content audits, or gather specific resources like PDF files for further processing.
Automate Sitemap URL Processing and PDF URL Filtering
Node Count | 6 – 10 Nodes |
---|---|
Nodes Used | filter, httpRequest, manualTrigger, set, splitOut, stickyNote, xml |
Reviews
There are no reviews yet.