Automated Sitemap Webpage Scraper to Google Drive

somdn_product_page

This n8n workflow automates the process of scraping multiple web pages listed in a website’s sitemap, extracting meaningful content, and saving detailed reports to Google Drive. Starting with a manual trigger, it fetches the sitemap XML and parses all URLs. For each URL, it performs an HTTP request to scrape the webpage, extracts the page title, content, and technical details such as framework detection (WordPress, Divi) and JavaScript presence. The extracted data is formatted into a Markdown report, including content summaries, and then saved as a file in Google Drive. This workflow is ideal for content audits, SEO analysis, or monitoring website updates efficiently and with minimal manual effort.

Node Count

11 – 20 Nodes

Nodes Used

code, googleDrive, httpRequest, limit, manualTrigger, set, splitInBatches, splitOut, stickyNote, wait, xml

Reviews

There are no reviews yet.

Be the first to review “Automated Sitemap Webpage Scraper to Google Drive”

Your email address will not be published. Required fields are marked *