This n8n workflow scrapes every web page listed in a website’s sitemap, extracts the meaningful content, and saves a detailed report for each page to Google Drive. Starting from a manual trigger, it fetches the sitemap XML and parses all URLs. For each URL, it sends an HTTP request to retrieve the page, then extracts the page title, the content, and technical details such as the detected framework (WordPress, Divi) and whether JavaScript is present. The extracted data is formatted into a Markdown report that includes a content summary, and the report is saved as a file in Google Drive. The workflow is suited to content audits, SEO analysis, and monitoring website updates with minimal manual effort.
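The parsing, extraction, and report-formatting steps described above can be sketched outside n8n in plain Python. This is a minimal illustration under stated assumptions, not the workflow's actual Code node logic: the function names (`parse_sitemap`, `analyze_page`, `to_markdown`), the 200-character summary length, and the detection markers (`wp-content`, `wp-includes`, `et_pb_`, `/themes/divi`) are all hypothetical choices made for this example. The HTTP fetch and the Google Drive upload are left out.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Namespace used by standard sitemap files (sitemaps.org protocol).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def parse_sitemap(xml_text: str) -> list[str]:
    """Return every <loc> URL found in a sitemap XML string."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iter(f"{SITEMAP_NS}loc") if loc.text]


class _PageParser(HTMLParser):
    """Collects the <title>, visible text, and whether a <script> tag exists."""

    def __init__(self) -> None:
        super().__init__()
        self.title = ""
        self.text_parts: list[str] = []
        self.has_script = False
        self._stack: list[str] = []  # currently open title/script/style tags

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self.has_script = True
        if tag in ("title", "script", "style"):
            self._stack.append(tag)

    def handle_endtag(self, tag):
        if self._stack and self._stack[-1] == tag:
            self._stack.pop()

    def handle_data(self, data):
        current = self._stack[-1] if self._stack else None
        if current == "title":
            self.title += data
        elif current is None and data.strip():
            # Text outside title/script/style counts as page content.
            self.text_parts.append(data.strip())


def analyze_page(url: str, html: str, summary_chars: int = 200) -> dict:
    """Extract title, a short text summary, and simple technical hints."""
    parser = _PageParser()
    parser.feed(html)
    parser.close()
    lowered = html.lower()
    frameworks = []
    # Assumed heuristics: common WordPress paths and Divi class/theme markers.
    if "wp-content" in lowered or "wp-includes" in lowered:
        frameworks.append("WordPress")
    if "et_pb_" in lowered or "/themes/divi" in lowered:
        frameworks.append("Divi")
    return {
        "url": url,
        "title": parser.title.strip() or "(no title)",
        "summary": " ".join(parser.text_parts)[:summary_chars],
        "frameworks": frameworks,
        "has_javascript": parser.has_script,
    }


def to_markdown(page: dict) -> str:
    """Format one analyzed page as a small Markdown report."""
    frameworks = ", ".join(page["frameworks"]) or "none detected"
    return "\n".join([
        f"# {page['title']}",
        "",
        f"- URL: {page['url']}",
        f"- Frameworks: {frameworks}",
        f"- JavaScript present: {'yes' if page['has_javascript'] else 'no'}",
        "",
        "## Content summary",
        "",
        page["summary"],
    ])
```

In this sketch, each URL returned by `parse_sitemap` would be fetched separately, passed to `analyze_page`, and the result of `to_markdown` written to a file. String matching on the raw HTML is a deliberately simple detection strategy and can produce false positives.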
Automated Sitemap Webpage Scraper to Google Drive
Property | Value |
---|---|
Node Count | 11–20 nodes |
Nodes Used | code, googleDrive, httpRequest, limit, manualTrigger, set, splitInBatches, splitOut, stickyNote, wait, xml |