Automated News Scraping and Storage Workflow


This n8n workflow automates the process of collecting news articles from a specified website, processing the data with AI, and storing it in Google Sheets for analysis. It is designed for users who want to regularly gather news content without manual intervention.

The workflow begins with a schedule trigger, which activates the process at defined intervals, such as daily or hourly. The trigger leads to the AI-powered scraper node, which uses the ScrapeGraphAI service to extract news articles from a given website based on natural language prompts. The scraper analyzes the site (e.g., BBC) and retrieves structured information such as article titles, URLs, and categories.
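For illustration, the prompt and the structured result might look like the following (the field names and output shape are hypothetical; the actual structure depends on the prompt you give ScrapeGraphAI):

```javascript
// Hypothetical example of a natural-language prompt for the scraper node
// and the kind of structured result it could return (shape illustrative).
const prompt =
  "Extract the latest news articles from this page. " +
  "For each article return its title, url, and category.";

// Example of a structured output the scraper might produce:
const scrapedResult = {
  articles: [
    {
      title: "Example headline about world events",
      url: "https://www.bbc.com/news/example-article",
      category: "World",
    },
  ],
};

console.log(scrapedResult.articles.length); // → 1
```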

Next, the data is processed in a Code node that formats and filters the extracted articles, ensuring a consistent structure and preparing the data for storage. This step maps each article into a predefined format containing only the necessary fields.
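A minimal sketch of what such a Code node might do, assuming the scraper returns an `articles` array with `title`, `url`, and `category` fields (the field names are illustrative, not taken from the workflow itself):

```javascript
// Sketch of the formatting/filtering step, assuming each incoming item
// carries an `articles` array from the scraper (field names illustrative).
function formatArticles(items) {
  return items
    .flatMap((item) => item.json.articles || [])
    // Drop entries missing the fields needed downstream.
    .filter((a) => a.title && a.url)
    // Map each article into the fixed structure stored in the sheet.
    .map((a) => ({
      json: {
        title: a.title.trim(),
        url: a.url,
        category: a.category || "Uncategorized",
      },
    }));
}

// In an n8n Code node this would typically end with:
// return formatArticles($input.all());
```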

Finally, the workflow connects to Google Sheets via OAuth2 and appends each set of articles, creating a continuous flow of news data. This enables users to maintain an up-to-date spreadsheet of articles for monitoring trends or conducting further analysis.
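Conceptually, the append step maps each formatted article onto spreadsheet columns. A sketch, assuming a sheet with Date, Title, URL, and Category columns (the column layout is illustrative and is configured in the Google Sheets node):

```javascript
// Sketch: turning formatted article objects into row arrays matching
// a sheet with columns Date | Title | URL | Category (illustrative).
function toSheetRows(articles, fetchedAt) {
  return articles.map((a) => [fetchedAt, a.title, a.url, a.category]);
}

const rows = toSheetRows(
  [{ title: "Example headline", url: "https://example.com/a", category: "World" }],
  "2024-01-01"
);
console.log(rows[0]);
// → ["2024-01-01", "Example headline", "https://example.com/a", "World"]
```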

This automation is ideal for media monitoring, research, or content aggregation scenarios where timely collection and organized storage of news content are required.

Node Count

6–10 nodes

Nodes Used

code, googleSheets, n8n-nodes-scrapegraphai.scrapegraphAi, scheduleTrigger, stickyNote
