Automated Company Data Scraping, Summarization, and AI Processing Workflow

somdn_product_page

This n8n workflow automates the process of web scraping company data from Indeed using Bright Data’s web unlocking tools, then summarizes and formats this data with AI models before pushing results to webhooks and Airtable. Starting with a manual trigger, it fetches company URLs from Airtable, scrapes detailed company info via Bright Data, and checks if links are valid. Valid links trigger data retrieval, which is then processed by AI models like Google Gemini for summarization and data analysis. The workflow converts markdown responses into HTML for display or further use. Ideal for market research, lead generation, or competitive analysis, this workflow streamlines large-scale data collection and insight generation with minimal manual effort.

Node Count

11 – 20 Nodes

Nodes Used

@n8n/n8n-nodes-langchain.agent, @n8n/n8n-nodes-langchain.chainLlm, @n8n/n8n-nodes-langchain.chainSummarization, @n8n/n8n-nodes-langchain.lmChatGoogleGemini, @n8n/n8n-nodes-langchain.toolHttpRequest, airtable, httpRequest, if, manualTrigger, markdown, set, splitInBatches, stickyNote, wait

Reviews

There are no reviews yet.

Be the first to review “Automated Company Data Scraping, Summarization, and AI Processing Workflow”

Your email address will not be published. Required fields are marked *