Name: Automated Entity Extraction from Web Pages using Google API - Zombie Bunny - n8n Workflows
Availability: InStock

This n8n workflow automates the process of extracting named entities from web pages through Google’s Natural Language API. It is designed to facilitate quick retrieval and analysis of relevant information from URLs submitted via a webhook. The workflow begins with a webhook trigger that receives a URL, followed by fetching the webpage content using an HTTP request. The HTML content is then prepared and sent to Google’s API to analyze and identify entities such as people, locations, and organizations. Finally, the results are returned to the requester, providing structured insights for research, content enrichment, or data analysis.

### Workflow Steps:

1. **Webhook Trigger:** The workflow is triggered when a POST request is received at the webhook endpoint with a JSON body containing the URL to analyze.

2. **Fetch Web Page:** The URL from the payload is used to make an HTTP request fetching the page content.

3. **Prepare Content:** The retrieved HTML content is processed through a code node to trim or prepare the data before API submission.

4. **Entity Analysis:** The cleaned HTML content is sent to Google’s Natural Language API using an HTTP request to analyze entities within the page.

5. **Respond to Request:** The API response, containing identified entities with details, is returned to the user through the webhook response.

### Practical Use Cases:

– Automating the collection of structured data from competitor websites.

– Content analysis for SEO and keyword extraction.

– Enhancing research databases with entity metadata.

– Monitoring web content for specific entities or topics.

This workflow is especially valuable for developers and data analysts looking to automate the extraction of meaningful content from web sources quickly and efficiently.