This n8n workflow handles messages received via WhatsApp, including audio, video, images, and plain text, and generates replies using AI models. When a message arrives through the WhatsApp trigger, it is split into its individual parts, and each part is routed according to its type. Audio messages are transcribed with Google Gemini, videos are described, and images are analyzed to extract their content. Text messages are summarized. An AI agent then consolidates the results into a relevant, factual response. The workflow is a practical starting point for AI-powered WhatsApp chatbots that need to handle mixed media inputs and deliver context-aware replies, for example in customer support, information retrieval, or interactive assistance.
WhatsApp Media Message Processing and AI Response Workflow
Node Count | >20 Nodes |
---|---|
Nodes Used | @n8n/n8n-nodes-langchain.agent, @n8n/n8n-nodes-langchain.chainLlm, @n8n/n8n-nodes-langchain.lmChatGoogleGemini, @n8n/n8n-nodes-langchain.memoryBufferWindow, @n8n/n8n-nodes-langchain.toolWikipedia, httpRequest, set, splitOut, stickyNote, switch, wait, whatsApp, whatsAppTrigger |
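The split-then-route step from the description can be illustrated outside n8n with a small Python sketch. This is not the workflow's actual node configuration: the nested `entry` / `changes` / `value` / `messages` layout and the `type` field are assumed from the general shape of WhatsApp webhook payloads, and the `ROUTES` table, `split_messages`, and `route` are hypothetical names used only for illustration.

```python
from typing import Any

# Hypothetical mapping from message type to the processing step
# named in the description (not taken from the workflow itself).
ROUTES = {
    "audio": "transcribe",
    "video": "describe",
    "image": "analyze",
    "text": "summarize",
}


def split_messages(payload: dict[str, Any]) -> list[dict[str, Any]]:
    """Flatten a webhook payload into individual messages.

    Assumes the payload nests messages under
    entry[] -> changes[] -> value -> messages[].
    """
    messages: list[dict[str, Any]] = []
    for entry in payload.get("entry", []):
        for change in entry.get("changes", []):
            messages.extend(change.get("value", {}).get("messages", []))
    return messages


def route(message: dict[str, Any]) -> str:
    """Pick a processing step from the message's type field."""
    return ROUTES.get(message.get("type", ""), "unsupported")


if __name__ == "__main__":
    # Made-up sample payload for demonstration only.
    sample = {
        "entry": [
            {
                "changes": [
                    {
                        "value": {
                            "messages": [
                                {"type": "audio", "audio": {"id": "media-1"}},
                                {"type": "text", "text": {"body": "hello"}},
                            ]
                        }
                    }
                ]
            }
        ]
    }
    for msg in split_messages(sample):
        print(msg["type"], "->", route(msg))
```

In the workflow itself, this role would presumably be played by the splitOut and switch nodes listed above; unknown types fall through to an explicit `"unsupported"` branch here so that they are not silently dropped.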