Automated Video Narration and Voiceover Generation Workflow

This n8n workflow extracts frames from a video, generates a narration script from them, creates a voiceover, and stores the final audio on Google Drive. It begins by downloading a video from an online source, then runs Python code with OpenCV to extract up to 90 evenly spaced frames for analysis. The frames are batched and resized to keep each processing step within a manageable size.
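The even frame sampling described above can be sketched roughly as follows. This is a minimal illustration, not the workflow's actual Code node: the function names `evenly_spaced_indices` and `extract_frames` are hypothetical, and the sketch assumes the video has already been saved to a local path and that OpenCV (`cv2`) is installed.

```python
def evenly_spaced_indices(total_frames: int, max_frames: int = 90) -> list[int]:
    """Return up to max_frames frame indices spread evenly across the video."""
    if total_frames <= 0 or max_frames <= 0:
        return []
    if total_frames <= max_frames:
        # Short video: keep every frame.
        return list(range(total_frames))
    step = total_frames / max_frames
    return [int(i * step) for i in range(max_frames)]


def extract_frames(video_path: str, max_frames: int = 90) -> list:
    """Read the selected frames from a local video file (assumed path)."""
    import cv2  # imported here so the index helper works without OpenCV

    capture = cv2.VideoCapture(video_path)
    try:
        total = int(capture.get(cv2.CAP_PROP_FRAME_COUNT))
        frames = []
        for index in evenly_spaced_indices(total, max_frames):
            # Seek to the chosen frame, then decode it.
            capture.set(cv2.CAP_PROP_POS_FRAMES, index)
            ok, frame = capture.read()
            if ok:
                frames.append(frame)
        return frames
    finally:
        capture.release()
```

Separating the index calculation from the decoding makes the sampling rule easy to check on its own. The 90-frame cap comes from the description above; everything else is an assumption.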

The workflow sends each batch of frames to a multimodal large language model (LLM) from OpenAI, which generates a cohesive narration script from the visual content. The full script is then converted into a natural-sounding voiceover with OpenAI’s text-to-speech capabilities, and the final audio file is uploaded to Google Drive for easy access.
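One way to picture the batch-by-batch script generation is the sketch below. It is only an illustration under stated assumptions: `describe_batch` is a hypothetical callable standing in for the multimodal LLM request, the batch size of 15 is an assumed value, and passing the script written so far into each call is one plausible way to keep the narration cohesive, not a confirmed detail of this workflow.

```python
from typing import Callable, Sequence


def split_into_batches(items: Sequence, batch_size: int) -> list[list]:
    """Group items into consecutive batches of at most batch_size."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [list(items[i:i + batch_size]) for i in range(0, len(items), batch_size)]


def narrate(
    frames: Sequence,
    describe_batch: Callable[[list, str], str],
    batch_size: int = 15,  # assumed value, not taken from the workflow
) -> str:
    """Build one script by describing each batch in order.

    describe_batch(batch, script_so_far) is a hypothetical stand-in for
    the LLM call; it receives the earlier narration so it can continue it.
    """
    script_parts: list[str] = []
    for batch in split_into_batches(frames, batch_size):
        script_so_far = " ".join(script_parts)
        script_parts.append(describe_batch(batch, script_so_far).strip())
    return " ".join(script_parts)
```

With 90 frames and a batch size of 15, `narrate` would call `describe_batch` six times and join the six partial scripts into the text that is later sent to text-to-speech.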

This workflow suits creators, educators, and marketers who want to generate video narrations or voiceovers automatically from visual content.

Node Count

>20 Nodes

Nodes Used

@n8n/n8n-nodes-langchain.chainLlm, @n8n/n8n-nodes-langchain.lmChatOpenAi, @n8n/n8n-nodes-langchain.openAi, aggregate, code, convertToFile, editImage, googleDrive, httpRequest, manualTrigger, splitInBatches, splitOut, stickyNote, wait
