Automated Video Narration and Voiceover Generation Workflow

somdn_product_page

This n8n workflow automates the process of creating a narrated voiceover for a video by extracting frames, generating a script, and converting it into audio. It begins with downloading a video from an online source, then uses Python with OpenCV to extract a set number of evenly distributed frames, ensuring efficient processing. These frames are batched into smaller groups, resized, and then sent to a multimodal language model (via OpenAI) to generate a descriptive narration script in chunks, which are combined into a full script. Finally, the script is converted into a voiceover audio clip using OpenAI’s Text-to-Speech capabilities, and the resulting audio is uploaded to Google Drive. This workflow is ideal for content creators, educators, or marketers wanting to automatically produce narrated videos from existing footage, saving time and effort in manual editing and voiceover creation.

Node Count

>20 Nodes

Nodes Used

@n8n/n8n-nodes-langchain.chainLlm, @n8n/n8n-nodes-langchain.lmChatOpenAi, @n8n/n8n-nodes-langchain.openAi, aggregate, code, convertToFile, editImage, googleDrive, httpRequest, manualTrigger, splitInBatches, splitOut, stickyNote, wait

Reviews

There are no reviews yet.

Be the first to review “Automated Video Narration and Voiceover Generation Workflow”

Your email address will not be published. Required fields are marked *