Name: Automated Document Parsing and Content Structuring Workflow - Zombie Bunny - n8n Workflows
Availability: InStock

This n8n workflow automates the extraction, analysis, and structuring of content from PDF documents. It begins with either manual initiation or external trigger to fetch a PDF via URL or Google Drive, then encodes and sends the document to Chunkr.ai for parsing. The workflow employs AI models to generate a nested table of contents based on detected section headers and document hierarchy. It extracts section content—text, HTML, and Markdown—and compiles a comprehensive, well-structured output. Final steps include dynamically creating a complete HTML document or Markdown file, which can be used for publishing or further processing. This workflow is ideal for automating document management, content indexing, and digital publication pipelines.

Node Count	>20 Nodes
Nodes Used	@n8n/n8n-nodes-langchain.agent, @n8n/n8n-nodes-langchain.lmChatGoogleGemini, @n8n/n8n-nodes-langchain.outputParserAutofixing, @n8n/n8n-nodes-langchain.outputParserStructured, code, convertToFile, executeWorkflowTrigger, extractFromFile, googleDrive, html, httpRequest, manualTrigger, merge, moveBinaryData, set, stickyNote, stopAndError, switch, wait