Automated Thai Document OCR and Data Extraction to Google Sheets

This n8n workflow extracts structured information from Thai PDF documents and stores it in Google Sheets. A manual trigger starts the run (intended for testing), after which the workflow loads PDF files from a specified folder. Each PDF is passed through Typhoon OCR to extract its text, and a language model then structures that text as JSON. A custom script parses the JSON and pulls out fields such as document ID, date, subject, sender, attachments, details, signatories, contacts, and download links. Finally, the structured data is appended to a designated Google Sheet for record-keeping or further analysis. The workflow suits organizations that handle a large volume of official Thai documents and want to reduce manual data entry.
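
The parsing step described above could look roughly like the following. This is a minimal Python sketch, not the workflow's actual script: the key names in FIELDS and the function name parse_llm_output are hypothetical, chosen only to mirror the fields listed in the description. It also assumes the model may wrap its JSON in extra text, so it pulls out the outermost {...} span before parsing.

```python
import json
import re

# Hypothetical key names mirroring the fields named in the description;
# the real workflow's JSON keys are not documented here.
FIELDS = [
    "document_id", "date", "subject", "sender", "attachments",
    "details", "signatories", "contacts", "download_links",
]


def parse_llm_output(raw: str) -> dict:
    """Pull the JSON object out of a model reply and flatten it to one row."""
    # Take everything from the first "{" to the last "}" so that any
    # surrounding prose or code-fence markers are ignored.
    match = re.search(r"\{.*\}", raw, flags=re.DOTALL)
    if match is None:
        raise ValueError("no JSON object found in model output")
    data = json.loads(match.group(0))

    row = {}
    for field in FIELDS:
        value = data.get(field)
        if value is None:
            # Missing or null fields become empty cells.
            row[field] = ""
        elif isinstance(value, list):
            # Lists (e.g. several attachments) are joined into one cell.
            row[field] = ", ".join(str(item) for item in value)
        else:
            row[field] = str(value)
    return row
```

Each returned dictionary has one entry per column, in a fixed order, so it can be appended to the sheet as a single row. Missing fields are kept as empty strings instead of being dropped, which keeps the columns aligned across documents.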

Node Count

6 – 10 Nodes

Nodes Used

@n8n/n8n-nodes-langchain.chainLlm, @n8n/n8n-nodes-langchain.lmChatOpenRouter, code, executeCommand, googleSheets, manualTrigger, readWriteFile, stickyNote
