Automated Image-to-Text Extraction and Response

This n8n workflow receives an image via Telegram, extracts its text with OCR, refines that text with an AI agent, and replies in Telegram. A Telegram trigger fires when a user sends a photo. The workflow then retrieves the image file, runs Tesseract OCR on it, cleans the extracted text and escapes special characters, and passes the result to a language model (Google Gemini) to improve clarity and correctness. Finally, the refined text is sent back to the user on Telegram. The workflow suits scenarios such as automated data entry, document processing, or customer support, where text in images shared via Telegram needs to be converted to readable text efficiently.
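
The description does not say which characters the cleaning step escapes or which Telegram parse mode the reply uses. As a rough illustration only, the sketch below assumes the reply is sent with Telegram's MarkdownV2 parse mode, which requires a fixed set of punctuation characters to be escaped with a backslash. The function names `clean_ocr_text` and `escape_markdown_v2` and the whitespace cleanup rules are hypothetical, not taken from the workflow, and the workflow's own Code node may be written in JavaScript rather than Python.

```python
import re


def clean_ocr_text(raw: str) -> str:
    """Tidy whitespace noise in OCR output (hypothetical cleanup rules)."""
    # Collapse runs of spaces/tabs inside each line and trim the ends.
    lines = [re.sub(r"[ \t]+", " ", line).strip() for line in raw.splitlines()]
    # Drop lines that are empty after trimming.
    return "\n".join(line for line in lines if line)


def escape_markdown_v2(text: str) -> str:
    """Backslash-escape the characters reserved by Telegram MarkdownV2.

    Assumption: the reply uses parse_mode=MarkdownV2. The backslash itself
    is escaped as well so that stray OCR backslashes stay literal.
    """
    return re.sub(r"([_*\[\]()~`>#+\-=|{}.!\\])", r"\\\1", text)


if __name__ == "__main__":
    ocr_output = "  Total:   3.50 (USD)! \n\n\tInvoice #42  "
    print(escape_markdown_v2(clean_ocr_text(ocr_output)))
```

Cleaning before escaping keeps the two concerns separate: the first function only normalizes whitespace, and the second only guards against Telegram rejecting the message because of unescaped reserved characters.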

Node Count

6 – 10 Nodes

Nodes Used

@n8n/n8n-nodes-langchain.agent, @n8n/n8n-nodes-langchain.lmChatGoogleGemini, code, n8n-nodes-tesseractjs.tesseractNode, set, telegram, telegramTrigger
