This n8n workflow receives an image via Telegram, extracts its text with OCR, refines that text with an AI agent, and replies to the sender in Telegram. A Telegram trigger starts the workflow when a user sends a photo. The workflow then retrieves the image file, runs Tesseract OCR on it, cleans the extracted text and escapes special characters, and passes the result to an AI agent backed by a Google Gemini chat model to improve clarity and correctness. Finally, the refined text is sent back to the user on Telegram. The workflow suits cases where images shared over Telegram must be turned into readable text, such as automated data entry, document processing, or customer support.
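
The clean-and-escape step could look roughly like the sketch below. It is only an illustration, not the workflow's actual Code node: the helper names `clean_ocr_text` and `escape_markdown_v2` are hypothetical, and it assumes the reply is sent with Telegram's MarkdownV2 parse mode, which the description does not state. If the reply is sent as plain text, the escaping step would not be needed.

```python
import re

# Characters that Telegram's MarkdownV2 parse mode expects to be escaped
# with a preceding backslash (the backslash itself is included).
# Assumption: the reply uses MarkdownV2; the workflow may use another mode.
MARKDOWN_V2_SPECIALS = set("\\_*[]()~`>#+-=|{}.!")


def clean_ocr_text(raw: str) -> str:
    """Hypothetical helper: tidy whitespace in raw OCR output."""
    cleaned_lines = []
    for line in raw.splitlines():
        # Collapse runs of spaces and tabs, then trim the line.
        line = re.sub(r"[ \t]+", " ", line).strip()
        # Drop lines that are empty after trimming.
        if line:
            cleaned_lines.append(line)
    return "\n".join(cleaned_lines)


def escape_markdown_v2(text: str) -> str:
    """Hypothetical helper: backslash-escape MarkdownV2 special characters."""
    return "".join(
        "\\" + ch if ch in MARKDOWN_V2_SPECIALS else ch
        for ch in text
    )


if __name__ == "__main__":
    raw = "  Invoice   #42 \n\n  Total:\t3.50 (USD)! "
    print(escape_markdown_v2(clean_ocr_text(raw)))
```

Cleaning runs before escaping so that the whitespace rules operate on the raw OCR text rather than on the inserted backslashes.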

Node Count	6 – 10 Nodes
Nodes Used	@n8n/n8n-nodes-langchain.agent, @n8n/n8n-nodes-langchain.lmChatGoogleGemini, code, n8n-nodes-tesseractjs.tesseractNode, set, telegram, telegramTrigger