node-red-contrib-tesseract 1.1.4

Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine that performs text recognition offline.

npm install node-red-contrib-tesseract

Tesseract.js is a pure JavaScript port of the popular Tesseract OCR engine. It performs all OCR tasks locally, without requiring a connection to any external service.

Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co., Greeley, Colorado, between 1985 and 1994. Further changes followed in 1996 to port it to Windows, and parts were converted to C++ in 1998. HP released Tesseract as open source in 2005, and from 2006 it was developed by Google.

This Node-RED implementation of Tesseract.js has been provided by Sjoerd van der Hoorn.

Settings

Input

  • msg.payload - Local filename, URL, or image buffer.
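
The three accepted payload forms can be illustrated with plain message objects. This is only a sketch: the file path and URL are hypothetical placeholders, and `payloadKind` is a hypothetical helper for checking a message before it reaches the node, not part of this package. How the node itself tells the three forms apart is not documented here.

```javascript
// Sketch: the three payload shapes listed above, as Node-RED message objects.
// The path and URL are hypothetical placeholders.
const fromFile = { payload: "/data/images/receipt.png" };
const fromUrl = { payload: "https://example.com/images/receipt.png" };
// A Buffer would normally come from an upstream node (for example a file or
// HTTP request node). These four bytes are the start of a PNG signature and
// only stand in for real image data.
const fromBuffer = { payload: Buffer.from([0x89, 0x50, 0x4e, 0x47]) };

// Hypothetical helper: report which kind of input a payload is.
function payloadKind(payload) {
    if (Buffer.isBuffer(payload)) return "buffer";
    if (typeof payload === "string") {
        // Strings starting with http:// or https:// are treated as URLs,
        // every other string as a local filename.
        return /^https?:\/\//i.test(payload) ? "url" : "filename";
    }
    return "unsupported";
}

console.log(payloadKind(fromFile.payload));
console.log(payloadKind(fromUrl.payload));
console.log(payloadKind(fromBuffer.payload));
```

A check like this could sit in a function node in front of the tesseract node to drop messages whose payload is neither a string nor a buffer.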

Output

  • msg.payload - String with recognized text.
  • msg.tesseract - Object with recognized text split out per line and word, plus confidence information.
{
    text: "Text from image\nSecond line",
    confidence: 87,
    lines: 
    [
        {
            text: "Text from image",
            confidence: 93,
            words:
            [
                {
                    text: "Text",
                    confidence: 97
                },
                {
                    ...
                }
            ]
        },
        {
            ...
        }
    ]
}
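
A common use of the per-word confidence values is to flag words that may have been misread. The sketch below assumes the `msg.tesseract` structure shown above. `lowConfidenceWords` and the threshold of 80 are hypothetical choices for illustration, and the sample values are made up.

```javascript
// Hypothetical helper: collect every word whose confidence is below a threshold.
// Expects an object shaped like msg.tesseract (lines -> words -> confidence).
function lowConfidenceWords(tesseract, threshold = 80) {
    const flagged = [];
    for (const line of tesseract.lines || []) {
        for (const word of line.words || []) {
            if (typeof word.confidence === "number" && word.confidence < threshold) {
                flagged.push({
                    text: word.text,
                    confidence: word.confidence,
                    line: line.text
                });
            }
        }
    }
    return flagged;
}

// Made-up sample data in the documented shape.
const sample = {
    text: "Text from image\nSecond line",
    confidence: 87,
    lines: [
        {
            text: "Text from image",
            confidence: 93,
            words: [
                { text: "Text", confidence: 97 },
                { text: "from", confidence: 95 },
                { text: "image", confidence: 62 }
            ]
        },
        {
            text: "Second line",
            confidence: 81,
            words: [
                { text: "Second", confidence: 88 },
                { text: "line", confidence: 74 }
            ]
        }
    ]
};

const flagged = lowConfidenceWords(sample, 80);
console.log(flagged.map((w) => w.text)); // [ 'image', 'line' ]
```

In a function node placed after the tesseract node, the same helper could be applied as `msg.lowConfidence = lowConfidenceWords(msg.tesseract, 80); return msg;`, where `msg.lowConfidence` is again a hypothetical property name.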

Node Info

Version: 1.1.4
License: ISC

Nodes

  • tesseract

Keywords

  • node-red
  • ocr
  • tesseract
  • text recognition