node-red-contrib-tesseract 1.1.4

Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine and performs offline text recognition.

npm install node-red-contrib-tesseract

Tesseract

Tesseract.js is a pure Javascript port of the popular Tesseract OCR engine. It performs all OCR tasks locally without requiring a connection to any external service.

Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co, Greeley Colorado between 1985 and 1994, with some more changes made in 1996 to port to Windows, and some C++izing in 1998. In 2005 Tesseract was open sourced by HP. Since 2006 it is developed by Google.

Tesseract flow

This Node-RED implementation of Tesseract.js has been provided by Sjoerd van der Hoorn.

Settings

Input

  • msg.payload - Local filename, URL, or image buffer.

Output

  • msg.payload - String with recognized text.
  • msg.tesseract - Object with recognized text split out per line and word, plus confidence information.
{
	text: "Text from image\nSecond line",
	confidence: 87,
	lines: 
	[
		{
			text: "Text from image",
			confidence: 93,
			words:
			[
				{
					text: "Text",
					confidence: 97
				},
				{
					...
				}
			]
		},
		{
			...
		}
	]
}

Additional information

Node Info

Version: 1.1.4
Updated 5 years, 8 months ago
License: ISC
Rating: 3.3 4

Categories

Actions

Rate:

Downloads

34 in the last week

Nodes

  • tesseract

Keywords

  • node-red
  • ocr
  • tesseract
  • text recognition

Maintainers