EOCR2 Class

Manages a complete context for the font-dependent printed character reader implemented in EasyOCR2.

Namespace: Euresys::Open_eVision

License(s): EasyOCR2

Constructors

EOCR2

Constructs an EOCR2 context.

Properties

GetReadText

Outputs an EOCR2Text structure containing the detailed detection and recognition results.

GetAllowedCharacterTypes SetAllowedCharacterTypes

Sets which character types are expected when EOCR2::EnabledTopology is false. The set of expected character types is represented by a bitwise combination of different EasyOCR2CharacterFilter.

GetCharacterDatabase SetCharacterDatabase

Sets the EOCR2CharacterDatabase used for recognizing text.

GetCharsHeight SetCharsHeight

Sets the expected character height in pixels.

GetCharsMaxFragmentation SetCharsMaxFragmentation

Sets the CharsMaxFragmentation parameter for the segmentation algorithm. This will determine the minimum size a blob Synonym of object. should be in order to be considered a potential character. A high setting will allow only larger blobs, a low setting will also allow smaller blobs.

The minimum blob size to be considered a potential character is defined as: CharsMaxFragmentation * CharsHeight * CharsWidth

GetCharsSpacingBias SetCharsSpacingBias

Sets the CharSpacingBias parameter for the topology fitting algorithm, which optimizes the spacing of the bounding boxes to optimally fit the detected blobs. This will determine whether the method is biased toward finding narrow spacing, wide spacing or is neutral.

GetCharsWidth SetCharsWidth

This property is deprecated: Use EOCR2::CharsWidthRange instead.

GetCharsWidthBias SetCharsWidthBias

Sets the CharsWidthBias parameter for the topology fitting algorithm, which optimizes the width of the bounding boxes to optimally fit the detected blobs. This will determine whether the method is biased toward finding narrow boxes, wide boxes or is neutral.

GetCharsWidthRange SetCharsWidthRange

Sets the range of expected character widths in pixels.

GetCharsWidthTolerance SetCharsWidthTolerance

This property is deprecated: Use EOCR2::CharsWidthRange instead.

GetClassifier SetClassifier

Sets the EOCR2Classifier parameter for the recognition algorithm. This will determine which classifier will be used for the recognition.

GetDetectionDelta SetDetectionDelta

Sets the DetectionDelta parameter for the segmentation algorithm. This will determine the range of grayscale-values used to determine the stability of a blob. A low setting will make the algorithm more sensitive to noise, a high setting will make the algorithm insensitive to blobs with low contrast to the background.

GetDetectionMethod SetDetectionMethod

Sets the EOCR2DetectionMethod parameter for the topology fitting algorithm, which will place text boxes on the segmentation results, matching the given topology.

GetEnableCutLargeCharacter SetEnableCutLargeCharacter

Sets whether EOCR2 detection should split or not segmented blobs into multiple characters if they are wider than the given character width range parameter. The default setting for this parameter is false and it is only applicable when the topology is not required or when the detectionMethod is set to EOCR2DetectionMethod.Proportional.

GetEnabledTopology SetEnabledTopology

Sets whether the topology is required or not. If it is, EOCR2 will try to detect the topology accordingly with the EOCR2DetectionMethod parameter. The default setting for this parameter is true and topology has to be given with EOCR2::Topology.

GetEnableGPU SetEnableGPU

Sets/Gets whether EOCR2 uses a GPU to accelerate its processing.

GetEnableOffSizeCharacter SetEnableOffSizeCharacter

Sets whether EOCR2 allows or not the detection of characters whose size (width and height) is out of the size parameters if they are in the vicinity of characters in valid size range. The default setting for this parameter is true and it is only applicable when the topology is not required.

GetEnableSecondPassGlobalSegmentation SetEnableSecondPassGlobalSegmentation

Sets whether EOCR2 segmentation should do or not do an extra pass to determine the best threshold using the EOCR2SegmentationMethod.Global segmentation method. The default setting for this parameter is false and it is only applicable when the segmentation method is set to EOCR2SegmentationMethod.Global.

GetGlobalSegmentationRelativeThreshold SetGlobalSegmentationRelativeThreshold

Sets/Gets the fraction of the image pixels that will be set below the threshold used when the segmentation method is set to EOCR2SegmentationMethod.Global. It is only used when the GlobalSegmentationThresholdMode value is EThresholdMode.Relative .

GetGlobalSegmentationThresholdMode SetGlobalSegmentationThresholdMode

Sets/Gets the EThresholdMode used when the segmentation method is set to EOCR2SegmentationMethod.Global. From the EThresholdMode, a threshold will be computed during the segmentation. While using EasyOCR2TextPolarity.WhiteOnBlack, pixels above or equal to the threshold will be segmented. While using EasyOCR2TextPolarity.BlackOnWhite, pixels under the threshold will be segmented.

GetGPUIndexes SetGPUIndexes

Sets/gets the GPUs to use when computing.

GetMaxVariation SetMaxVariation

Sets the maxVariation parameter for the segmentation algorithm. This parameter determines how stable a blob in the image should be in order to be considered a potential character, a region with clearly defined edges is generally considered stable while a blurry region is not. A high setting allows more unstable blobs, a low setting allows only very stable blobs.

GetNumDetectionPasses SetNumDetectionPasses

Sets the NumDetectionPasses parameter for the topology fitting algorithm, which will place text boxes on the segmentation results (blobs), matching the given topology. The first pass will consider all blobs, subsequent passes will only consider those blobs that are inside the text boxes from the previous pass, sometimes resulting in a more optimal fit.

GetRelativeSpacesWidthRange SetRelativeSpacesWidthRange

Sets the range of expected spaces between words as a fraction of the character width.

GetRepasteObjects SetRepasteObjects

Sets whether EOCR2 groups or does not group the blobs believed to belong to the same character. The default setting for this parameter is true and it is only applicable when the topology is not required.

GetSegmentationMethod SetSegmentationMethod

Sets the EOCR2SegmentationMethod parameter for the segmentation algorithm, which will detect blobs in the image.

GetTextAngleRange SetTextAngleRange

Sets the TextAngleRange parameter for the topology fitting algorithm, which will attempt to find the angle of the text in the image with respect to the horizontal.

This will determine the center of the range of angles that will be tested, defined as: TextAngleRange.min() <= angle <= TextAngleRange.max()

GetTextAngleTolerance SetTextAngleTolerance

This property is deprecated: Use EOCR2::TextAngleRange instead

GetTextBaseAngle SetTextBaseAngle

This property is deprecated: Use EOCR2::TextAngleRange instead

GetTextPolarity SetTextPolarity

Sets the TextPolarity parameter for the segmentation algorithm. This will determine whether the algorithm searches for light blobs in a dark background or for dark blobs in a light background.

GetTimeOut SetTimeOut

Time-out for the EOCR2::Read, EOCR2::Detect and EOCR2::Recognize methods.

GetTopology SetTopology

Sets the topology of the text that should be found in the image. A modified version of Regex expressions are used, where:

.(dot) represents any character (not including a space).
L represents a letter.
Lu represents an uppercase letter.
Ll represents a lowercase letter.
N represents a digit.
P represents a punctuation character !"#&'()*,-./:;<>?[\]_{|}~
S represents the symbols $;+-<=>|~
\n represents a line break.
' ' (space) represents a space between two words.

Combinations can be made, for example: [LN] represents an alpha-numeric character. To specify multiple characters, simply add {n} at the end for n characters. If the amount of characters is uncertain, specify {n,m} for a minimum of n characters and a maximum of m characters.

The topology "[LuN]{3,5}PN{4} \n .{5} LL" represents a text comprised of 2 lines:
The first line has 1 word composed of 3 to 5 uppercase alpha-numeric characters, followed by a punctuation character and 4 numbers.
The second line has 2 words. The first word comprises of 5 wildcard characters, the second word has 2 alphabetic characters (upper- or lowercase).

Methods

AddCharactersToDatabase

Reads reference characters from disk and adds them to the database used for text recognition. The characters can be read from a trueType (.ttf/.ttc) file or from an EasyOCR2 database file.

AddClassifierForSymbol

Adds the EOCR2Classifier for the given specific symbol combination during the recognition instead of the one set by EOCR2::Classifier.

ClearCharacterDatabase

Clears the reference character database from this EOCR2 instance.

ClearResult

Clear the current detected text if it exists.

Detect

Finds the text in an image as follows:

Detects potential characters in the image following the given text polarity.
Fits bounding boxes to the detected characters, following the given topology and character width/height.
Extracts the detected characters from the image.

The detected characters are output as an EOCR2Text structure.

DrawDetection

Draws the bounding boxes found by the topology detection algorithm.

DrawDetectionWithCurrentPen

This method is deprecated: Use the overload of EOCR2::DrawDetection taking an EDrawAdapter by using an instance of EWindowsDrawAdapter.

DrawRecognition

Draws the recognized text next to the character bounding box in the image.

DrawRecognitionWithCurrentPen

This method is deprecated: Use the overload of EOCR2::DrawRecognition taking an EDrawAdapter by using an instance of EWindowsDrawAdapter.

DrawSegmentation

Draws the blobs found by the segmentation algorithm.

DrawSegmentationWithCurrentPen

This method is deprecated: Use the overload of EOCR2::DrawSegmentation taking an EDrawAdapter by using an instance of EWindowsDrawAdapter.

GetClassifierForSymbol

Gets the EOCR2Classifier for the given specific symbol combination during the recognition instead of the one set by EOCR2::Classifier.

HitTestChar

Tests the cursor position for the presence of a character. If one is present under the cursor, it returns true and fills the EOCR2Char object In a general content, the term object should be understood with the meaning of a class instance. In EasyObject, an object is a maximally-sized area of adjacent connected pixels belonging to the layer foreground. passed as parameter.

HitTestLine

Tests the cursor position for the presence of a line. If one is present under the cursor, it returns true and fills the EOCR2Line object passed as parameter.

HitTestText

Tests the cursor position for the presence of a text. If one is present under the cursor, it returns true and fills the EOCR2Text object passed as parameter.

HitTestWord

Tests the cursor position for the presence of a word. If one is present under the cursor, it returns true and fills the EOCR2Word object passed as parameter.

Learn

Learns reference characters from a given EOCR2Text/EOCR2Line/EOCR2Word/EOCR2Char instance, containing user-specified character codes.

Load

Loads a model, containing parameter settings used for all operations in EOCR2::EOCR2, from disk.

operator=

Assignment operator, copies another EOCR2 instance to this one.

Read

Performs all steps required for reading text from an image:

Detects potential characters in the image following the given text polarity and character width/height.
Fits bounding boxes to the detected characters, following the given topology and character width/height.
Recognizes the detected characters using the given reference character database.

The read text is output as a string.

Recognize

Recognizes the characters in a given EOCR2Text instance, based on a given reference font.

RemoveClassifierForSymbol

Removes the EOCR2Classifier for the given specific symbol combination during the recognition so the one set by EOCR2::Classifier will be used.

Save

Saves the model to disk, containing the current parameter setting used for all operations in EOCR2.

SaveCharacterDatabase

Saves the current reference character database to disk.

EOCR2 Class

Manages a complete context for the font-dependent printed character reader implemented in EasyOCR2.

Namespace: Euresys.Open_eVision

License(s): EasyOCR2

Constructors

EOCR2

Constructs an EOCR2 context.

Properties

AllowedCharacterTypes

Sets which character types are expected when EOCR2.EnabledTopology is false. The set of expected character types is represented by a bitwise combination of different EasyOCR2CharacterFilter.

CharacterDatabase

Sets the EOCR2CharacterDatabase used for recognizing text.

CharsHeight

Sets the expected character height in pixels.

CharsMaxFragmentation

Sets the CharsMaxFragmentation parameter for the segmentation algorithm. This will determine the minimum size a blob should be in order to be considered a potential character. A high setting will allow only larger blobs, a low setting will also allow smaller blobs.

The minimum blob size to be considered a potential character is defined as: CharsMaxFragmentation * CharsHeight * CharsWidth

CharsSpacingBias

CharsWidth

This property is deprecated: Use EOCR2.CharsWidthRange instead.

CharsWidthBias

CharsWidthRange

Sets the range of expected character widths in pixels.

CharsWidthTolerance

This property is deprecated: Use EOCR2.CharsWidthRange instead.

Classifier

Sets the EOCR2Classifier parameter for the recognition algorithm. This will determine which classifier will be used for the recognition.

DetectionDelta

DetectionMethod

Sets the EOCR2DetectionMethod parameter for the topology fitting algorithm, which will place text boxes on the segmentation results, matching the given topology.

EnableCutLargeCharacter

EnabledTopology

EnableGPU

Sets/Gets whether EOCR2 uses a GPU to accelerate its processing.

EnableOffSizeCharacter

EnableSecondPassGlobalSegmentation

GlobalSegmentationRelativeThreshold

GlobalSegmentationThresholdMode

GPUIndexes

Sets/gets the GPUs to use when computing.

MaxVariation

NumDetectionPasses

ReadText

Outputs an EOCR2Text structure containing the detailed detection and recognition results.

RelativeSpacesWidthRange

Sets the range of expected spaces between words as a fraction of the character width.

RepasteObjects

SegmentationMethod

Sets the EOCR2SegmentationMethod parameter for the segmentation algorithm, which will detect blobs in the image.

TextAngleRange

Sets the TextAngleRange parameter for the topology fitting algorithm, which will attempt to find the angle of the text in the image with respect to the horizontal.

This will determine the center of the range of angles that will be tested, defined as: TextAngleRange.min() <= angle <= TextAngleRange.max()

TextAngleTolerance

This property is deprecated: Use EOCR2.TextAngleRange instead

TextBaseAngle

This property is deprecated: Use EOCR2.TextAngleRange instead

TextPolarity

Sets the TextPolarity parameter for the segmentation algorithm. This will determine whether the algorithm searches for light blobs in a dark background or for dark blobs in a light background.

TimeOut

Time-out for the EOCR2.Read, EOCR2.Detect and EOCR2.Recognize methods.

Topology

Sets the topology of the text that should be found in the image. A modified version of Regex expressions are used, where:

.(dot) represents any character (not including a space).
L represents a letter.
Lu represents an uppercase letter.
Ll represents a lowercase letter.
N represents a digit.
P represents a punctuation character !"#&'()*,-./:;<>?[\]_{|}~
S represents the symbols $;+-<=>|~
\n represents a line break.
' ' (space) represents a space between two words.

Methods

AddCharactersToDatabase

Reads reference characters from disk and adds them to the database used for text recognition. The characters can be read from a trueType (.ttf/.ttc) file or from an EasyOCR2 database file.

AddClassifierForSymbol

Adds the EOCR2Classifier for the given specific symbol combination during the recognition instead of the one set by EOCR2.Classifier.

ClearCharacterDatabase

Clears the reference character database from this EOCR2 instance.

ClearResult

Clear the current detected text if it exists.

Detect

Finds the text in an image as follows:

Detects potential characters in the image following the given text polarity.
Fits bounding boxes to the detected characters, following the given topology and character width/height.
Extracts the detected characters from the image.

The detected characters are output as an EOCR2Text structure.

DrawDetection

Draws the bounding boxes found by the topology detection algorithm.

DrawDetectionWithCurrentPen

This method is deprecated: Use the overload of EOCR2.DrawDetection taking an EDrawAdapter by using an instance of EWindowsDrawAdapter.

DrawRecognition

Draws the recognized text next to the character bounding box in the image.

DrawRecognitionWithCurrentPen

This method is deprecated: Use the overload of EOCR2.DrawRecognition taking an EDrawAdapter by using an instance of EWindowsDrawAdapter.

DrawSegmentation

Draws the blobs found by the segmentation algorithm.

DrawSegmentationWithCurrentPen

This method is deprecated: Use the overload of EOCR2.DrawSegmentation taking an EDrawAdapter by using an instance of EWindowsDrawAdapter.

GetClassifierForSymbol

Gets the EOCR2Classifier for the given specific symbol combination during the recognition instead of the one set by EOCR2.Classifier.

HitTestChar

Tests the cursor position for the presence of a character. If one is present under the cursor, it returns true and fills the EOCR2Char object passed as parameter.

HitTestLine

Tests the cursor position for the presence of a line. If one is present under the cursor, it returns true and fills the EOCR2Line object passed as parameter.

HitTestText

Tests the cursor position for the presence of a text. If one is present under the cursor, it returns true and fills the EOCR2Text object passed as parameter.

HitTestWord

Tests the cursor position for the presence of a word. If one is present under the cursor, it returns true and fills the EOCR2Word object passed as parameter.

Learn

Learns reference characters from a given EOCR2Text/EOCR2Line/EOCR2Word/EOCR2Char instance, containing user-specified character codes.

Load

Loads a model, containing parameter settings used for all operations in EOCR2.EOCR2, from disk.

Read

Performs all steps required for reading text from an image:

Detects potential characters in the image following the given text polarity and character width/height.
Fits bounding boxes to the detected characters, following the given topology and character width/height.
Recognizes the detected characters using the given reference character database.

The read text is output as a string.

Recognize

Recognizes the characters in a given EOCR2Text instance, based on a given reference font.

RemoveClassifierForSymbol

Removes the EOCR2Classifier for the given specific symbol combination during the recognition so the one set by EOCR2.Classifier will be used.

Save

Saves the model to disk, containing the current parameter setting used for all operations in EOCR2.

SaveCharacterDatabase

Saves the current reference character database to disk.

EOCR2 Class

Manages a complete context for the font-dependent printed character reader implemented in EasyOCR2.

Module: open_evision

License(s): EasyOCR2

Constructors

__init__

Constructs an EOCR2 context.

Properties

AllowedCharacterTypes

Sets which character types are expected when EOCR2.EnabledTopology is false. The set of expected character types is represented by a bitwise combination of different EasyOCR2CharacterFilter.

CharacterDatabase

Sets the EOCR2CharacterDatabase used for recognizing text.

CharsHeight

Sets the expected character height in pixels.

CharsMaxFragmentation

The minimum blob size to be considered a potential character is defined as: CharsMaxFragmentation * CharsHeight * CharsWidth

CharsSpacingBias

CharsWidth

This property is deprecated: Use EOCR2.CharsWidthRange instead.

CharsWidthBias

CharsWidthRange

Sets the range of expected character widths in pixels.

CharsWidthTolerance

This property is deprecated: Use EOCR2.CharsWidthRange instead.

Classifier

Sets the EOCR2Classifier parameter for the recognition algorithm. This will determine which classifier will be used for the recognition.

DetectionDelta

DetectionMethod

Sets the EOCR2DetectionMethod parameter for the topology fitting algorithm, which will place text boxes on the segmentation results, matching the given topology.

EnableCutLargeCharacter

EnabledTopology

EnableGPU

Sets/Gets whether EOCR2 uses a GPU to accelerate its processing.

EnableOffSizeCharacter

EnableSecondPassGlobalSegmentation

GlobalSegmentationRelativeThreshold

GlobalSegmentationThresholdMode

GPUIndexes

Sets/gets the GPUs to use when computing.

MaxVariation

NumDetectionPasses

ReadText

Outputs an EOCR2Text structure containing the detailed detection and recognition results.

RelativeSpacesWidthRange

Sets the range of expected spaces between words as a fraction of the character width.

RepasteObjects

SegmentationMethod

Sets the EOCR2SegmentationMethod parameter for the segmentation algorithm, which will detect blobs in the image.

TextAngleRange

Sets the TextAngleRange parameter for the topology fitting algorithm, which will attempt to find the angle of the text in the image with respect to the horizontal.

This will determine the center of the range of angles that will be tested, defined as: TextAngleRange.min() <= angle <= TextAngleRange.max()

TextAngleTolerance

This property is deprecated: Use EOCR2.TextAngleRange instead

TextBaseAngle

This property is deprecated: Use EOCR2.TextAngleRange instead

TextPolarity

Sets the TextPolarity parameter for the segmentation algorithm. This will determine whether the algorithm searches for light blobs in a dark background or for dark blobs in a light background.

TimeOut

Time-out for the EOCR2.Read, EOCR2.Detect and EOCR2.Recognize methods.

Topology

Sets the topology of the text that should be found in the image. A modified version of Regex expressions are used, where:
- .(dot) represents any character (not including a space). - L represents a letter. - Lu represents an uppercase letter. - Ll represents a lowercase letter. - N represents a digit. - P represents a punctuation character !"#&'()*,-./:;<>?[\]_{|}~ - S represents the symbols $;+-<=>|~ - \n represents a line break. - ' ' (space) represents a space between two words.

Methods

__init__

Constructs an EOCR2 context.

AddCharactersToDatabase

Reads reference characters from disk and adds them to the database used for text recognition. The characters can be read from a trueType (.ttf/.ttc) file or from an EasyOCR2 database file.

AddClassifierForSymbol

Adds the EOCR2Classifier for the given specific symbol combination during the recognition instead of the one set by EOCR2.Classifier.

ClearCharacterDatabase

Clears the reference character database from this EOCR2 instance.

ClearResult

Clear the current detected text if it exists.

Detect

Finds the text in an image as follows:

Detects potential characters in the image following the given text polarity.
Fits bounding boxes to the detected characters, following the given topology and character width/height.
Extracts the detected characters from the image.

The detected characters are output as an EOCR2Text structure.

DrawDetection

Draws the bounding boxes found by the topology detection algorithm.

DrawRecognition

Draws the recognized text next to the character bounding box in the image.

DrawSegmentation

Draws the blobs found by the segmentation algorithm.

GetClassifierForSymbol

Gets the EOCR2Classifier for the given specific symbol combination during the recognition instead of the one set by EOCR2.Classifier.

HitTestChar

Tests the cursor position for the presence of a character. If one is present under the cursor, it returns true and fills the EOCR2Char object passed as parameter.

HitTestLine

Tests the cursor position for the presence of a line. If one is present under the cursor, it returns true and fills the EOCR2Line object passed as parameter.

HitTestText

Tests the cursor position for the presence of a text. If one is present under the cursor, it returns true and fills the EOCR2Text object passed as parameter.

HitTestWord

Tests the cursor position for the presence of a word. If one is present under the cursor, it returns true and fills the EOCR2Word object passed as parameter.

Learn

Learns reference characters from a given EOCR2Text/EOCR2Line/EOCR2Word/EOCR2Char instance, containing user-specified character codes.

Load

Loads a model, containing parameter settings used for all operations in EOCR2.__init__, from disk.

Read

Performs all steps required for reading text from an image:

Detects potential characters in the image following the given text polarity and character width/height.
Fits bounding boxes to the detected characters, following the given topology and character width/height.
Recognizes the detected characters using the given reference character database.

The read text is output as a string.

Recognize

Recognizes the characters in a given EOCR2Text instance, based on a given reference font.

RemoveClassifierForSymbol

Removes the EOCR2Classifier for the given specific symbol combination during the recognition so the one set by EOCR2.Classifier will be used.

Save

Saves the model to disk, containing the current parameter setting used for all operations in EOCR2.

SaveCharacterDatabase

Saves the current reference character database to disk.