OCR (EasyOCR)
This function block detects and extracts text from images using an OCR engine. It is designed for easy integration: provide an image, adjust a confidence threshold, and optionally allow automatic rotation to improve recognition of rotated text.
π₯ Inputs
ImageGrayscale or color image that contains the text to be recognized.
π€ Outputs
ResultImage annotated with detected boxes and optional text labels (if visualization is enabled downstream).Whole TextFull recognized text as a single string (words joined).TextsIndividual recognized text segments as a list.BoxesBounding polygons (4 points) for each detected text area.
πΉοΈ Controls
Auto RotationToggle to try rotated versions of the input image when looking for text. Useful when text may appear rotated (e.g., phones, tilted signs).Show TextsToggle to include recognized text labels on the annotated image output (when output visualization is active).ThresholdSlider to set minimum confidence required for a detection to be accepted. Higher values reduce false positives but may miss faint text.
β¨ Features
Multi-angle recognition when
Auto Rotationis enabled to improve detection on rotated text.Confidence-based filtering via
Thresholdto control detection strictness.Returns both textual results and exact bounding polygons for each detection, enabling downstream processing or logging.
Will use available hardware acceleration when present to improve performance.
π How it runs
When an image is provided to Image, the block analyzes the image for text, applying the configured threshold and rotation behavior. Accepted detections are returned as text and bounding boxes. If visualization is connected, an annotated image with detected boxes (and optional labels) is produced on the Result output.
π Quick start
Connect an image-producing block (for example a camera or an image loader) to
Image.Toggle
Auto Rotationif the text may be rotated.Adjust
Thresholdto balance between sensitivity and false positives.Use the
Boxesoutput to draw or log detections, or connect theResultimage to a display block to preview.
π‘ Tips and Tricks
Preprocess small or noisy text with
Image ResizerorSuper Resolutionto improve readability before OCR.Crop to the region of interest using
Image ROI SelectorImage ROIso OCR only processes relevant areas (faster and more accurate).Improve contrast and reduce noise with
Contrast Optimization,Denoising, orAdjust Colorsbefore feeding the image.Remove distracting backgrounds with
Background Removal (RMBG-1.4)orBackground Removal (BiRefNet)when text overlays complex scenes.For quick visual checks, connect the annotated
ResulttoShow Imageto inspect box placement and text labels.To compare recognition quality or fallback strategies, try the alternative
OCRblock and compare results.Save recognized text and logs using
CSV Exportor assemble structured logs withData to JSONfor downstream systems.If many false positives appear, increase
Thresholdor apply targeted masking usingApply Maskto exclude irrelevant regions.
π οΈ Troubleshooting
No text detected: increase image size with
Image Resizeror improve clarity withDenoising/Contrast Optimization.Low confidence detections: raise the
Thresholdslider and/or clean the background with background removal blocks.Text appears rotated or upside-down: enable
Auto Rotation. If only one angle is problematic, crop withImage ROI Selectand rotate externally.Many false positives on textured backgrounds: restrict processing area using
Image ROI Selectand useApply Maskto block noisy regions.Want structured output: route
TextsandBoxesintoData to JSONorCSV Exportfor saving or downstream processing.
Last updated
Was this helpful?