Text Detection
This function block finds text regions in an image and visualizes them. It is tuned for detecting oriented text areas (rotated or tilted) and returns both a preview image and structured location data for further processing.
π₯ Inputs
Image Any Input image to be analyzed for text.
π€ Outputs
Image Any Annotated image with detected text regions drawn.
Referance Point List of reference points (corner points) for each detected text region.
Referance Rectangles List of bounding rectangles for each detected text region.
Number of Detected Text Total number of text regions found.
πΉοΈ Controls
Confidence Adjusts the minimum confidence required for a detection to be accepted. Higher values reduce false positives but may miss faint text.
NMS Threshold Adjusts how overlapping detections are merged. Lower values make merging stricter, reducing duplicate boxes over the same text.
βοΈ Running mechanism
When executed, the block examines the provided image, searches for regions that look like text, filters results by confidence, merges overlapping detections, and scales found regions back to the input image size. The block then outputs an annotated preview image, a list of reference points and rectangles for each detection, and the total count of detected text regions.
π― Features
Detects rotated and angled text regions, not just horizontal lines.
Provides both visual feedback (annotated image) and structured outputs (points, rectangles, count) for downstream processing.
Adjustable sensitivity via
ConfidenceandNMS Thresholdcontrols to tune precision vs recall.
π Usage instructions
Provide an image to the
Image Anyinput (from a camera, file loader or stream).Adjust the
Confidenceslider to balance false positives vs missed text.Adjust the
NMS Thresholdslider if multiple overlapping boxes appear over the same text area.Use the annotated
Image Anyoutput to preview detections, and useReferance Point/Referance Rectanglesfor extraction, cropping, or passing to recognition blocks.
π‘ Tips and Tricks
If the image is very large, use
Image Resizerto downscale for faster processing, then map rectangle coordinates back to the original if needed.For actual text recognition after detection, connect the detected crop areas to
OCRorOCR (EasyOCR)blocks.To focus on a specific area, crop first with
Image ROIorImage ROI Selectand feed the cropped image into this block.Use
Show Imageto preview the annotated output, andDraw Result On Imageto overlay custom status text based on detection results.Save results with
Image Writeor log examples withImage Loggerfor later review.
π οΈ Troubleshooting
No detections: Lower the
Confidencevalue or provide a clearer.Too many small boxes or duplicates: Increase the
Confidencevalue and lower theNMS Thresholdto merge overlapping detections.Detections misplaced after resizing: Ensure any resizing steps are compensated when interpreting
Referance Rectanglescoordinates.If you only need recognized text (not locations), chain this block to
OCR/OCR (EasyOCR)and use the count output to gate downstream logic.
Last updated
Was this helpful?