~4 min11 / 12

PDF & Image / OCR Activities

Extract text from PDFs (digital or scanned), manipulate PDF files, capture screenshots, find images on screen using template matching, and run OCR. 9 PDF activities and 4 Image/OCR activities.

Read PDF Text

Extracts digital text from a PDF file. Fast and accurate for PDFs created by Word, Excel, or other digital tools.

ParameterTypeRequiredDescription
FilePathStringYesAbsolute path to the PDF file
PagesStringNoPage range to extract, e.g. 1-3 or 2. Leave empty for all pages.
OutputVariableStringYesString variable to store the extracted text

Read PDF Table

Extracts a table from a PDF page into a DataTable variable.

ParameterTypeRequiredDescription
FilePathStringYesPDF file path
PageInt32YesPage number (1-based) containing the table
TableIndexInt32No (default: 0)Zero-based index of the table on that page
OutputVariableStringYesDataTable variable to store the extracted table

Merge PDFs

Combines multiple PDF files into a single PDF in the specified order.

ParameterTypeRequiredDescription
InputPathsStringArray / comma-separatedYesOrdered list of source PDF file paths
OutputPathStringYesPath for the merged PDF to create
OverwriteBooleanNo (default: false)Replace output file if it exists

Extract PDF Page Range

Extracts a range of pages from a PDF into a new PDF file.

ParameterTypeRequiredDescription
InputPathStringYesSource PDF file
PagesStringYesPage range, e.g. 1-5 or 3,7,10
OutputPathStringYesOutput PDF path

Get PDF Page Count

Returns the total number of pages in a PDF file.

ParameterTypeRequiredDescription
FilePathStringYesPDF file path
OutputVariableStringYesInt32 variable to store the page count

Read PDF With OCR

Extracts text from a scanned PDF using Optical Character Recognition. Slower than Read PDF Text but handles image-based PDFs.

ParameterTypeRequiredDescription
FilePathStringYesScanned PDF file path
LanguageStringNo (default: en)OCR language code
OutputVariableStringYesString variable to store the extracted text

Extract PDF Images

Extracts all embedded images from a PDF and saves them to a folder.

ParameterTypeRequiredDescription
FilePathStringYesPDF file path
OutputFolderStringYesFolder to save extracted images into
OutputVariableStringNoStringArray variable holding the saved image file paths

Export PDF

Exports a PDF to another format (e.g., converts to images).

ParameterTypeRequiredDescription
FilePathStringYesPDF file path
OutputFolderStringYesFolder for the exported files
FormatPNG | JPEG | TIFFNo (default: PNG)Output image format
DpiInt32No (default: 150)Resolution in DPI

Manage PDF Password

Encrypts a PDF with a password, or decrypts a password-protected PDF.

ParameterTypeRequiredDescription
InputPathStringYesSource PDF
OutputPathStringYesOutput PDF path
ActionEncrypt | DecryptYesWhether to add or remove password protection
PasswordStringYesPassword to apply or verify

Find Image On Screen

Searches the screen (or a captured screenshot) for a template image using pixel-level template matching. Returns the screen coordinates if found.

ParameterTypeRequiredDescription
TemplatePathStringYesFile path of the template image to search for
ConfidenceDouble (0–1)No (default: 0.9)Minimum match score (lower = more lenient)
FoundVariableStringYesBoolean variable — true if the template was found
LocationVariableStringNoVariable holding screen coordinates {'{'}X, Y{'}'} of the centre of the match
workflow
1Find Image On Screen:
2 TemplatePath: "templates\submit-button.png"
3 Confidence: 0.85
4 FoundVariable: "btnFound"
5 LocationVariable: "btnPoint"
6 
7If: Condition="${btnFound}"
8 Then:
9 Click: Selector=... // use the coordinates to click

Click Image

Finds a template image on screen and clicks its centre. Combines Find Image On Screen and Click into one activity.

ParameterTypeRequiredDescription
TemplatePathStringYesTemplate image file path
ConfidenceDoubleNo (default: 0.9)Match threshold
ClickTypeSingle | Double | RightNo (default: Single)Click type
TimeoutMsInt32NoMax wait for the image to appear

Wait For Image

Waits until a template image appears on screen, then continues execution.

ParameterTypeRequiredDescription
TemplatePathStringYesTemplate image to wait for
ConfidenceDoubleNo (default: 0.9)Match threshold
TimeoutMsInt32No (default: 30000)Max wait before throwing

Read Text With OCR

Runs Optical Character Recognition on an image file or captured screenshot and returns the extracted text.

ParameterTypeRequiredDescription
ImagePathStringYesPath to the image file (.png, .jpg, .tiff, .bmp)
LanguageStringNo (default: en)Tesseract OCR language code
OutputVariableStringYesString variable to store the extracted text
OCR accuracy tipsFor best results: capture the smallest region containing the target text, ensure good contrast between text and background, use a DPI of at least 150, and avoid rotated text. For scanned PDFs, use Read PDF With OCR instead.
Was this helpful?