PDF & Image / OCR Activities

Extract text from PDFs (digital or scanned), manipulate PDF files, capture screenshots, find images on screen using template matching, and run OCR. This section covers 9 PDF activities and 4 Image/OCR activities.

Read PDF Text

Extracts digital text from a PDF file. Fast and accurate for PDFs created by Word, Excel, or other digital tools.

Parameter	Type	Required	Description
FilePath	String	Yes	Absolute path to the PDF file
Pages	String	No	Page range to extract, e.g. `1-3` or `2`. Leave empty for all pages.
OutputVariable	String	Yes	String variable to store the extracted text

Read PDF Table

Extracts a table from a PDF page into a DataTable variable.

Parameter	Type	Required	Description
FilePath	String	Yes	PDF file path
Page	Int32	Yes	Page number (1-based) containing the table
TableIndex	Int32	No (default: 0)	Zero-based index of the table on that page
OutputVariable	String	Yes	DataTable variable to store the extracted table

Merge PDFs

Combines multiple PDF files into a single PDF in the specified order.

Parameter	Type	Required	Description
InputPaths	StringArray / comma-separated	Yes	Ordered list of source PDF file paths
OutputPath	String	Yes	Path for the merged PDF to create
Overwrite	Boolean	No (default: false)	Replace output file if it exists

Extract PDF Page Range

Extracts a range of pages from a PDF into a new PDF file.

Parameter	Type	Required	Description
InputPath	String	Yes	Source PDF file
Pages	String	Yes	Page range, e.g. `1-5` or `3,7,10`
OutputPath	String	Yes	Output PDF path
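
The `Pages` syntax can be illustrated with a small parser. This is a minimal sketch, not the activity's actual implementation: `parse_pages` is a hypothetical helper, and it assumes that page numbers are 1-based, that ranges and single pages may be mixed (e.g. `1-3,7`), and that an empty value means "all pages" as described for Read PDF Text.

```python
def parse_pages(spec: str, page_count: int) -> list[int]:
    """Expand a spec such as "1-5" or "3,7,10" into 1-based page numbers."""
    if not spec.strip():
        # Assumption: an empty spec selects every page.
        return list(range(1, page_count + 1))

    pages: list[int] = []
    for part in spec.split(","):
        part = part.strip()
        if "-" in part:
            # "1-5" is an inclusive range.
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            # "3" is a single page.
            start = end = int(part)
        if not 1 <= start <= end <= page_count:
            raise ValueError(f"page range {part!r} is outside 1-{page_count}")
        pages.extend(range(start, end + 1))
    return pages


print(parse_pages("1-5", 10))
print(parse_pages("3,7,10", 10))
```

Validating the range against the result of Get PDF Page Count before extracting avoids a failure partway through a workflow.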

Get PDF Page Count

Returns the total number of pages in a PDF file.

Parameter	Type	Required	Description
FilePath	String	Yes	PDF file path
OutputVariable	String	Yes	Int32 variable to store the page count

Read PDF With OCR

Extracts text from a scanned PDF using Optical Character Recognition. Slower than Read PDF Text but handles image-based PDFs.

Parameter	Type	Required	Description
FilePath	String	Yes	Scanned PDF file path
Language	String	No (default: en)	OCR language code
OutputVariable	String	Yes	String variable to store the extracted text
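
Because OCR is the slower path, one possible pattern is to try digital extraction first and fall back to OCR only when no text comes back. The sketch below is an illustration under assumptions: `read_text` and `read_ocr` are hypothetical callables standing in for the two activities, and it assumes that digital extraction yields an empty string for an image-only PDF.

```python
from typing import Callable


def read_pdf_any(path: str,
                 read_text: Callable[[str], str],
                 read_ocr: Callable[[str], str]) -> str:
    """Return digital text when present, otherwise fall back to OCR."""
    text = read_text(path)
    if text.strip():
        # Digital text was found, so the slower OCR pass is skipped.
        return text
    # Assumption: no digital text means the PDF is image-based.
    return read_ocr(path)
```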

Extract PDF Images

Extracts all embedded images from a PDF and saves them to a folder.

Parameter	Type	Required	Description
FilePath	String	Yes	PDF file path
OutputFolder	String	Yes	Folder to save extracted images into
OutputVariable	String	No	StringArray variable holding the saved image file paths

Export PDF

Exports a PDF to another format. The formats listed below are all image formats.

Parameter	Type	Required	Description
FilePath	String	Yes	PDF file path
OutputFolder	String	Yes	Folder for the exported files
Format	PNG | JPEG | TIFF	No (default: PNG)	Output image format
Dpi	Int32	No (default: 150)	Resolution in DPI
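
The effect of `Dpi` on output size is simple arithmetic. PDF page dimensions are normally expressed in points, with 72 points per inch, so the pixel size of a rendered page scales with `Dpi / 72`. A minimal sketch, assuming the exporter renders each page at its full size and rounds to whole pixels (`rendered_size` is a hypothetical helper, not part of the activity):

```python
POINTS_PER_INCH = 72  # default PDF user-space units per inch


def rendered_size(width_pt: float, height_pt: float, dpi: int = 150) -> tuple[int, int]:
    """Return (width, height) in pixels for a page rendered at `dpi`."""
    width_px = round(width_pt * dpi / POINTS_PER_INCH)
    height_px = round(height_pt * dpi / POINTS_PER_INCH)
    return width_px, height_px


# US Letter is 612 x 792 points (8.5 x 11 inches).
print(rendered_size(612, 792))       # default 150 DPI
print(rendered_size(612, 792, 300))  # 300 DPI
```

Doubling the DPI doubles each dimension and roughly quadruples the pixel count, which affects both file size and export time.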

Manage PDF Password

Encrypts a PDF with a password, or decrypts a password-protected PDF.

Parameter	Type	Required	Description
InputPath	String	Yes	Source PDF
OutputPath	String	Yes	Output PDF path
Action	Encrypt | Decrypt	Yes	Whether to add or remove password protection
Password	String	Yes	Password to apply or verify

Find Image On Screen

Searches the screen (or a captured screenshot) for a template image using pixel-level template matching. Returns the screen coordinates if found.

Parameter	Type	Required	Description
TemplatePath	String	Yes	File path of the template image to search for
Confidence	Double (0–1)	No (default: 0.9)	Minimum match score (lower = more lenient)
FoundVariable	String	Yes	Boolean variable — true if the template was found
LocationVariable	String	No	Variable holding screen coordinates {X, Y} of the centre of the match

```workflow
Find Image On Screen:
  TemplatePath: "templates\submit-button.png"
  Confidence: 0.85
  FoundVariable: "btnFound"
  LocationVariable: "btnPoint"

If: Condition="${btnFound}"
  Then:
    Click: Selector=... // use the coordinates to click
```
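
To show how `Confidence` acts as a threshold, here is a minimal pure-Python sketch of template matching. It is not the activity's actual algorithm: it scores each position by mean absolute pixel difference on grayscale values, and `match_score` and `find_image` are hypothetical names. Production matchers typically use faster, more robust scores such as normalized cross-correlation.

```python
Image = list[list[int]]  # rows of 0-255 grayscale values


def match_score(screen: Image, template: Image, left: int, top: int) -> float:
    """Similarity in [0, 1] between the template and the screen patch at (left, top)."""
    total = 0
    for y, row in enumerate(template):
        for x, value in enumerate(row):
            total += abs(screen[top + y][left + x] - value)
    pixels = len(template) * len(template[0])
    return 1.0 - total / (255 * pixels)


def find_image(screen: Image, template: Image, confidence: float = 0.9):
    """Return (found, centre) for the best-scoring position, or (False, None)."""
    t_h, t_w = len(template), len(template[0])
    best_score, best_pos = -1.0, None
    # Slide the template over every position where it fits entirely on screen.
    for top in range(len(screen) - t_h + 1):
        for left in range(len(screen[0]) - t_w + 1):
            score = match_score(screen, template, left, top)
            if score > best_score:
                best_score, best_pos = score, (left, top)
    if best_pos is None or best_score < confidence:
        return False, None
    left, top = best_pos
    # Report the centre of the match, as LocationVariable does.
    return True, {"X": left + t_w // 2, "Y": top + t_h // 2}
```

Lowering `confidence` accepts matches whose pixels differ more from the template, which helps with anti-aliasing or theme changes but raises the risk of false positives.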

Click Image

Finds a template image on screen and clicks its centre. Combines Find Image On Screen and Click into one activity.

Parameter	Type	Required	Description
TemplatePath	String	Yes	Template image file path
Confidence	Double	No (default: 0.9)	Match threshold
ClickType	Single | Double | Right	No (default: Single)	Click type
TimeoutMs	Int32	No	Maximum time to wait for the image to appear, in milliseconds

Wait For Image

Waits until a template image appears on screen, then continues execution.

Parameter	Type	Required	Description
TemplatePath	String	Yes	Template image to wait for
Confidence	Double	No (default: 0.9)	Match threshold
TimeoutMs	Int32	No (default: 30000)	Maximum time to wait, in milliseconds, before an error is thrown
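
The wait-then-throw behaviour can be pictured as a polling loop with a deadline. This is a minimal sketch under assumptions, not the activity's implementation: `wait_for` and the `poll_ms` interval are hypothetical, and `is_visible` stands in for a single template-matching check.

```python
import time
from typing import Callable


def wait_for(is_visible: Callable[[], bool], timeout_ms: int = 30000,
             poll_ms: int = 250) -> None:
    """Poll is_visible() until it returns True or timeout_ms elapses."""
    deadline = time.monotonic() + timeout_ms / 1000
    while True:
        if is_visible():
            return
        if time.monotonic() >= deadline:
            raise TimeoutError(f"image did not appear within {timeout_ms} ms")
        # Sleep between checks so the loop does not spin the CPU.
        time.sleep(poll_ms / 1000)
```

Using a monotonic clock keeps the timeout correct even if the system clock is adjusted while the workflow is waiting.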

Read Text With OCR

Runs Optical Character Recognition on an image file or captured screenshot and returns the extracted text.

Parameter	Type	Required	Description
ImagePath	String	Yes	Path to the image file (.png, .jpg, .tiff, .bmp)
Language	String	No (default: en)	Tesseract OCR language code
OutputVariable	String	Yes	String variable to store the extracted text

OCR accuracy tips

For best results: capture the smallest region containing the target text, ensure good contrast between text and background, use a DPI of at least 150, and avoid rotated text. For scanned PDFs, use Read PDF With OCR instead.