PDF - Apply OCR (Standard) – Encodian Customer Help

Power Automate Connector: Encodian – PDF

Recommendation

We recommend that all new solutions use the 'PDF - Apply OCR (AI)' action as opposed to the 'PDF Apply OCR (Standard)' action. Our new AI action provides greater OCR accuracy, handwriting support, whilst using fewer Encodian credits, reducing the cost of adding OCR capabilities to your technology solutions.

Overview

The 'PDF - Apply OCR (Standard)' performs OCR on the PDF document provided.

Example Flow

Please refer to the following article:

Power Automate Templates

Credit Count

OCR is a resource-intensive operation; therefore, an Encodian Flowr credit is used for every page OCR'd. For example, OCR'ing a 10-page document uses 10 credits.

Default Parameters

The default 'PDF - Apply OCR (Standard)' flow action parameters are detailed below:

Filename: The PDF filename (including the file extension).
File Content: A Base64 encoded representation of the PDF file to be processed.

Please refer to the Obtaining the 'File Contents' Parameter article for guidance on how to obtain the 'File Content' parameter ready to provide to an Encodian flow action.

Additional Guidance

The 'PDF - Apply OCR (Standard)' flow action will increase the output PDF document's file size. If 'Clean Operations' are performed, the file size increase will be higher still.

Advanced Parameters

The advanced 'PDF - Apply OCR (Standard)' flow action parameters are detailed below:

Language: Select the language used for OCR processing
OCR Type: Quality ensures enhanced recognition accuracy, Fast provides standard OCR accuracy with increased processing speed.
PDF/A Compliant: Sets whether the resulting document should conform to PDF/A format.
PDF/A Compliance Level: Sets the required level of PDF/A compliance
Clean Operations: Set whether page-level clean-up operations should be performed. The default option will auto-rotate and auto-deskew each page within the PDF Document.

The following advanced options only apply when the 'Clean Operations' property has been set to 'Specific':

Language: Set the language used for OCR processing.
Auto Rotate: Automatically detects orientation and rotates it so that the text on it is upright.
Deskew: Detects the skew angle and rotates to remove that skew.
Despeckle: Automatically detects speckles and removes them.
Adjust Brightness and Contrast: This action analyzes a document and automatically adjusts brightness and contrast based on the analysis.
Remove Border: Locates border pixels and removes the pixels from the document.
Smooth Background: This works only on colour and grayscale documents. This operation smooths background colours to eliminate or de-emphasise noise.
Smooth Objects: This only works on bi-tonal documents; it looks at groups of pixels, finds isolated bumps and pits in the edges of those objects, and fills them in.
Remove Dot Shading: This action will remove shaded regions from bi-tonal documents.
Image Detergent: Image Detergent works by changing pixels of similar colour values to a central colour value, which has the result of smoothing the image wherever regions of those colours appear.
Average Filter: Performs a 3x3 Average filter smoothing operation on the document, placing the output in the centre of the window.
Remove Hole Punch: Detects and removes hole punch marks from a bi-tonal document.
Binarize: Computes all necessary parameters by analysing the input data before actually performing the binarization. The algorithm is tuned to typical document images, consisting of dark text on a brighter background. It is robust to shadows, noise, and background images.

Further advanced operations:

Operation ID: (Advanced) The ID of a parent operation; please refer to: Flow Action Return Options: File Content vs. Operation ID
Return File: (Advanced) Sets whether the action should return a file or, alternatively an operation ID: Flow Action Return Options: File Content vs. Operation ID

Return Parameters

The 'PDF - Apply OCR (Standard)' flow action returns the following data.

Action Specific Return Values

Filename - The filename of the PDF document
File Content - The PDF document

Standard Return Values

OperationId - The unique ID assigned to this operation.
HttpStatusCode - The HTTP Status code for the response.
HttpStatusMessage - The HTTP Status message for the response.
Errors - An array of error messages should an error occur.
Operation Status - Indicates whether the operation has been completed, has been queued or has failed.

PDF - Apply OCR (Standard)

Recommendation

Overview

Example Flow

Power Automate Templates

Credit Count

Default Parameters

Additional Guidance

Advanced Parameters

Return Parameters

Action Specific Return Values

Standard Return Values

0 Comments

Review Documentation

Create a post

Submit a Ticket

Related articles

Recommendation

Overview

Example Flow

Power Automate Templates

Credit Count

Default Parameters

Additional Guidance

Advanced Parameters

Return Parameters

Action Specific Return Values

Standard Return Values

0 Comments

Review Documentation

Create a post

Submit a Ticket