PDF - Apply OCR (AI)

***COMING SOON***

Overview

The 'PDF - Apply OCR (AI)' performs OCR on the PDF document provided using artificial itelligence.

Default Parameters

The default 'PDF - Apply OCR (AI)' flow action parameters are detailed below:

  • File Content: A Base64 encoded representation of the PDF file to be processed

Please refer to the Obtaining the 'File Contents' Parameter article for guidance on how to obtain the 'File Content' parameter ready to provide to an Encodian flow action. 

Credit Count

If none of the 'Advanced Parameters' are enabled then credits are calculated as follows:

1 page = 0.1 credits, for example a 7-page document would utilise 0.7 credits

If one or more of the 'Advanced Parameter' are enabled then credits are calculated as follows:

1 page = 0.2 credits, for example a 7-page document would utilise 1.4 credits

Advanced Parameters

The advanced 'PDF - Apply OCR (AI)' flow action parameters are detailed below:

  • Adjust Brightness and Contrast: This action analyses a document and automatically adjusts brightness and contrast based on the analysis.
  • Average Filter: Performs a 3x3 Average filter smoothing operation on the document, placing the output in the centre of the window.
  • Binarize: Computes all necessary parameters by analysing the input data before actually performing the binarization. The algorithm is tuned to typical document images, consisting of dark text on brighter background. It is robust to shadows, noise and background images.
  • Deskew: Detects the skew angle and rotates to remove that skew.
  • Despeckle: Automatically detects speckles and removes them.
  • Image Detergent: Image Detergent works by changing pixels of similar colour values to a central colour value, which has the result of smoothing the image wherever regions of those colours appear.
  • Remove Border: Locates border pixels and removes the pixels from the document.
  • Remove Dot Shading: This action will remove shaded regions from bi-tonal documents.
  • Remove Hole Punch: Detects and removes hole punch marks from a bi-tonal document.
  • Rotate: Automatically detects orientation and rotates it so that the text on it is upright oriented.
  • Rotate Confidence Level: Sets the minimum confidence percentage (0 to 100) used to control whether the rotation is applied.
  • Smooth Background: This works only on colour and grayscale documents. This operation smooths background colours to eliminate or de-emphasise noise.
  • Smooth Objects: This only works on bi-tonal documents, it looks at groups of pixels, and finds isolated bumps and pits in the edges of those objects, and fills them in.

Return Parameters

The 'PDF - Apply OCR (AI)' flow action returns the following data. 

Action Specific Return Values

  • File Content - The PDF document

Standard Return Values

  • OperationId - The unique ID assigned to this operation.
  • HttpStatusCode - The HTTP Status code for the response.
  • HttpStatusMessage - The HTTP Status message for the response.
  • Errors - An array of error messages should an error occur.
  • Operation Status - Indicates whether the operation has been completed, has been queued or has failed.
Have more questions? Submit a request

0 Comments

Please sign in to leave a comment.
Powered by Zendesk