The possibility of verifying if a document is fully OCR’d

New request: the ability to verify whether a document is fully OCR’d. For example, when we call the flow to OCR a document that has already been OCR’d, is there a utility or action that can check whether every page has been OCR’d? If all pages have a text layer and are searchable, there is no need to perform the OCR again. If you need more information about this, please let me know.

3 comments

  • Jay Goodison (Official comment)

    Hi Marjolaine Ghaby 

    You can already check the presence of a text layer within a PDF file using the Get PDF Document Information action. This returns a boolean property named 'Has Text Layer'.

    However, this check is not a page-level validation. It checks for the presence of a text layer in the document as a whole, so it's possible that some pages do not have a text layer.

    We are currently working on a dedicated action for this check, named 'PDF - Validate Text Layer', which will allow you to specify whether to perform a page-level or a document-level check.

  • Marjolaine Ghaby

    Hello Jay,

    That's great news.  Please keep us posted on progress.

    Thank you.

    Marjolaine

  • Jay Goodison

    Marjolaine Ghaby 

    We have completed development of the new action, 'PDF - Validate Text Layer', which is documented on Encodian Customer Help.

    This will ship to our platform at the end of this month and should then be available globally within Power Automate 4 to 6 weeks later.
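
The difference between the document-level and page-level checks discussed in this thread can be illustrated with a minimal Python sketch. It is not Encodian's implementation. It assumes the text of each page has already been extracted into a list of strings by some PDF library, and that a page with no extractable text has no text layer. All function names here are hypothetical.

```python
from typing import List, Sequence


def pages_missing_text(page_texts: Sequence[str]) -> List[int]:
    """Return the 1-based numbers of pages whose extracted text is empty."""
    return [n for n, text in enumerate(page_texts, start=1) if not text.strip()]


def has_text_layer_document_level(page_texts: Sequence[str]) -> bool:
    """Document-level check (assumed semantics): any page carries text."""
    return any(text.strip() for text in page_texts)


def has_text_layer_page_level(page_texts: Sequence[str]) -> bool:
    """Page-level check: the document has pages and every page carries text."""
    return len(page_texts) > 0 and not pages_missing_text(page_texts)


def needs_ocr(page_texts: Sequence[str]) -> bool:
    """OCR can be skipped only when the page-level check passes."""
    return not has_text_layer_page_level(page_texts)


# Hypothetical three-page document in which page 2 is an un-OCR'd scan.
sample = ["Invoice 1001", "", "Terms and conditions"]
print(has_text_layer_document_level(sample))  # document-level result
print(has_text_layer_page_level(sample))      # page-level result
print(pages_missing_text(sample))             # pages that still need OCR
```

With this sample the document-level check passes while the page-level check fails, which is the gap described in the official comment. Note that a genuinely blank page would also be reported as missing text under this assumption.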
