Currently, the PDF - Extract Text action returns the entire text layer as a single string. For multi-page documents, customers sometimes need to process the text page by page. Could there be an option to extract the text into an array of strings, one per page?
Official comment
Hello Alex Rackwitz,
We are working on a new action called 'PDF - Extract Text by Page' which will enable you to perform the following tasks:
- Extract text from a specific page or pages
- Extract all text as a simple string array, e.g. ["text page 1","text page 2","text page 3"]
- Extract all text as JSON, e.g. [{"Page":1,"Text":"text page 1"},{"Page":2,"Text":"text page 2"}]
We are aiming to deliver this new action by July / Aug 25
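For anyone planning downstream processing, here is a rough Python sketch of the two output shapes described above. It is not the action's implementation or interface: the function names are hypothetical, it assumes the per-page text has already been obtained somehow, and it assumes pages are numbered from 1 as in the example.

```python
import json


def pages_to_string_array(pages):
    """Shape 1: a plain list with one string per page, in page order."""
    return list(pages)


def pages_to_page_objects(pages):
    """Shape 2: one object per page, with an assumed 1-based "Page" number."""
    return [{"Page": number, "Text": text}
            for number, text in enumerate(pages, start=1)]


def select_pages(pages, page_numbers):
    """Return text for the requested 1-based page numbers, skipping invalid ones."""
    return [pages[n - 1] for n in page_numbers if 1 <= n <= len(pages)]


# Hypothetical per-page text standing in for a real extraction result.
pages = ["text page 1", "text page 2", "text page 3"]

as_array_json = json.dumps(pages_to_string_array(pages))
as_objects_json = json.dumps(pages_to_page_objects(pages))
first_and_last = select_pages(pages, [1, 3])
```

The object form is handy when pages are filtered or reordered later, because each entry still carries its own page number; the plain array relies on position alone.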