Extract text layer from PDF by page

Alex Rackwitz

11 April 2025 13:47

Completed

Currently the PDF - Extract Text action extracts the entire text layer in a single string. For multi page documents, customers sometimes have the need to process page by page. Could there be an option to extract the text into an array of strings per page?

Date Votes

2 comments

Jay Goodison 17 July 2025 14:24 Official comment

Hello Alex Rackwitz

We have just released the following action: PDF - Extract Text by Page

This feature was shipped yesterday and will be available within Power Automate within 3 > 6 weeks depending on how quickly Microsoft complete the release cycle.

Edited by Jay Goodison 17 July 2025 14:24

Comment actions Permalink
0
Jay Goodison 08 May 2025 07:38
Hello Alex Rackwitz

We are working on a new action called 'PDF - Extract Text by Page' which will enable you to perform the following tasks:
- Extract text from a specific page or pages
- Extract all text as a simple string array, i.e. ["text page 1","text page 2","text page 3"]
- Extract all text as a JSON object, i.e. {[{"Page":1,"Text":"text page 1"},{"Page":2,"Text":"text page 2"}]}
We are aiming to deliver this new action by July / Aug 25
Edited by Jay Goodison 17 July 2025 14:24

Comment actions Permalink

Please sign in to leave a comment.

Review Documentation

Explore the knowledge base and contained articles.

Create a post

Raise a feature request or ask for support

Submit a Ticket

Can't find what you're looking for?