Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Earn the coveted Fabric Analytics Engineer certification. 100% off your exam for a limited time only!

Reply
anshpalash
Helper II
Helper II

Import Full PDF into Power BI instead of a few tables or pages

Hi,

 

When I am trying to import some pdf files into Power BI, I see an option of only importing selected tables or pages. However, I want to import the whole pdf as a single dataset. How can I do that? 

 

I would really appreciate any help. Thanks!

1 ACCEPTED SOLUTION
ImkeF
Super User
Super User

Hi @anshpalash ,
just import 1 page with the user interface. Then delete the last 2 steps of the query that will have been created Automatically so that just the step "Source" remains:

 

image.png

 

That will create a table with all tables and pages from your pdf:

image.png
Filter only "Tables" in columnn "Kind" and then expand the column "Data".
That will create 1 consolidated table with the whole pdf-content in one query/dataset:

 

Imke Feldmann (The BIccountant)

If you liked my solution, please give it a thumbs up. And if I did answer your question, please mark this post as a solution. Thanks!

How to integrate M-code into your solution -- How to get your questions answered quickly -- How to provide sample data -- Check out more PBI- learning resources here -- Performance Tipps for M-queries

View solution in original post

3 REPLIES 3
v-stephen-msft
Community Support
Community Support

Hi @anshpalash ,

 

Is the solution provided by @ImkeF helpful to you? 

You can also specify the Start page and End Page as optional parameters for your PDF connection. You can also specify these parameters in the M formula language, using the following format:

Pdf.Tables(File.Contents("c:\sample.pdf"), [StartPage=10, EndPage=11])

 

Best Regards,

Stephen Tao

 

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Syndicate_Admin
Administrator
Administrator

Hi @anshpalash ,
just import 1 page with the user interface. Then delete the last 2 steps of the query that will have been created Automatically so that just the step "Source" remains:

 

That will create a table with all tables and pages from your pdf:

image.png
Filter only "Tables" in columnn "Kind" and then expand the column "Data".
That will create 1 consolidated table with the whole pdf-content in one query/dataset:

 

ImkeF
Super User
Super User

Hi @anshpalash ,
just import 1 page with the user interface. Then delete the last 2 steps of the query that will have been created Automatically so that just the step "Source" remains:

 

image.png

 

That will create a table with all tables and pages from your pdf:

image.png
Filter only "Tables" in columnn "Kind" and then expand the column "Data".
That will create 1 consolidated table with the whole pdf-content in one query/dataset:

 

Imke Feldmann (The BIccountant)

If you liked my solution, please give it a thumbs up. And if I did answer your question, please mark this post as a solution. Thanks!

How to integrate M-code into your solution -- How to get your questions answered quickly -- How to provide sample data -- Check out more PBI- learning resources here -- Performance Tipps for M-queries

Helpful resources

Announcements
April AMA free

Microsoft Fabric AMA Livestream

Join us Tuesday, April 09, 9:00 – 10:00 AM PST for a live, expert-led Q&A session on all things Microsoft Fabric!

March Fabric Community Update

Fabric Community Update - March 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors
Top Kudoed Authors