Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
anshpalash
Helper II
Helper II

Import Full PDF into Power BI instead of a few tables or pages

Hi,

 

When I am trying to import some pdf files into Power BI, I see an option of only importing selected tables or pages. However, I want to import the whole pdf as a single dataset. How can I do that? 

 

I would really appreciate any help. Thanks!

1 ACCEPTED SOLUTION
ImkeF
Super User
Super User

Hi @anshpalash ,
just import 1 page with the user interface. Then delete the last 2 steps of the query that will have been created Automatically so that just the step "Source" remains:

 

image.png

 

That will create a table with all tables and pages from your pdf:

image.png
Filter only "Tables" in columnn "Kind" and then expand the column "Data".
That will create 1 consolidated table with the whole pdf-content in one query/dataset:

 

Imke Feldmann (The BIccountant)

If you liked my solution, please give it a thumbs up. And if I did answer your question, please mark this post as a solution. Thanks!

How to integrate M-code into your solution -- How to get your questions answered quickly -- How to provide sample data -- Check out more PBI- learning resources here -- Performance Tipps for M-queries

View solution in original post

3 REPLIES 3
v-stephen-msft
Community Support
Community Support

Hi @anshpalash ,

 

Is the solution provided by @ImkeF helpful to you? 

You can also specify the Start page and End Page as optional parameters for your PDF connection. You can also specify these parameters in the M formula language, using the following format:

Pdf.Tables(File.Contents("c:\sample.pdf"), [StartPage=10, EndPage=11])

 

Best Regards,

Stephen Tao

 

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Syndicate_Admin
Administrator
Administrator

Hi @anshpalash ,
just import 1 page with the user interface. Then delete the last 2 steps of the query that will have been created Automatically so that just the step "Source" remains:

 

That will create a table with all tables and pages from your pdf:

image.png
Filter only "Tables" in columnn "Kind" and then expand the column "Data".
That will create 1 consolidated table with the whole pdf-content in one query/dataset:

 

ImkeF
Super User
Super User

Hi @anshpalash ,
just import 1 page with the user interface. Then delete the last 2 steps of the query that will have been created Automatically so that just the step "Source" remains:

 

image.png

 

That will create a table with all tables and pages from your pdf:

image.png
Filter only "Tables" in columnn "Kind" and then expand the column "Data".
That will create 1 consolidated table with the whole pdf-content in one query/dataset:

 

Imke Feldmann (The BIccountant)

If you liked my solution, please give it a thumbs up. And if I did answer your question, please mark this post as a solution. Thanks!

How to integrate M-code into your solution -- How to get your questions answered quickly -- How to provide sample data -- Check out more PBI- learning resources here -- Performance Tipps for M-queries

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors
Top Kudoed Authors