I'm trying to import data from a PDF file, but a table bigger than one page is imported in Power Bi as multiple tables.
Is there any workaround?
I make a test for your scenario:
I have a pdf with a large table in 23 pages (from page 1 to page 23),
1.open Edit queries, create a new blank query,
2.open its Advanced editor, write the code
letSource = Pdf.Tables(File.Contents("C:\desktop\case\5\5.10\date.pdf"), [StartPage=1, EndPage=23])inSource
3.Expand "Data" column
4.Filter "Kind" column to keep only "Table"
5.Then we will get all data in one table
I could remove some useless columns,keep only last two columns, then "Use the first row as headers"
Community Support Team _ Maggie LiIf this post helps, then please consider Accept it as the solution to help the other members find it more quickly.
You are a lifesaver! This just saved me hours of cleanup on multiple pdfs. Thank you!
Awesome Magie! Gold tip!
Working around a problem like this, without result till see this. Anyway, still having an issue: on my pdf file, the table as date on it , but on two different rows:
when it should be 2019-07-23 on the same row. Any expert advice on how to solve this?
V @v-juanli-msft ,
thanks for your answer, I've learned something new.
If I have understood well, this code doesn't solve my case since I have more the one table in the same PDF and I don't know in advance the page of each table.
Learn how to create your own user groups today!
Click here to read more about the November 2021 Updates!
Join us, in-person, December 7–9 in Las Vegas, for the largest gathering of the Microsoft community in the world.