Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
Anonymous
Not applicable

Extract tables from word document

Hi there

Does anyone manage to scrape data from word documents via PowerBI / Power Query?

Is this doable?

Thank you so much

 

1 ACCEPTED SOLUTION
Anonymous
Not applicable

@MattAllington has a pattern to extract tables into Power BI from Word saved as HTML - see http://exceleratorbi.com.au/import-tabular-data-pdf-using-power-query/

View solution in original post

2 REPLIES 2
v-caliao-msft
Employee
Employee

Hi @Anonymous,

 

Yes, we can achieve this requirement. You need to leverage the fact that Microsoft Word .docx files are actually ZIP files containing a group of XML files.  We will decompress the ZIP file and parse the XML to pull information into Power Query.

 

Reference
http://www.excelandpowerbi.com/?p=201
http://www.excelandpowerbi.com/?p=146

 

Regards,

Charlie Liao

Anonymous
Not applicable

@MattAllington has a pattern to extract tables into Power BI from Word saved as HTML - see http://exceleratorbi.com.au/import-tabular-data-pdf-using-power-query/

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.