nowreena21
Frequent Visitor

File size doubles when loading from Azure Blob Storage to Power BI Desktop

Hello Community,

I am facing an issue while loading a CSV file from Azure Blob Storage. The file is around 4 GB in Azure Blob Storage, but Power BI Desktop loads more than 8 GB for it.

I have checked the following:
1. My file has only one data source connected (Azure Blob Storage).
2. I have only one page in the .pbix file.

Can anyone help with this?

2 ACCEPTED SOLUTIONS
lbendlin
Super User

You know that Power BI loads certain files twice, and runs queries twice, to get to the metadata, right?

Apart from the observed doubling in network traffic, are there any other adverse effects? Did the dataset size increase accordingly, or did it stay the same?
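
As a rough sketch of the arithmetic, assuming the whole file is read once for the metadata and once for the actual load (the helper name `estimated_transfer_gb` is hypothetical and not part of any Power BI API):

```python
def estimated_transfer_gb(file_size_gb: float, passes: int = 2) -> float:
    """Estimate the data pulled from storage when a file is read `passes` times.

    Assumption: each pass reads the file in full. The default of two passes
    reflects the "loads certain files twice" behaviour described above.
    """
    if file_size_gb < 0 or passes < 1:
        raise ValueError("file_size_gb must be >= 0 and passes must be >= 1")
    return file_size_gb * passes


# A file of about 4 GB read twice accounts for roughly 8 GB of traffic.
print(estimated_transfer_gb(4.0))
```

This only estimates network traffic. It says nothing about the size of the compressed dataset stored in the .pbix file.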


lbendlin
Super User

10 GB is only the limit for the initial dataset size. If you set the dataset to incremental refresh, the dataset can grow beyond 10 GB during subsequent refreshes.

REPLIES
nowreena21
Frequent Visitor

Thank you @lbendlin,

This makes sense. The .pbix file is compressed significantly and nothing is duplicated; only the load size is doubled, which makes the refresh take longer.

Also, do you have any idea about the load size limit in Power BI? I read that Power BI Premium supports a 10 GB dataset size, so I want to make sure this won't cause a load error in the future if the file grows beyond 10 GB.

