rishirajdeb
Advocate I

Incrementally refresh datasets from dataflows

Hi All,

In my current project, we use dataflows to perform data transformations centrally and reuse the dataflow entities across different reports. The models are built mainly from the dataflows, and the reports are live-connected to the datasets (models).

The data volume we are dealing with is large, so we intend to implement incremental refresh with a policy that refreshes only the partitions covering the last 2 months of data.
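To make the intended policy concrete, here is a minimal Python sketch of how a monthly window could be split into partitions that are kept and partitions that are refreshed. It is only an illustration, not how the Power BI service computes partitions internally. The function names, the 10-month archive length, and the example date are all assumptions. Only the 2-month refresh window comes from the post. The sketch also assumes the current (partial) month counts as one of the 2 refreshed months.

```python
from datetime import date


def add_months(first_of_month: date, n: int) -> date:
    """Shift a first-of-month date by n months (n may be negative)."""
    idx = first_of_month.year * 12 + (first_of_month.month - 1) + n
    return date(idx // 12, idx % 12 + 1, 1)


def plan_partitions(today: date, archive_months: int, refresh_months: int):
    """Return (name, range_start, range_end, action) per monthly partition.

    Hypothetical helper: the newest `refresh_months` partitions are marked
    "refresh", the older `archive_months` partitions are marked "keep".
    """
    current = today.replace(day=1)
    total = archive_months + refresh_months
    plan = []
    for offset in range(total - 1, -1, -1):  # oldest partition first
        start = add_months(current, -offset)
        end = add_months(start, 1)  # exclusive upper bound
        action = "refresh" if offset < refresh_months else "keep"
        plan.append((start.strftime("%Y-%m"), start, end, action))
    return plan


# Assumed example values: 10 archived months, 2 refreshed months.
plan = plan_partitions(date(2024, 4, 20), archive_months=10, refresh_months=2)
for name, start, end, action in plan:
    print(name, start, end, action)
```

Each partition covers a half-open range (start inclusive, end exclusive), which mirrors the usual RangeStart/RangeEnd filter pattern and avoids loading a boundary row into two partitions.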

For the dataflows themselves, this is well documented: we can set up incremental refresh as required, and it is working fine in my tests. However, I could not find any official documentation on incremental refresh for datasets that load their data from dataflows. This matters to us because otherwise we would end up doing a full truncate and load on every model refresh (when refreshing datasets from dataflows).

I did some testing on this, and to my surprise it seems to work for datasets as well! I even connected to the dataset's XMLA endpoint from SSMS (we are on Premium capacity, BTW) and checked the individual partitions. The timestamps suggest that after the initial full load, every subsequent refresh touched only the months that fall within the refresh window of the incremental refresh policy.
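The timestamp check described above can be sketched in Python as follows. This is a rough illustration under stated assumptions: the partition names and timestamps are made-up sample values standing in for what one might copy out of SSMS, and `refreshed_since` is a hypothetical helper, not part of any Power BI tooling.

```python
from datetime import datetime


def refreshed_since(partitions: dict, cutoff: datetime) -> list:
    """Return names of partitions whose last-refreshed time is after cutoff."""
    return sorted(name for name, ts in partitions.items() if ts > cutoff)


# Hypothetical sample data: partition name -> last refreshed timestamp.
initial_load = datetime(2024, 4, 1, 2, 0)
partitions = {
    "2024-01": initial_load,
    "2024-02": initial_load,
    "2024-03": datetime(2024, 4, 20, 2, 5),
    "2024-04": datetime(2024, 4, 20, 2, 5),
}

# Partitions touched by any refresh after the initial full load.
touched = refreshed_since(partitions, cutoff=initial_load)
print(touched)
```

If only the partitions inside the refresh window show up in the result, that is consistent with the policy being applied incrementally rather than as a full reload.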

However, everything I found online suggests the opposite: because query folding does not happen against a dataflow, incremental refresh would not work. Can anyone please help me understand whether this is a recent update from Microsoft and, if so, whether there is an official documentation link we can refer to for best practices etc.? (This would give us more confidence in the project design.)

Note: I tried this separately with the dataflow ‘enhanced compute engine’ set to ‘optimized’ and to ‘On’, and it seems to work in both cases!

 

** Sorry for posting this under 'Desktop'. Only after posting did I realize it should have gone under 'Service', but I haven't found any option to switch the product. Hope that's fine?

 

@GilbertQ  @ibarrau @v-janeyg-msft @Burningsuit @lbendlin 

GilbertQ
Super User

Hi @rishirajdeb 

 

As far as I am aware, what you are doing is valid and will work with incremental refresh.

 

Incremental refresh does work on dataflows, and it also works on the dataset. If you have the "Enhanced Compute Engine" enabled, that should make incremental refresh on your dataset even quicker.




