matteoumbri
Frequent Visitor

One Dataflow vs Multiple dataflows

Hi everyone, 

 

I have several dataflows, say 1, 2 and 3 for simplicity, all getting data from a SQL Server database with native queries. They all have the same refresh needs: 8 times a day, starting at 7:30 am.

They query different data and are not linked to or dependent on each other.

What if I merge them all into one main dataflow: would the overall refresh time be higher or lower than keeping them separate? That is, will the entities refresh sequentially, one by one, or will the engine start them all at the same time and run them in parallel?
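
To make the two scenarios concrete, here is a minimal sketch with made-up entity durations. The names and numbers are purely illustrative assumptions, not measurements, and the real engine may behave somewhere in between the two cases.

```python
# Hypothetical per-entity refresh durations in seconds (made-up numbers).
durations = {"entity_1": 45, "entity_2": 120, "entity_3": 60}


def sequential_total(durations):
    """Total time if the entities refresh strictly one after another."""
    return sum(durations.values())


def parallel_total(durations):
    """Total time if all entities start together and run fully in parallel."""
    return max(durations.values())


print("sequential:", sequential_total(durations), "s")
print("parallel:  ", parallel_total(durations), "s")
```

With these numbers the sequential case is the sum of the durations and the fully parallel case is the longest single entity, so the answer to the question decides which of the two the merged dataflow would be closer to.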

Are there any concerns or drawbacks with this approach?

What if one entity refresh fails: will there be an automatic retry? Will the entities that succeeded still be saved and accessible, or will the entire dataflow be marked as "failed", so that the accessible data is that of the previous successful refresh?

Many thanks

1 ACCEPTED SOLUTION
v-eqin-msft
Community Support

Hi @matteoumbri ,

 

Dataflows that require different refresh timings can each be scheduled individually. To better understand how a dataflow refresh performs, review the Refresh History for the dataflow by navigating to Dataflow > Settings > Refresh History. You can also select the dataflow in the Workspace > context menu (…) > Refresh History.

[Image: Eyelyn9_0-1637549457062.png]

Refer to:

Understanding and optimizing dataflows refresh - Power BI | Microsoft Docs

Develop solutions with dataflows - Power BI | Microsoft Docs

 

 

Best Regards,
Eyelyn Qin
If this post helps, please consider accepting it as the solution to help other members find it more quickly.


2 REPLIES
Hi @v-eqin-msft ,

Thanks for the reply. I checked the Refresh History CSV files, which show the refresh duration of each individual entity in a dataflow, and found some interesting data:

- even very small entities (based on SELECT statements on very small SQL Server tables) always take at least 30 seconds to refresh, never less

- exactly the same query always takes less time when it refreshes an entity inside a dataflow with multiple entities than when it runs in a dataflow with only 1 entity. Weird, I would have assumed the durations to be the same
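
For anyone who wants to repeat this comparison, below is a minimal sketch of how per-entity durations could be computed from an exported Refresh History CSV. The column names ("Entity name", "Start time", "End time"), the timestamp format and the sample rows are all assumptions for illustration; adjust them to whatever the exported file actually contains.

```python
import csv
import io
from datetime import datetime


def entity_durations(csv_text, name_col="Entity name",
                     start_col="Start time", end_col="End time"):
    """Return {entity name: duration in seconds} from refresh-history CSV text.

    The column names are assumed, not taken from a real export.
    """
    result = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Assumes ISO-like timestamps such as "2021-11-22 07:30:00".
        start = datetime.fromisoformat(row[start_col])
        end = datetime.fromisoformat(row[end_col])
        result[row[name_col]] = (end - start).total_seconds()
    return result


# Made-up sample rows standing in for an exported file.
sample = """Entity name,Start time,End time
small_table,2021-11-22 07:30:00,2021-11-22 07:30:31
big_table,2021-11-22 07:30:00,2021-11-22 07:32:10
"""

durations = entity_durations(sample)
for name, seconds in sorted(durations.items(), key=lambda kv: kv[1]):
    print(f"{name}: {seconds:.0f} s")
```

Running the same function on the export of a single-entity dataflow and on the export of a multi-entity dataflow would give two dictionaries that can be compared query by query.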

 

Anyway, if this is how it works, I am going to merge all those 7-8 queries, which have the same refresh schedule needs and are all part of the same dataset, into 1 dataflow.

Thanks

Matteo
