Anonymous
Not applicable

Data Flow refresh - parallel or separately

Hello,

 

I was exploring the Power BI Data Flow service and it looks very promising.

 

However, I couldn't find any information about how the Data Flow Service refreshes its entities.

In Power BI Desktop, it is possible to enable or disable parallel loading of tables in the settings. I couldn't find a comparable option in the Data Flow Service, or any documentation of how the entities are refreshed.

In some situations we prefer to disable parallel loading of tables and run the queries one at a time, because our data source (an OData service) can become overloaded when many queries run in parallel.

 

My questions are:

Does the Data Flow service load the entities in parallel or separately?

Is it possible to enable or disable parallel loading for entities in Data Flow?

 

If I understood correctly, with the Data Flow Service the data is loaded to Azure Data Lake Storage, and we then use the Data Flow as our data source when creating reports in Power BI Desktop. In that setup, Power BI Desktop's parallel loading option does not affect the original data source, so it cannot be used to control the load on our server when we build reports on Data Flows.

 

1 ACCEPTED SOLUTION
v-yuta-msft
Community Support

@Anonymous ,

 

Currently there is no documentation that clarifies the working details of dataflow entity refresh. It seems to be more related to Azure Data Lake Storage, so I would suggest posting this question to the Azure forum to get more help.

 

Community Support Team _ Jimmy Tao

If this post helps, please consider accepting it as the solution to help other members find it more quickly.


3 REPLIES
Anonymous
Not applicable

I have the same problem: I need to control the order in which the entities in a Data Flow refresh in order to avoid API call quota limits on the source side (Google Analytics).

 

To me, this has nothing to do with the Data Lake store but rather with how Data Flows decides to handle the refresh calls to the data source.

 

Is there any documentation that explains how multiple entities in a flow are refreshed (sequentially or in parallel)?

 

Thanks

 

Bogdan

Any update on this issue? I am using Invoke After as a workaround, as I have multiple tables using the same API source and I'm hitting the quota limit. But the refresh takes a long time because I leave excessive breathing room between queries, since the API is sometimes slow.
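
The trade-off described here (a fixed, generous delay versus hitting the quota) can be illustrated outside Power Query. The following is only a minimal Python sketch of the general idea, not something that runs inside a Data Flow and not how the service schedules refreshes. The function name `run_sequentially` and its parameters are hypothetical. It runs queries one at a time and waits only for the part of the minimum interval that the previous query did not already use, so a slow API response does not add extra waiting on top.

```python
import time
from typing import Callable, List, Optional, TypeVar

T = TypeVar("T")


def run_sequentially(
    queries: List[Callable[[], T]],
    min_interval: float,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> List[T]:
    """Run queries one at a time, with starts at least min_interval seconds apart.

    Hypothetical helper for illustration; clock and sleep are injectable
    so the pacing logic can be checked without real waiting.
    """
    results: List[T] = []
    last_start: Optional[float] = None
    for query in queries:
        if last_start is not None:
            # Time the previous query already consumed counts toward the interval.
            remaining = min_interval - (clock() - last_start)
            if remaining > 0:
                sleep(remaining)
        last_start = clock()
        results.append(query())
    return results
```

With a 2 second interval, a query that already took 3 seconds is followed immediately by the next one, while a query that took 0.5 seconds is followed by a 1.5 second pause.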
