My setup: PowerBI Desktop connecting to Hiveserver2 using Hortonworks ODBC driver
Issue: When I issue "Get Data" and load a Hive table, PowerBI loads all data from the table. Since DirectQuery is not supported for ODBC sources, this data is now within my laptop. Then I perform "Edit queries" and try to "Add new column" and then press "Close & Apply". When I do this, I see that PowerBI queries the Hive table again for all data i.e. importing the table again fully. This is an un-necessary step and slows down my report building process significantly. I am on a high-latency line.
Since DirectQuery is not supported in ODBC , I fail to understand why PowerBI is repeatedly importing this data without any valid rationale. Please help.
Thanks for your support.
In Power Query, once you make changes which will add steps, it will execute entire Power Query when you "Save & Close" Query Editor. That's the reason why it will re-import the table.
Thanks for the answer.
If PBI is creating the new column through a HS2 query, then we can definitely see the rationale.
But it looks like a regular import of the entire table again – which hardly serves any purpose. It hurts when we work on a high-latency line.
Am I missing something here?