I currently use Power BI Desktop to connect to Apache Hive through a Hortonworks Hive ODBC connection. In Power BI, I pass SQL-like statements to Hive so that they are processed on the server and Hive returns only the results.
My issue is that returning the data to Power BI is extremely slow. For instance, it can take up to an hour to return a "table" of about 151M records. When I query Hive from a database management tool such as DBeaver, I can get around this by running the queries through the Tez engine, using the statement below:
set hive.execution.engine=tez;
More on the Tez engine here.
Running these statements through the Tez engine takes about 1/100th of the time. (BTW: DBeaver connects to Hive through JDBC drivers, which Power BI does not yet appear to support.)
Is there a way to force Power BI to run queries through the Tez engine?
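To make the DBeaver workaround concrete: it amounts to sending the `set` statement on the same session before the actual query. Below is a minimal Python sketch of that pattern, assuming a generic DB-API 2.0 cursor already connected to Hive. The helper names `statements_with_tez` and `run_on_tez` are hypothetical, and whether Power BI's ODBC connector accepts a `set` statement this way is not confirmed here.

```python
from typing import Any, List

# The session-level setting quoted in the post above.
TEZ_SETTING = "set hive.execution.engine=tez"


def statements_with_tez(query: str) -> List[str]:
    """Return the Tez SET statement followed by the query.

    Trailing semicolons are stripped because many drivers expect
    one statement per execute() call.
    """
    return [TEZ_SETTING, query.strip().rstrip(";")]


def run_on_tez(cursor: Any, query: str) -> list:
    """Execute the SET statement and then the query on one cursor.

    `cursor` is assumed to be a DB-API 2.0 cursor connected to Hive.
    The setting applies only to the session that issued it, so both
    statements must go through the same connection.
    """
    for statement in statements_with_tez(query):
        cursor.execute(statement)
    return cursor.fetchall()
```

The key design point is that the setting is per session: issuing it on one connection and the query on another would leave the query on the default engine.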