
Anonymous
Not applicable

PBI Desktop connection to Apache Spark

Hi. Can someone guide me on how to connect Power BI Desktop to Apache Spark installed on a local Windows machine? What server details should I enter?

I have seen guidance for Databricks and HDInsight, but I need guidance for connecting to Spark installed on a local machine. Any help is appreciated.

v-xuding-msft
Community Support

Hi @Anonymous ,

You could try the following steps.

1. Add the properties below to /home/user/spark/conf/hive-site.xml and /home/user/apachehive/conf/hive-site.xml:

<property>
<name>hive.server2.transport.mode</name>
<value>http</value>
</property>
<property>
<name>hive.server2.thrift.http.port</name>
<value>10001</value>
</property>
<property>
<name>hive.server2.http.endpoint</name>
<value>cliservice</value>
</property>
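
As a rough sketch, the three properties from step 1 could also be generated with a short Python script instead of being typed by hand. It assumes the usual hive-site.xml layout, where the <property> elements sit inside a <configuration> root element. The function name build_hive_site is made up for illustration, and ET.indent needs Python 3.9 or later.

```python
import xml.etree.ElementTree as ET

# Property names and values copied from step 1 of this reply.
HTTP_MODE_PROPERTIES = {
    "hive.server2.transport.mode": "http",
    "hive.server2.thrift.http.port": "10001",
    "hive.server2.http.endpoint": "cliservice",
}


def build_hive_site(properties):
    """Return hive-site.xml content (as a string) for the given properties.

    Hypothetical helper: it only builds the XML text and does not touch
    any existing hive-site.xml file.
    """
    root = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    ET.indent(root)  # pretty-print; available from Python 3.9
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(build_hive_site(HTTP_MODE_PROPERTIES))
```

If a hive-site.xml already exists, the printed <property> elements would need to be merged into its existing <configuration> element rather than replacing the file.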

2. Start the Hive metastore, then Hive, then the Spark Thrift server. The commands are:

cd /home/user/apachehive/bin/   # go to the Hive bin directory
./hive --service metastore &    # start the metastore in the background
./hive                          # start Hive
cd /home/user/spark/sbin/       # go to the Spark sbin directory
./start-thriftserver.sh         # start the Spark Thrift server
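
Before moving on to Power BI, it may help to confirm that the Thrift server is actually listening on the HTTP port from step 1. Below is a minimal check in Python, assuming the server runs on the same machine (localhost) and uses port 10001. The helper name is_port_open is hypothetical, and the check only tests that something accepts TCP connections on the port, not that it is really the Spark Thrift server.

```python
import socket


def is_port_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Assumed values: localhost and the port configured in hive-site.xml.
    host, port = "localhost", 10001
    state = "open" if is_port_open(host, port) else "not reachable"
    print(f"{host}:{port} is {state}")
```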

3. Open Power BI Desktop, click Get Data, select Spark, and click Connect.


4. In the Server field, enter http://hostipaddress:10001/cliservice

Select HTTP as the Protocol.

For Data Connectivity mode, choose DirectQuery (query the data in place without importing it) or Import (load the data into Power BI first and then work on it).
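
The Server value is just the host, port, and endpoint put together, so it has to match the port and endpoint set in hive-site.xml. Here is a small sketch of how the pieces fit. The function name spark_server_url and the address 192.168.1.20 are placeholders for illustration, not values from any real setup.

```python
from urllib.parse import urlunsplit


def spark_server_url(host, port=10001, endpoint="cliservice"):
    """Build the Server value from the host plus the hive-site.xml settings.

    The defaults mirror hive.server2.thrift.http.port and
    hive.server2.http.endpoint from step 1.
    """
    path = "/" + endpoint.strip("/")
    return urlunsplit(("http", f"{host}:{port}", path, "", ""))


if __name__ == "__main__":
    # Placeholder IP address; use the machine where the Thrift server runs.
    print(spark_server_url("192.168.1.20"))
    # prints http://192.168.1.20:10001/cliservice
```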


5. Sign in with your account, then choose the tables you want and load them.

 

Reference: Connect PowerBI with Spark

 

Best Regards,

Xue Ding

If this post helps, please consider accepting it as the solution to help other members find it more quickly.

Anonymous
Not applicable

Mine is a Windows installation. The given paths seem to be for Linux? I can't see a hive-site.xml file under the /spark/conf folder. I also don't see an /apachehive/conf folder in my Spark installation. My Spark installation is the latest available Spark release. Please guide.
Anonymous
Not applicable

Any suggestions here, please?
