AdamFry
Helper I

Writing to a Lakehouse in different workspace from Notebook

Hi there, I've been struggling to find good documentation on how to read data from a lakehouse in one workspace and, after applying some transformations, write it to a different lakehouse in a different workspace. Is this possible? I have the following workspaces:

WORKSPACE_BRONZE that contains LAKEHOUSE_BRONZE

WORKSPACE_SILVER that contains LAKEHOUSE_SILVER

 

LAKEHOUSE_BRONZE has a CSV file in the Files section. I have a notebook and have added both lakehouses to it. I have some code like this to read the file:

 

# Read the CSV from the Files section of the attached (default) lakehouse
FILE_TO_PROCESS = "MYFILE.csv"
BASE_PATH_TO_FOLDER = "MY/FOLDER/PENDING/"
df = spark.read.format("csv").option("header", "true").load(BASE_PATH_TO_FOLDER + FILE_TO_PROCESS)
 
After applying some schema validations and transformations (adding a column for the file name, casting strings to their actual types, and renaming columns to remove spaces), my dataframe looks good, and now I'd like to append it to a delta table in my silver lakehouse.
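To make that concrete, the transformations look roughly like this (the column names and types here are hypothetical placeholders, not my actual schema):

from pyspark.sql.functions import col, input_file_name

# Illustrative only: add the source file name, cast a string column to its
# actual type, and rename a column to remove the space
df = (
    df.withColumn("source_file", input_file_name())
      .withColumn("amount", col("amount").cast("double"))
      .withColumnRenamed("Order Date", "order_date")
)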
 
When I do the following: 
df.write.format("delta").mode("append").option("delta.columnMapping.mode", "name").saveAsTable("my_special_table")
 
It writes to the bronze (default) lakehouse. I've tried prefixing the table name with LAKEHOUSE_SILVER, but I get an error that the schema is not found:
 
df.write.format("delta").mode("append").option("delta.columnMapping.mode", "name").saveAsTable("LAKEHOUSE_SILVER.my_special_table")
 
One thing I tried was making the silver lakehouse the default lakehouse and then providing the full abfss file path when reading the file from bronze. That actually works, but there could be scenarios where I have multiple lakehouse sources across multiple workspaces, and I won't be able to solve those by juggling the default lakehouse. So in general, it would be nice to understand how I can explicitly write to a given lakehouse in a given workspace, but I am struggling to find the syntax. Can anyone point me to documentation or help me understand the syntax?
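For illustration, the abfss-style read that worked for me looks roughly like this (using the workspace and lakehouse names above, and assuming the file sits under the Files section):

df = spark.read.format("csv").option("header", "true").load(
    "abfss://WORKSPACE_BRONZE@onelake.dfs.fabric.microsoft.com/LAKEHOUSE_BRONZE.Lakehouse/Files/MY/FOLDER/PENDING/MYFILE.csv"
)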
 
Thank you very much in advance if anyone can shed some light here!
1 ACCEPTED SOLUTION
frithjof_v
Continued Contributor

You can use the fully qualified path to write to a Lakehouse in another workspace. 

 

Please see this article; it helped me:

https://murggu.medium.com/databricks-and-fabric-writing-to-onelake-and-adls-gen2-671dcf24cf33

 

So, to write to a table (new or existing) in a Lakehouse in another workspace, I think it is possible to write it like this:

df.write.format("delta").mode("append").save(f"abfss://{workspace_name}@onelake.dfs.fabric.microsoft.com/{lakehouse_name}.Lakehouse/Tables/{table_name}")
 
or, if your object names contain special characters or whitespace, you could use the IDs:
 
df.write.format("delta").mode("append").save(f"abfss://{workspace_id}@onelake.dfs.fabric.microsoft.com/{lakehouse_id}/Tables/{table_name}")
 
 
For reading you could also use the fully qualified path, as you have already done. Then I think the whole process should be independent of the default lakehouse.
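Putting it together, I think something like this should work end to end (a minimal sketch, assuming your workspace and lakehouse names have no special characters; otherwise swap in the IDs as above):

# Fully qualified OneLake paths, so neither side depends on the default lakehouse
bronze_csv = "abfss://WORKSPACE_BRONZE@onelake.dfs.fabric.microsoft.com/LAKEHOUSE_BRONZE.Lakehouse/Files/MY/FOLDER/PENDING/MYFILE.csv"
silver_table = "abfss://WORKSPACE_SILVER@onelake.dfs.fabric.microsoft.com/LAKEHOUSE_SILVER.Lakehouse/Tables/my_special_table"

df = spark.read.format("csv").option("header", "true").load(bronze_csv)
# ... apply transformations ...
df.write.format("delta").mode("append").save(silver_table)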


5 REPLIES

Thank you so much!

Hi @AdamFry,

Glad to hear that your issue got resolved. Please continue using the Fabric Community for your further queries.

Element115
Power Participant

For syntax and stuff, try asking Copilot or ChatGPT. I usually get pretty good feedback.

AdamFry
Helper I

Apologies for not using the code block for the code in my post. I tried editing my post to add it, but I got an invalid HTML error, so hopefully it's OK posted as is.
