Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.
Hello everybody.
I read some documentations about modeling using two dataflows and one dataset. The first dataflow would bring the data from the source without any transformation being applied (raw data). The second dataflow would be the layer where we would apply the necessary transformations. When we use a dataflow as a source to create a second dataflow, it comes in the model as an entity, and when we try to do some transformation we get the error message bellow. I am also leaving the image with the modeling I want to implement.
My question is: how do I apply transformations to the second Dataflow since I can't because the first one exists as an entity? What is the correct way to do this modeling?
Thank you very much.
https://dataandai.files.wordpress.com/2020/06/organizedataflows_linkedentitiesnamingconvention.png
https://i.imgur.com/acaycqg.png
Solved! Go to Solution.
A reference is just a pointer, nothing is duplicated. When a dataflow refreshes, data is read from the source and stored in data lake storage behind the scenes. A second dataflow will read from that store.
If you plan on doing large transformations (aggregations, merges, sorts) or have a large volume of data, seperate ingest and transform dataflows can be helpful, or if you want to leverage the ingest dataflow in another way.
Hello everybody.
I read some documentations about modeling using two dataflows and one dataset. The first dataflow would bring the data from the source without any transformation being applied (raw data). The second dataflow would be the layer where we would apply the necessary transformations. When we use a dataflow as a source to create a second dataflow, it comes in the model as an entity, and when we try to do some transformation we get the error message bellow. I am also leaving the image with the modeling I want to implement.
My question is: how do I apply transformations to the second Dataflow since I can't because the first one exists as an entity? What is the correct way to do this modeling?
Thank you very much.
https://dataandai.files.wordpress.com/2020/06/organizedataflows_linkedentitiesnamingconvention.png
Right click on it and create a Reference query to the Linked Table, then make the transformations you need. If the Workspace is not in Premium capacity, you'll have to uncheck "Enable Load" on the Linked Table.
Hi Jeff.
Thank you for the answer.
When I create a reference, I'm doubling the base, in terms of data, right?
Given this, is it more interesting that I have a single dataflow bringing the data from the base already with the transformation or modeling as I initially proposed?
Thanks.
A reference is just a pointer, nothing is duplicated. When a dataflow refreshes, data is read from the source and stored in data lake storage behind the scenes. A second dataflow will read from that store.
If you plan on doing large transformations (aggregations, merges, sorts) or have a large volume of data, seperate ingest and transform dataflows can be helpful, or if you want to leverage the ingest dataflow in another way.
Understood. Perfect Jeff.
Thank you so much. Have a good Day!
Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City
Check out the April 2024 Power BI update to learn about new features.