Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
LenFi
Frequent Visitor

Does importing a PBI Dataflow into a semantic model duplicate the source data?

Let's take an example where you use import mode to get source data into a dataflow. You then combine multiple dataflows where your output is your dataflow entity. Once you import your dataflow entity into a semantic model(dataset), does that mean that your source data is duplicated? 

This is the response I got from Chat GPT, but I cannot find any documentation regarding this: 
"Imported Semantic Model: When you import data from a dataflow into a Power BI dataset, you're essentially creating a semantic model based on the dataflow's entities. However, this process doesn't duplicate the underlying data. Instead, it establishes metadata references to the data stored in the dataflow. The imported semantic model provides a structured representation of the data for analysis and visualization purposes.

Therefore, while you may have multiple artifacts (dataflows, datasets) referencing the same underlying data, there's no duplication of the actual data. This architecture promotes data consistency, reusability, and efficient management of data assets within the Power BI ecosystem."

Has anyone looked into this and can perhaps point to more information? Thank you!!

1 ACCEPTED SOLUTION
lbendlin
Super User
Super User

Once you import your dataflow entity into a semantic model(dataset), does that mean that your source data is duplicated? 

yes, yes it does.   Only use dataflows if they provide value. Avoid using them if the Semantic Model can get the same data from the original source reliably and with good performance.  Or consider using Direct Query where appropriate.

View solution in original post

1 REPLY 1
lbendlin
Super User
Super User

Once you import your dataflow entity into a semantic model(dataset), does that mean that your source data is duplicated? 

yes, yes it does.   Only use dataflows if they provide value. Avoid using them if the Semantic Model can get the same data from the original source reliably and with good performance.  Or consider using Direct Query where appropriate.

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors
Top Kudoed Authors