Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Earn the coveted Fabric Analytics Engineer certification. 100% off your exam for a limited time only!

Reply
aggiedan97
New Member

Remove Duplicates creating a copy not filtering source

I have a large source data set and am attempting to extract a unique sub-set of data. All the instructions I've read reference the same instructions making a duplicate of the source table with reference, removing unnecessary columns and removing duplicate rows. Every time a change is made in subsequent steps, power query re-reads the original source file.

 

My goal is for the distince sub-set to only contain the distinct rows, not a filtered sub-set. For example, if I have 500,000 transactions in the original data set and select 3 columns for a dimension with 100 distinct records, I expect the dimension table to only have 100 rows and subsequent changes will only evaluate 100 records, not the orginal 500,000.

 

A dimension table should be additive and full scans should not be necessary of the orginating dataset every time a subsequent metadata change is made.

1 ACCEPTED SOLUTION
watkinnc
Super User
Super User

I would actually duplicate the table instead of referencing it. This way it goes through the same logic, but is a separate query. Then you can make your distinct table that does not force the other table to evaluate. You might also wish to disable loading of the original query, if you are using it in other queries.

--Nate


I’m usually answering from my phone, which means the results are visualized only in my mind. You’ll need to use my answer to know that it works—but it will work!!

View solution in original post

2 REPLIES 2
watkinnc
Super User
Super User

I would actually duplicate the table instead of referencing it. This way it goes through the same logic, but is a separate query. Then you can make your distinct table that does not force the other table to evaluate. You might also wish to disable loading of the original query, if you are using it in other queries.

--Nate


I’m usually answering from my phone, which means the results are visualized only in my mind. You’ll need to use my answer to know that it works—but it will work!!
V-pazhen-msft
Community Support
Community Support

@aggiedan97 
You can just create a table visual with only required data by using filters, slicer or other features in power bi. Then you should able to export the summarized data only.Export data from a Power BI visualization - Power BI | Microsoft Docs

 


Paul Zheng _ Community Support Team
If this post helps, please Accept it as the solution to help the other members find it more quickly.

Helpful resources

Announcements
April AMA free

Microsoft Fabric AMA Livestream

Join us Tuesday, April 09, 9:00 – 10:00 AM PST for a live, expert-led Q&A session on all things Microsoft Fabric!

March Fabric Community Update

Fabric Community Update - March 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors
Top Kudoed Authors