Anonymous

[Analysing Duplicate Values] Comparing multiple rows for similar values across multiple columns

I am analysing duplicate entries in a survey. I want to compare the duplicates to decide which one to keep before deleting the rest.

 

I learned how to group the entries by the unique staff ID, and using the Count Rows aggregation I got the screen below:

 

pbicylim_1-1610873468070.png

 

My next step was to click an individual entry (for example, A0612) to see all of its duplicate entries. In the example below, one unique staff ID has 8 entries. This could be because the staff member submitted multiple survey responses.

 

pbicylim_0-1610873441704.png

 

Question:

1. I have over 30 columns. How do I quickly analyse how similar the row entries are? My initial thought was to unpivot the other columns and compare the values under each attribute (column header). Is there an easier method than this?

 

2. After I have decided which duplicates to delete, how do I ensure these changes are reflected in the master sheet? The long way would be to delete each row manually, one by one. Is there a faster way?

1 REPLY
artemus
Employee

You can use Table.FuzzyGroup to accomplish this. It works like Table.Group, but it groups rows whose key column values are similar rather than requiring exact matches.

 

Start by unpivoting the data into attribute/value pairs (Question/Value).

Then group by all the columns, using Count Rows as the aggregation.

Finally, replace the Table.Group function in that step with the Table.FuzzyGroup function.
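
As a rough sketch of those three steps, something like the query below might work. The sample table, the column names (StaffID, Q1, Q2) and the option values (IgnoreCase, IgnoreSpace, Threshold = 0.8) are assumptions made up for illustration and are not from the original post; adjust them to your own data.

```powerquery
let
    // Hypothetical sample data standing in for the survey table;
    // StaffID, Q1 and Q2 are invented column names.
    Source = #table(
        type table [StaffID = text, Q1 = text, Q2 = text],
        {
            {"A0612", "Agree", "Yes"},
            {"A0612", "Agree", "No"},
            {"A0612", "agree ", "Yes"},
            {"B0101", "Disagree", "Yes"}
        }
    ),

    // Step 1: unpivot every column except the staff ID into Question/Value pairs.
    Unpivoted = Table.UnpivotOtherColumns(Source, {"StaffID"}, "Question", "Value"),

    // Step 2: exact grouping on all columns with a row count.
    // Kept only to show the step that the fuzzy version replaces.
    ExactGroups = Table.Group(
        Unpivoted,
        {"StaffID", "Question", "Value"},
        {{"Count", each Table.RowCount(_), Int64.Type}}
    ),

    // Step 3: same grouping, but with Table.FuzzyGroup so that near-identical
    // text values can fall into the same group. The options record is optional;
    // the values here are assumed starting points, not recommendations.
    FuzzyGroups = Table.FuzzyGroup(
        Unpivoted,
        {"StaffID", "Question", "Value"},
        {{"Count", each Table.RowCount(_), Int64.Type}},
        [IgnoreCase = true, IgnoreSpace = true, Threshold = 0.8]
    )
in
    FuzzyGroups
```

A Count equal to the number of entries for a staff ID suggests every duplicate gave a similar answer to that question, while smaller counts point to the questions where the duplicates disagree. Note that fuzzy matching applies to text key columns, and because the staff ID is one of the keys here, very similar IDs could end up grouped together if the threshold is set too low.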

 
