Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Earn the coveted Fabric Analytics Engineer certification. 100% off your exam for a limited time only!

Reply
Wilhelm
Advocate II
Advocate II

Remove duplicates not removing all duplicates

I have a table with a single column which I remove duplicates from (default function).  When I view the data, there are 7228 rows and 7214 distinct values.  What am I missing?

2018-10-12_10-08-11.png2018-10-12_09-59-23.png

 

 

 

 

 

 

 

2 REPLIES 2
v-chuncz-msft
Community Support
Community Support

@Wilhelm,

 

This may be caused by case sensitivity or trailing blanks. You may add a calculated column to have a check.

Column =
COUNTROWS ( FILTER ( Table1, Table1[Column1] = EARLIER ( Table1[Column1] ) ) )
Community Support Team _ Sam Zha
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Thanks, @v-chuncz-msft.  That's a great idea to check for text interpretation (case, spaces), however, I have removed all spaces using trim() and also confirmed that case is correct.

 

When filtering for a known duplicate before removing duplicates, all duplicates are indeed removed, but when the filtering step is deleted, the remove duplicates step leaves some of the duplicates.  This indicates that it is rather a set quantity / order issue than a text interpretation issue.  Any thoughts?

Helpful resources

Announcements
April AMA free

Microsoft Fabric AMA Livestream

Join us Tuesday, April 09, 9:00 – 10:00 AM PST for a live, expert-led Q&A session on all things Microsoft Fabric!

March Fabric Community Update

Fabric Community Update - March 2024

Find out what's new and trending in the Fabric Community.