Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.
Hello, we have recently came across an issue when removing duplicates based on a column.
Column contained 12000 unique values, however, after "Remove Duplicates" step was applied, some of the values got eliminated, making the table to return only 10.5k, 10.8k or any random lower number. With every refresh, different values would get dropped off. This was done on July 2017 Desktop version, and interestingly, only some people have encountered the issue on identical file, others didn't.
We have tried Group By and counting rows for each value, no duplicates.
Column in question was nvarchar(10) fields in SQL Server. Couldn't reproduce issue with Excel, but can with SQL Server dataset, happy to provide PBIX file.
Lastly, inserting "UPPERCASE" step, applying it to the column in question prevented the issue, despite the fact that values all appear to be uppercase already.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.