Earn the coveted Fabric Analytics Engineer certification. 100% off your exam for a limited time only!
When I am building my data model and delete the duplicates in a query, is there a way to see how many duplicate rows will be removed?
Also, is there a way to see which rows are to be duplicated?
Thank you.
Solved! Go to Solution.
Hello @WillBeeSEA
add a Index-columnn, then remove duplicates without checking the Index-Column. Afterwards you can filter your Original table. Here the M-code
let
Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45WMlTSUXJUitVBZhmhsJzALGO4GIJlApGNBQA=", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Key = _t, Text = _t]),
#"Changed Type" = Table.TransformColumnTypes(Source,{{"Key", Int64.Type}, {"Text", type text}}),
#"Added Index" = Table.AddIndexColumn(#"Changed Type", "Index", 0, 1),
RemoveDuplicates = Table.Distinct(#"Added Index", {"Key", "Text"}),
RemovedRows = Table.SelectRows(#"Added Index", each not List.Contains(RemoveDuplicates[Index], [Index]))
in
RemovedRows
Copy paste this code to the advanced editor in a new blank query to see how the solution works.
If this post helps or solves your problem, please mark it as solution (to help other users find useful content and to acknowledge the work of users that helped you)
Kudoes are nice too
Have fun
Jimmy
Hello @WillBeeSEA
add a Index-columnn, then remove duplicates without checking the Index-Column. Afterwards you can filter your Original table. Here the M-code
let
Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45WMlTSUXJUitVBZhmhsJzALGO4GIJlApGNBQA=", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Key = _t, Text = _t]),
#"Changed Type" = Table.TransformColumnTypes(Source,{{"Key", Int64.Type}, {"Text", type text}}),
#"Added Index" = Table.AddIndexColumn(#"Changed Type", "Index", 0, 1),
RemoveDuplicates = Table.Distinct(#"Added Index", {"Key", "Text"}),
RemovedRows = Table.SelectRows(#"Added Index", each not List.Contains(RemoveDuplicates[Index], [Index]))
in
RemovedRows
Copy paste this code to the advanced editor in a new blank query to see how the solution works.
If this post helps or solves your problem, please mark it as solution (to help other users find useful content and to acknowledge the work of users that helped you)
Kudoes are nice too
Have fun
Jimmy
You can do a group by (add all the columns as keys), and add a Count column.