Earn the coveted Fabric Analytics Engineer certification. 100% off your exam for a limited time only!
I know how to remove the duplicate rows in the Query Editor. Is there a way to know how many duplcates were present?
For example, in excel when we remove duplicates, it tells us "This many duplicate rows" right? Is there a way to know that number in power bi when we execute the remove duplicate step?
Thanks for your help!
Solved! Go to Solution.
You can check all rows in power query. The top 1000 is only if the result has more than 1000. If you run "Keep duplicates" first then you can add a custom step like this to check it
= Table.RowCount(#"Keep duplicates")
Hope this helps
Happy to help!
Hi @Anonymous ,
I dont know if you know htis, there is also a wonderful features in power bi, which gives the count of Distinct in a row..
Edit queries--> View-> Column distribution.
This gives the distinct count in each row.
Thanks,
Tejaswi
I usually just load the data in then create a measure on count - distinct count.
You can check all rows in power query. The top 1000 is only if the result has more than 1000. If you run "Keep duplicates" first then you can add a custom step like this to check it
= Table.RowCount(#"Keep duplicates")
Hope this helps
Happy to help!
Hi, yes you can. First select the columns you want to check for duplicates. Then keep duplicates:
Then you can group by and count if there are a lot of rows.
After you are sure of what the data is showing you can remove duplicates if you are sure.
Another way is to create a DAX table or connnect with DAX Studio and write a query like this:
EVALUATE FILTER ( ADDCOLUMNS ( SUMMARIZE ( muestradatos, muestradatos[Afiliado] ), "CantDuplicados", CALCULATE ( COUNTROWS ( muestradatos ) ) ), [CantDuplicados] > 1 )
Column1 should be the field you want to check for duplicates. You can add more columns or check for all. The result of that query should be which columns have duplicates and a count of them.
Hope this helps,
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.
Happy to help!
You can do a GROUPBY in power query and add columns for count and distinct count.
User | Count |
---|---|
140 | |
113 | |
104 | |
76 | |
63 |
User | Count |
---|---|
135 | |
126 | |
110 | |
70 | |
61 |