Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.
I'm tyring to remove duplicate on the ID column but would like to keep the row based on the date with the lowest value. Basically, keep the ID with the earliest instance. I tried the Sort -> Table.Buffer -> Remove Duplicate approach but that process is incredibly slow most likely because the dataset is an append of multiple large CSV files.
Are there other approaches that are more efficient?
Thanks in advance!
David
Solved! Go to Solution.
I'd suggest doing a Group By in the query editor. Group by ID and use Min as the aggregation type for the Date column.
Hi @dyabes
Attached the sample file for your reference.
Regards,
Cherie
Hi Alexis
I am also facing similar issue: I have three columns : Supplier Name, Status and Points. there is duplicate value in Supplier and unique value in Status and Points. I need to display the Supplier with lowest points and display corresponding status. Basically need to remove duplicate suppliers keeping the lowest points record
Regards
Arun
I'd suggest doing a Group By in the query editor. Group by ID and use Min as the aggregation type for the Date column.
Thank you. I think I should have provided the complete dataset I'm wokring on. I also need to keep the corresponding row values from other columns
Hi @dyabes
Attached the sample file for your reference.
Regards,
Cherie
Hi Dear
You need to use Group By.
Select Group By -> Select ID column as Group BY and then in Operation select Min and in Column Select Date.
You will get your result.
You can do what I suggested an then merge the extra column(s) back in after.
Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City
Check out the April 2024 Power BI update to learn about new features.
User | Count |
---|---|
110 | |
95 | |
76 | |
65 | |
51 |
User | Count |
---|---|
146 | |
109 | |
106 | |
88 | |
61 |