I've got a whole bunch of maintenance records and some guys who are sloppy about records entry. I was able to dedupe "same day" entries in the Maintenance Finish Time column by merging the park name, task type and finished columns (after I changed the timestamp to a date) and then running a dedupe on the merged column. What that doesn't work for is entries that are closed within six days of another. I need to see if there's a way to remove "near" duplicates. I've put some sample "raw" data below.
Hi @nickintosh
Please confirm your requirement.
In the table below, you need to delete the rows that are closed within six days of another.
Park Name | Maintenance Task | Maintenance Finish Time | Action |
---|---|---|---|
Alberson Park | Litter Pick Up | 11/17/2018 | |
Alberson Park | Litter Pick Up | 11/16/2018 | delete |
Alberson Park | Litter Pick Up | 11/16/2018 | delete |
Alberson Park | Litter Pick Up | 11/15/2018 | delete |
Alberson Park | Litter Pick Up | 11/14/2018 | delete |
Alberson Park | Litter Pick Up | 11/13/2018 | delete |
Alberson Park | Litter Pick Up | 11/12/2018 | delete |
Alberson Park | Litter Pick Up | 11/11/2018 | delete |
Alberson Park | Litter Pick Up | 11/10/2018 | |
Alberson Park | Litter Pick Up | 11/9/2018 | |
Alberson Park | Empty Trash Receptacle | 11/15/2018 | |
Is my understanding right?
Best Regards
Maggie
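To make the "within six days" rule concrete, here is a minimal sketch in plain Python rather than Power Query. It assumes one possible reading of the requirement: within each park and task, keep the newest record, then drop any older record finished within six days of the last record that was kept. The function name `drop_near_duplicates` and the `window_days` parameter are hypothetical, and the rows are the sample values from the table above.

```python
from datetime import date, timedelta
from itertools import groupby


def drop_near_duplicates(rows, window_days=6):
    """Keep the newest record per (park, task), then drop any older record
    finished within `window_days` of the last record that was kept.

    `rows` is an iterable of (park, task, finish_date) tuples.
    This rule is an assumption about the requirement, not a confirmed spec.
    """
    window = timedelta(days=window_days)
    # Sort by park and task, newest finish date first within each group.
    ordered = sorted(rows, key=lambda r: (r[0], r[1], -r[2].toordinal()))
    kept = []
    for _, group in groupby(ordered, key=lambda r: (r[0], r[1])):
        last_kept = None
        for row in group:
            # Keep the row only if it is more than `window` older than the last kept one.
            if last_kept is None or last_kept - row[2] > window:
                kept.append(row)
                last_kept = row[2]
    return kept


# Sample rows taken from the table in this thread.
litter_days = [17, 16, 16, 15, 14, 13, 12, 11, 10, 9]
rows = [("Alberson Park", "Litter Pick Up", date(2018, 11, d)) for d in litter_days]
rows.append(("Alberson Park", "Empty Trash Receptacle", date(2018, 11, 15)))

result = drop_near_duplicates(rows)
for park, task, finished in result:
    print(park, task, finished.isoformat(), sep=" | ")
```

Under this reading the 11/17 and 11/10 litter rows survive, along with the single trash row. The 11/9 row is dropped because it falls one day before the kept 11/10 row, which differs from the table above where 11/9 is left unmarked, so the exact rule still needs confirming.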
Hi,
My suggestion is to clean the data with the Trim and Clean options before you merge the columns; that way you can eliminate the duplicates.
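As a rough illustration of why cleaning first matters, here is a small Python sketch (not Power Query) of a trim-and-clean step applied before building the merged key. The helper names `clean_text` and `dedupe_key` are hypothetical, and the behaviour only approximates the Trim and Clean options: non-printable characters are removed, then leading and trailing whitespace is stripped.

```python
def clean_text(value):
    """Remove non-printable characters, then strip surrounding whitespace."""
    printable = "".join(ch for ch in value if ch.isprintable())
    return printable.strip()


def dedupe_key(park, task, finish_date):
    """Build the merged key from cleaned columns (finish_date is already a date string)."""
    return "|".join([clean_text(park), clean_text(task), finish_date])


# Three sloppy spellings of the same record (made-up whitespace for illustration).
raw = [
    (" Alberson Park", "Litter Pick Up", "11/16/2018"),
    ("Alberson Park\n", "Litter Pick Up ", "11/16/2018"),
    ("Alberson Park", "Litter Pick Up", "11/16/2018"),
]

keys = {dedupe_key(*row) for row in raw}
print(len(keys), "distinct key(s)")
```

Without the cleaning step these three rows would produce three different merged keys and survive a dedupe on the merged column. Cleaning only helps with stray whitespace and control characters; it does not address entries closed a few days apart.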