Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.
Greetings,
I have the following tables. I would like to remove rows if WHStatus (green) is DPi OR DPa. And row would be duplicate if all columns in red are the same.
So if I have 6 columns wiht WHStatus of TPa, they are not duplicate. They're separate data.
If I have 2 columns with DPi and the same piece/user, they are duplicate.
Solved! Go to Solution.
hello, @JBDelvaux
let
Source = your_table,
dp = {"DPi", "DPa"},
dist = {"UserId", "WHStatus", "Journal", "Piece"},
whs_groups = Table.Group(Source, "WHStatus", {{"all", each _}}),
mod_tbl =
Table.AddColumn(
whs_groups, "mod",
(x) =>
if List.Contains(dp, x[WHStatus])
then Table.Distinct(x[all], dist)
else x[all]
),
z = Table.Combine(mod_tbl[mod])
in
z
hello, @JBDelvaux
let
Source = your_table,
dp = {"DPi", "DPa"},
dist = {"UserId", "WHStatus", "Journal", "Piece"},
whs_groups = Table.Group(Source, "WHStatus", {{"all", each _}}),
mod_tbl =
Table.AddColumn(
whs_groups, "mod",
(x) =>
if List.Contains(dp, x[WHStatus])
then Table.Distinct(x[all], dist)
else x[all]
),
z = Table.Combine(mod_tbl[mod])
in
z
I was more thinking along the lines of
- cutting the table into two parts based on X
- deduplicate the affected part
- append them back together
@lbendlin no doubt this works either. I wonder what is faster on hudge dataset.
There's always Table.Buffer 🙂
Please provide sample data (with sensitive information removed) that covers your issue or question completely, in a usable format (not as a screenshot). Leave out anything not related to the issue.
If you are unsure how to do that please refer to https://community.fabric.microsoft.com/t5/Community-Blog/How-to-provide-sample-data-in-the-Power-BI-...
Please show the expected outcome based on the sample data you provided.
If you want to get answers faster please refer to https://community.fabric.microsoft.com/t5/Desktop/How-to-Get-Your-Question-Answered-Quickly/m-p/1447...