Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.

Reply
JBDelvaux
Helper I
Helper I

How to remove duplicate Only if column is X

Greetings,

I have the following tables. I would like to remove rows if WHStatus (green) is DPi OR DPa. And row would be duplicate if all columns in red are the same.

So if I have 6 columns wiht WHStatus of TPa, they are not duplicate. They're separate data.

If I have 2 columns with DPi and the same piece/user, they are duplicate.

JBDelvaux_0-1695887736768.png

 

1 ACCEPTED SOLUTION
AlienSx
Super User
Super User

hello, @JBDelvaux 

let
    Source = your_table,
    dp = {"DPi", "DPa"},
    dist = {"UserId", "WHStatus", "Journal", "Piece"},
    whs_groups = Table.Group(Source, "WHStatus", {{"all", each _}}),
    mod_tbl = 
        Table.AddColumn(
            whs_groups, "mod", 
            (x) => 
                if List.Contains(dp, x[WHStatus])
                then Table.Distinct(x[all], dist)
                else x[all]
        ),
    z = Table.Combine(mod_tbl[mod])
    
in
    z

View solution in original post

5 REPLIES 5
AlienSx
Super User
Super User

hello, @JBDelvaux 

let
    Source = your_table,
    dp = {"DPi", "DPa"},
    dist = {"UserId", "WHStatus", "Journal", "Piece"},
    whs_groups = Table.Group(Source, "WHStatus", {{"all", each _}}),
    mod_tbl = 
        Table.AddColumn(
            whs_groups, "mod", 
            (x) => 
                if List.Contains(dp, x[WHStatus])
                then Table.Distinct(x[all], dist)
                else x[all]
        ),
    z = Table.Combine(mod_tbl[mod])
    
in
    z

I was more thinking along the lines of 

 

- cutting the table into two parts based on X

- deduplicate the affected part

- append them back together

@lbendlin no doubt this works either. I wonder what is faster on hudge dataset.

There's always Table.Buffer 🙂

lbendlin
Super User
Super User

Please provide sample data (with sensitive information removed) that covers your issue or question completely, in a usable format (not as a screenshot). Leave out anything not related to the issue.
If you are unsure how to do that please refer to https://community.fabric.microsoft.com/t5/Community-Blog/How-to-provide-sample-data-in-the-Power-BI-...
Please show the expected outcome based on the sample data you provided.

If you want to get answers faster please refer to https://community.fabric.microsoft.com/t5/Desktop/How-to-Get-Your-Question-Answered-Quickly/m-p/1447...

Helpful resources

Announcements
LearnSurvey

Fabric certifications survey

Certification feedback opportunity for the community.

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors
Top Kudoed Authors