Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
samroth
Employee
Employee

Create column based off substrings from different table

I have two tables.


The first is my main table with a bunch of information on items. One of the columns is an "Area Path". The following is a made-up example with similar information to what can be found in my actual table.

Produce\Fruit\Apples

Produce\Fruit\Apples\Fuji

Produce\Fruit\Apples\Gala

Produce\Fruit\Apples\Gala

Produce\Fruit\Apples\Granny Smith

Produce\Fruit\Grapes\Red

Produce\Fruit\Grapes\Green

Produce\Fruit\Pears

Produce\Vegetables\Squash\Zucchini\Green

Produce\Vegetables\Squash\Zucchini\Yellow

Produce\Vegetables\Squash\Acorn

Produce\Vegetables\Squash\Butternut

Produce\Vegetables\Broccoli

Produce\Vegetables\Cauliflower

Produce\Vegetables

Produce\Mushroom\Cremini

 

My second table has one column "Root Paths". These are mostly smaller substrings of what can be found in "Area Path". Everything in this table will be unique and this table is static. Here are examples:

Produce\Fruit\Apples

Produce\Fruit\Apples\Fuji

Produce\Fruit\Grapes

Produce\Fruit

Produce\Vegetables\Squash\Zucchini

Produce\Vegetables\Squash\Acorn

Produce\Vegetables\Squash

Produce\Vegetables\Broccoli

Produce\Vegetables

Produce\Mushroom\Button

 

I want to add a column to the first table with the corresponding Root Paths for each Area Path. The corresponding Root Path is going to be the most specific Root Path that is a substring of the Area Path. So for example, for Area Path: Produce\Fruit\Apples\Fuji, the Root Path would be Produce\Fruit\Apples\Fuji. For Area Path, Produce\Fruit\Apples\Gala, it would just be Produce\Fruit\Apples. If there isn't a match, then the Root Path would be null.

 

So for the first table this is what I want:

Area Path

Root Path

Produce\Fruit\Apples

Produce\Fruit\Apples

Produce\Fruit\Apples\Fuji

Produce\Fruit\Apples\Fuji

Produce\Fruit\Apples\Gala

Produce\Fruit\Apples

Produce\Fruit\Apples\Gala

Produce\Fruit\Apples

Produce\Fruit\Apples\Granny Smith

Produce\Fruit\Apples

Produce\Fruit\Grapes\Red

Produce\Fruit\Grapes

Produce\Fruit\Grapes\Green

Produce\Fruit\Grapes

Produce\Fruit\Pears

Produce\Fruit

Produce\Vegetables\Squash\Zucchini\Green

Produce\Vegetables\Squash\Zucchini

Produce\Vegetables\Squash\Zucchini\Yellow

Produce\Vegetables\Squash\Zucchini

Produce\Vegetables\Squash\Acorn

Produce\Vegetables\Squash\Acorn

Produce\Vegetables\Squash\Butternut

Produce\Vegetables\Squash

Produce\Vegetables\Broccoli

Produce\Vegetables\Broccoli

Produce\Vegetables\Cauliflower

Produce\Vegetables

Produce\Vegetables

Produce\Vegetables

Produce\Mushroom\Cremini

null

 

 

I tried using Merge query with Left Outer (all from first, matching from second) with fuzzy matching. I can't get a value for fuzzy matching that gives me accurate results. It either misses correct matches or falsely matches.

 

I also tried to create a custom column by do this:

Table.SelectRows(#"Root Paths", (T) => Text.Contains([Area Path], T[Root Path]))

 

This gave me WAY too many null values and missed so many matches for a reason that I cannot determine.

 

Please let me know what I should be doing.

4 REPLIES 4
Ashish_Mathur
Super User
Super User

Hi,

This M code seems to work

let
    Source = Table.NestedJoin(Area_path, {"Text"}, Root_path, {"Text"}, "Root_path", JoinKind.LeftOuter),
    #"Expanded Root_path" = Table.ExpandTableColumn(Source, "Root_path", {"Text"}, {"Text.1"}),
    #"Inserted Text Before Delimiter" = Table.AddColumn(#"Expanded Root_path", "Text Before Delimiter", each Text.BeforeDelimiter([Text], "\", {0, RelativePosition.FromEnd}), type text),
    #"Merged Queries" = Table.NestedJoin(#"Inserted Text Before Delimiter", {"Text Before Delimiter"}, Root_path, {"Text"}, "Root_path", JoinKind.LeftOuter),
    #"Expanded Root_path1" = Table.ExpandTableColumn(#"Merged Queries", "Root_path", {"Text"}, {"Text.2"}),
    #"Added Custom" = Table.AddColumn(#"Expanded Root_path1", "Custom", each if [Text.1]=null then [Text.2] else [Text.1]),
    #"Removed Columns" = Table.RemoveColumns(#"Added Custom",{"Text.1", "Text Before Delimiter", "Text.2"}),
    #"Sorted Rows" = Table.Sort(#"Removed Columns",{{"Text", Order.Ascending}})
in
    #"Sorted Rows"

Hope this helps.

Untitled.png


Regards,
Ashish Mathur
http://www.ashishmathur.com
https://www.linkedin.com/in/excelenthusiasts/
Greg_Deckler
Super User
Super User

@samroth - I did it this way (below). PBIX is attached below sig.

Column = 
    VAR __Table = 
        ADDCOLUMNS(
            ALL('Table (4)'[Root Path]),
            "Depth",LEN([Root Path]) - LEN(SUBSTITUTE([Root Path],"\","")),
            "Match",FIND([Root Path],[Area Path],,0)
        )
    VAR __BestDepth = MAXX(FILTER(__Table,[Match]>0),[Depth])
RETURN
    MAXX(FILTER(__Table,[Match]>0 && [Depth] = __BestDepth),[Root Path])

 


@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
The Definitive Guide to Power Query (M)

DAX is easy, CALCULATE makes DAX hard...

Thanks Greg, where I am supposed to put this code to run in the pbix?

Hi , @samroth 

As shown in the attached file provided by @Greg_Deckler ,you just need to create a calculated column.

50.png

Best Regards,
Community Support Team _ Eason
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.