Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
V24
Helper I
Helper I

Text extraction

I have 2 types of inputs in my column:

1st: <div class=“External”><div style=“font-family”;”> Required text to be extracted</div></Div>

2nd: <div class =“External”> Required text to be extracted</div>

How do I extract the required text when the column has the above kind of rows as inputs

8 REPLIES 8
dufoq3
Super User
Super User

@Hi @V24, possible via UI

dufoq3_0-1706708559420.png

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wiik1MDBOTsksU0jOSSwutn3UMMe1oiS1KC8x51HDXLBsKkJNcUllTipITVp+XoluWmJuZk4lUJk1XKlCUGphaWZRaopCSWpFiUJJvkJSqgKQVZSYXJKaAjFIH2gSssH6LjC+UqwOhosUsDqJNHuUYmMB", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Column1 = _t]),
    #"Extracted Text Before Delimiter" = Table.TransformColumns(Source, {{"Column1", each Text.BeforeDelimiter(_, "</div>"), type text}}),
    #"Extracted Text After Delimiter" = Table.TransformColumns(#"Extracted Text Before Delimiter", {{"Column1", each Text.AfterDelimiter(_, ">", {0, RelativePosition.FromEnd}), type text}}),
    #"Trimmed Text" = Table.TransformColumns(#"Extracted Text After Delimiter",{{"Column1", Text.Trim, type text}})
in
    #"Trimmed Text"

 


Note: Check this link to learn how to use my query.
Check this link if you don't know how to provide sample data.

Hi thanks a ton for your solution. I am new to powerBI. Just a quick question: I did not understand the source part. What should I replace here? And can I directly use this code if so can I just paste it in M code and is there any other thing I need to change apart from source and column?

@V24,

save your query as MyData for example. Create new blank query, open it in Advanced editor. Delete whole code and paste there the one I created. Edit 1st step Source = and delete selected part (see picture below). Replace whole this part with MyData so Source = MyData, (don't forget comma at the end as you see in the picture)

dufoq3_0-1706710426617.png

 


Note: Check this link to learn how to use my query.
Check this link if you don't know how to provide sample data.

slorin
Super User
Super User

Add a new step with the above formulas by replacing Your_Source and Your_Column with previous step and column name

Stéphane

slorin
Super User
Super User

You're on the Power Query forum, not the DAX forum

Stéphane

Sorry, that is not what I meant. I want to edit it in the power query itself. Is there any M code or something which can help me get the desired output?

thanks Stéphane

slorin
Super User
Super User

Hi,

 

= Table.TransformColumns(Your_Source, {{"Your_Column", 
each Text.BetweenDelimiters(_, ">", "<", {Text.Length(Text.Select(_,"<"))/2-1, RelativePosition.FromStart}), type text}})

or

= Table.TransformColumns(Your_Source, {{"Your_Column",
each Text.AfterDelimiter(Text.BeforeDelimiter(_, "</"), ">", {0, RelativePosition.FromEnd}), type text}})

 

Stéphane 

Hi, tried using this syntax to create a new column. It threw an error : A cyclic reference was encountered during evaluation.

Also to be clear: I have both the above types in my column. Want to extract the specific text from both the types using a single dax command. Thank you for your help.

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors
Top Kudoed Authors