Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.

Reply
ryland91
New Member

remove string patterns within a column

Hello,

 

I have a text column in PowerQuery that contains strings with the pattern like below. I want to replace all such cases 

 

 

 

s:xxx: where xxx can be any number from 0 to 999

Examples s:1:, s:34:, s:677:

 

 

 

1 ACCEPTED SOLUTION
HotChilli
Super User
Super User

The simplest thing to do, if there are not too many different codes, would be to right-click and Replace Values.  Do that multiple times for each different code. It's simple but maybe not appropriate

--

There is no built-in regular expression functions in power query but you can make web calls to do this from within PQ.  It's not going to be quick but it does open up the whole power of regex.  Here's an example thread

https://community.fabric.microsoft.com/t5/Desktop/Regex-for-subtracting-texts-by-Power-Query-Dax/td-...

 

--

Alternatively, you can define a list of all possible codes to search for, then use that list in a function to clean the column:

1 Make a list {1..999}

2 Transform it to codes : List.Transform(Source, each Text.Combine({"s:", Text.From(_), ":"}))

 

then add a column to the original query using :List.Accumulate( SearchList, {[Column1], ""}, (old, current) => let CreateNew = Text.Replace(old{0}, current,""), CreateSubstringsRemoved = try Text.Combine (List.Difference(Text.ToList (old{0}), Text.ToList(CreateNew) ) ) otherwise "" in {CreateNew, if CreateSubstringsRemoved <> "" and old{1} <> "" then old{1} & ", " & CreateSubstringsRemoved else if old{1} <> "" and CreateSubstringsRemoved = "" then old{1} else CreateSubstringsRemoved } ) {0}I didn't write that originally, it's by a contributor who used to be on the forum

Jimmy801

 

 

View solution in original post

5 REPLIES 5
HotChilli
Super User
Super User

Very nice. I had a look for something to feed a list into.  Couldn't find it!

HotChilli
Super User
Super User

The simplest thing to do, if there are not too many different codes, would be to right-click and Replace Values.  Do that multiple times for each different code. It's simple but maybe not appropriate

--

There is no built-in regular expression functions in power query but you can make web calls to do this from within PQ.  It's not going to be quick but it does open up the whole power of regex.  Here's an example thread

https://community.fabric.microsoft.com/t5/Desktop/Regex-for-subtracting-texts-by-Power-Query-Dax/td-...

 

--

Alternatively, you can define a list of all possible codes to search for, then use that list in a function to clean the column:

1 Make a list {1..999}

2 Transform it to codes : List.Transform(Source, each Text.Combine({"s:", Text.From(_), ":"}))

 

then add a column to the original query using :List.Accumulate( SearchList, {[Column1], ""}, (old, current) => let CreateNew = Text.Replace(old{0}, current,""), CreateSubstringsRemoved = try Text.Combine (List.Difference(Text.ToList (old{0}), Text.ToList(CreateNew) ) ) otherwise "" in {CreateNew, if CreateSubstringsRemoved <> "" and old{1} <> "" then old{1} & ", " & CreateSubstringsRemoved else if old{1} <> "" and CreateSubstringsRemoved = "" then old{1} else CreateSubstringsRemoved } ) {0}I didn't write that originally, it's by a contributor who used to be on the forum

Jimmy801

 

 

ryland91
New Member

Thanks for the answer. I've attached an example below. The idea is to simply replace all instances of s:xxx: with "".

 

Data:

Column name:ABC

Example of Column Value:

This is a sample text with s:1: embedded. Another example: s:34: can be found in this text as well. s:677: appears at the end of this string. No pattern in this one. s:123: is just another instance.

 

Expected result after transformation:

Column name:ABC

Example of Column Value after transformation:

This is a sample text with embedded. Another example: can be found in this text as well.  appears at the end of this string. No pattern in this one. is just another instance.

Hi, @ryland91 change "txt" to your column name (as well as use correct your_table reference).

let
    Source = your_table,
    numbers = {0..999},
    delimiters = List.Buffer(List.Transform(numbers, (x) => "s:" & Text.From(x) & ":")),
    z = Table.TransformColumns(Source, {"txt", (x) => Text.Combine(Splitter.SplitTextByAnyDelimiter(delimiters)(x), "")})
in
    z
HotChilli
Super User
Super User

More detail please.  Show your data and the desired result please

Helpful resources

Announcements
PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

Top Solution Authors
Top Kudoed Authors