
type Text: Clean in-between white space

Hi. Is there a way to reduce every run of white space in a text value to a single space?

 

That is:

"mary    white     helloooo"

 

Becomes:

"mary white helloooo"

Accepted Solution

Try this:

 

 

let
    // Load the workbook and pick the table to clean.
    Source = Excel.Workbook(File.Contents("C:yourpath\Q.14 Data.xlsx"), null, true),
    Table1_Table = Source{[Item="Table1",Kind="Table"]}[Data],
    #"Changed Type" = Table.TransformColumnTypes(Table1_Table,{{"Single space everything, and trim both ends (don’t make a new column)", type text}}),
    // Transform the column in place: split on spaces, drop the empty pieces, rejoin with single spaces.
    tc = Table.TransformColumns(#"Changed Type", {"Single space everything, and trim both ends (don’t make a new column)", each Text.Combine(List.Select(Text.Split(_, " "), each _ <> ""), " ")})
in
    tc
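
For anyone who wants to try the same idea without the workbook, here is a minimal sketch that can be pasted into a blank query. The sample table, the column name Name and the helper CollapseSpaces are all made up for illustration; the null check is an extra precaution that is not part of the accepted answer.

```powerquery
let
    // Hypothetical sample data standing in for the workbook table.
    Source = #table(
        type table [Name = nullable text],
        {{"  mary    white     helloooo "}, {null}, {"single"}}
    ),
    // Hypothetical helper: split on spaces, drop empty pieces, rejoin with one space.
    // Nulls are passed through untouched (an added precaution, not in the original answer).
    CollapseSpaces = (t as nullable text) as nullable text =>
        if t = null then null
        else Text.Combine(List.Select(Text.Split(t, " "), each _ <> ""), " "),
    // Transform the existing column in place, so no new column is created.
    Result = Table.TransformColumns(Source, {{"Name", CollapseSpaces, type nullable text}})
in
    Result
```

The first row should come back as "mary white helloooo", with the null and the single-word row left unchanged.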

 

 


10 Replies

Hi @michellepace 

Paste this M code in a blank query to see the steps. The recursive function is the relevant one.

let
    Source = Table.FromRows(Json.Document(Binary.Decompress(Binary.FromText("i45Wyk0sqlQAgvKMzJJUEEMhIzUnJx8IlGJjAQ==", BinaryEncoding.Base64), Compression.Deflate)), let _t = ((type nullable text) meta [Serialized.Text = true]) in type table [Column1 = _t]),
    #"Changed Type" = Table.TransformColumnTypes(Source,{{"Column1", type text}}),
    // Replace double spaces with single spaces until the text stops getting shorter.
    fx = (input as text) =>
        let
            reduce = Text.Replace(input, "  ", " "),
            res = if Text.Length(reduce) = Text.Length(input) then input else @fx(reduce)
        in
            res,
    #"Added Custom" = Table.AddColumn(#"Changed Type", "Custom", each fx([Column1]))
in
    #"Added Custom"
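
One thing worth knowing about the recursive approach: it only collapses runs of spaces, so a single leading or trailing space survives. The sketch below runs the same logic on a literal string and then applies Text.Trim to clean the ends. The name CollapseSpaces and the sample string are made up for illustration.

```powerquery
let
    // Same idea as fx: keep replacing double spaces until nothing changes.
    CollapseSpaces = (input as text) as text =>
        let
            reduced = Text.Replace(input, "  ", " ")
        in
            if reduced = input then input else @CollapseSpaces(reduced),
    // Runs are collapsed, but one space remains at each end: " mary white helloooo "
    Collapsed = CollapseSpaces("  mary    white     helloooo "),
    // Text.Trim removes the remaining leading and trailing space.
    Result = Text.Trim(Collapsed)
in
    Result
```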

 

Please mark the question solved when done and consider giving kudos if posts are helpful.

Contact me privately for support with any larger-scale BI needs, tutoring, etc.

Cheers 


 

 

Text.Combine(List.Select(Text.Split("mary    white     helloooo"," "), each _<>""), " ")

 

 

 

I don't mean to oversell this solution, but as a side note: it also removes any blanks before the first word and after the last word.
Depending on what you are looking for, that can be an advantage or a disadvantage.
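
To make that side effect concrete, here is a small sketch that breaks the one-liner into steps. The padded sample string is made up for illustration.

```powerquery
let
    Padded = "  mary    white  ",
    // Text.Split keeps empty pieces, so leading, trailing and repeated
    // spaces all show up as "" items in the list.
    Pieces = Text.Split(Padded, " "),
    // Dropping every "" removes the edge blanks together with the inner ones.
    NonEmpty = List.Select(Pieces, each _ <> ""),
    Result = Text.Combine(NonEmpty, " ")
in
    Result
```

Result is "mary white", with no space at either end.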

 

 

 


Hi, @michellepace 

Try this:

= let str="mary    white     helloooo" in Text.Combine(Splitter.SplitTextByWhitespace()(str), " ")

If my code solves your problem, mark it as a solution 


= Splitter.SplitTextByWhitespace()(" mary white helloooo ")

 

For a string with leading or trailing white space, this function (which I only learned about thanks to @ziying35) returns a list whose first and last elements are the empty string "".

 

Why?

 

What kind of internal logic do you think is being applied?

 

 


@Rocco_sprmnt21 

The function does have the problem you describe, and I'm not sure about its internal logic. The scenarios I've used it in before had no leading or trailing spaces.

 

= let str=" mary white helloooo " in Text.Trim(Text.Combine(Splitter.SplitTextByWhitespace()(str), " "))


Thank you all for your replies. In truth, I am struggling to use any of the solutions. Looking at my initial post, I did not explain my requirement sufficiently. If you look at the column name below, it is exactly what I need to do. Does anyone have the line of M-Code I can paste into the Advanced Editor to achieve this? I am happy for Format>Trim to be used as the first step if it simplifies the solution.

 

.pbix and data here: clickme

 

s.png

Many thanks once again.


@Rocco_sprmnt21 I'm not sure, but maybe Splitter.SplitTextByWhitespace() treats several spaces in a row ("      ") as a single "whitespace"... that would explain why there is only one empty element at the beginning and at the end of the list.
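
Whatever the splitter does internally at the edges, one way to sidestep the question is to drop the empty items before recombining. This is only a sketch: the sample string is made up, and it assumes the splitter returns either words or empty strings, as reported in this thread.

```powerquery
let
    str = "   mary    white   helloooo   ",
    // Split on whitespace; per this thread, the list may start and end with "".
    Parts = Splitter.SplitTextByWhitespace()(str),
    // Remove any empty items so the edges cannot contribute extra spaces.
    NonEmpty = List.Select(Parts, each _ <> ""),
    Result = Text.Combine(NonEmpty, " ")
in
    Result
```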


Try writing out, for example in pseudo-code (simulating a low-level language), the steps that would produce that result.
Then we can see where leading and trailing spaces end up being handled differently from the spaces inside the text string.

 

 

 

PS

An empty element "" is different from a blank " ".
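
A quick way to check the difference in a blank query (the field names in the record are made up for illustration):

```powerquery
let
    Empty = "",
    Blank = " ",
    // The empty string has length 0, a blank has length 1, and the two are not equal.
    Result = [
        EmptyLength = Text.Length(Empty),
        BlankLength = Text.Length(Blank),
        Same = (Empty = Blank)
    ]
in
    Result
```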
