Earn the coveted Fabric Analytics Engineer certification. 100% off your exam for a limited time only!
Hi there
I have some really bad quality data I need to use unfortunately and one of the issues is data in parenthesis within this field. I need some of the data in parenthesis, and some I do not. I want to see what the data looks like if remove data between 1 and 2 character length within parenthesis e.g. (EU) and keep anything greater than 2 characters within parameters e.g. (SGP).
Is this possible?
Solved! Go to Solution.
Hi @Anonymous ,
Try this custom column:
let _text = Text.Split([Current Column], " ") in
Text.Combine(List.RemoveNulls(List.Transform(_text, each
if Text.StartsWith(_, "(") and Text.EndsWith(_, ")") then
if Text.Length(_) = 3 or Text.Length(_) = 4 then null
else _ else _
)), " ")
you could try in PowerQuery, by going to 'Transform data' and then edit that column. there is an option to extract text before and after delimiters.
hope it helps.
It doesn't allow you to specify the number of characters between the delimiters unless I am doing something wrong
I want to say: (??) = remove (???) = keep.
Here are some examples of the data and my desired result
Current Column | Desired Column |
MODA KTB (ME) | MODA KTB |
Petroliam Berhad (PETRONAT) | Petroliam Berhad (PETRONAT) |
MODA Kuwait | MODA Kuwait |
Hotel & Property Development (Kenal) Ltd | Hotel & Property Development (Kenal) Ltd |
Everhouse LLP (EU) | Everhouse LLP |
Hi @Anonymous ,
Try this custom column:
let _text = Text.Split([Current Column], " ") in
Text.Combine(List.RemoveNulls(List.Transform(_text, each
if Text.StartsWith(_, "(") and Text.EndsWith(_, ")") then
if Text.Length(_) = 3 or Text.Length(_) = 4 then null
else _ else _
)), " ")
User | Count |
---|---|
139 | |
113 | |
103 | |
73 | |
63 |
User | Count |
---|---|
136 | |
125 | |
107 | |
70 | |
61 |