Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.
Hi all,
Seeking assistance on a data consistency issue. We currently extract a department name field from a system that carries with it some sort of invisible formatting (possibly a HTML format??). The data is copied from the system and pasted into Excel and is subsequently pulled into PowerBI.
When we type what appears to be the exact same name into the sheet, Excel/PowerQuery/PowerBI all believe they are different content even though they appear the same. If you were to filter on a column with the extracted and manually typed names, you'll see this for example:
For a range of reasons, this repeatedly comes up as an issue when we pull the data through or use the field for reporting. Is there anyway to remove the formatting or convert the data to a text format so the system can read the data as consistent? Constantly trying to resolve 'invisible errors' is becoming unmanageable.
A few notes:
Cheers
Kirsty
Solved! Go to Solution.
Thanks Yuliana @v-yulgu-msft! Naturally I can't manually replace the apparent spaces but knowing they were "non-printable characters", helped with searching for a solution.
For others with this problem: Lots of research indicated that a CLEAN (Dax/Excel) or Text.Clean (M Code) function would cleans the data by removing the non-printable character. However, I found this did not work for me. Feel free to try it for your problem in PowerQuery by selecting the relevant column --> Transform menu --> Format menu --> Clean.
For me, I had to replace the non'printable character specifically by using the code #(00A0) as what needed to be 'found' and 'repalced'. Code as follows:
= Table.ReplaceValue(#"Prior step", "#(00A0)"," ",Replacer.ReplaceText,{"Column Being Validated"})
Hi @KCinMelbourne,
You could upload the sample Excel file to OneDrive, and paste the shared link here so that I can download.
Regards,
Yuliana Gu
Thanks @v-yulgu-msft,
I thought I was just having a dumb moment not being able to find the attachment. I've popped on google drive, it should be public but let me know if you can't access it.
Cheers
KC
Hi @KCinMelbourne,
I tested with your sample file there might existing some non-printable characters in column values. After removing all space, the duplicates disappeared.
Then, I typed space manually again, the problem didn't persist.
Best regards,
Yuliana Gu
Thanks Yuliana @v-yulgu-msft! Naturally I can't manually replace the apparent spaces but knowing they were "non-printable characters", helped with searching for a solution.
For others with this problem: Lots of research indicated that a CLEAN (Dax/Excel) or Text.Clean (M Code) function would cleans the data by removing the non-printable character. However, I found this did not work for me. Feel free to try it for your problem in PowerQuery by selecting the relevant column --> Transform menu --> Format menu --> Clean.
For me, I had to replace the non'printable character specifically by using the code #(00A0) as what needed to be 'found' and 'repalced'. Code as follows:
= Table.ReplaceValue(#"Prior step", "#(00A0)"," ",Replacer.ReplaceText,{"Column Being Validated"})
Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City
Check out the April 2024 Power BI update to learn about new features.
User | Count |
---|---|
109 | |
99 | |
77 | |
66 | |
54 |
User | Count |
---|---|
144 | |
104 | |
102 | |
87 | |
64 |