Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.
Hi team, I'm hoping someon might be able to help
I have a column of data that is a mixture of English words and non-English text, and I need to clean out the non-English text with a calculated column, leaving just the English words separated by spaces. Here's an example of my data:
Example data:
div dir classmsrtestatefielddivdiv dataspcanvascontrol dataspcanvasdataversion1.0 dataspcontroldata123quotpositionquot58123quotzoneIndexqu
Here's an example of the output I'd like:
div dir class estate field div div data canvas control data canvas data version data control data position zone index
I am proposing to use this data source for my English words:
https://www.kaggle.com/datasets/yk1598/479k-english-words?resource=download
I'd be loading this data as a text file into my report and into a table called 'English Words'.
Any help would be greatly appreciated 🙂
User | Count |
---|---|
102 | |
90 | |
80 | |
71 | |
69 |
User | Count |
---|---|
114 | |
100 | |
97 | |
72 | |
68 |