Hi guys,
I have a list of names from two different sources, but they are not exactly the same. Unfortunately, there is no consistent way to split each name on a delimiter and keep only the first name and surname, because those parts are inconsistent as well.
What are the possibilities in either DAX or M to clean data like this?
Example:
Source 1:
Daniel L Jones
Meredith Anne Summer
Chloe Lemaire-Trudeau
Martin van Hubert
Source 2:
Daniel Jones
Anne Summer
Chloe Lemaire
Martin van der Hubert
Hi @Anonymous ,
I studied the examples you provided and can't find an inherent rule for cleaning them. As a workaround, you could use other fields (like unique IDs) to match them.
I managed to fix it by doing the following:
- A fuzzy merge (join) of the names from the two sources
- Creating IDs (an index) for the matched names
- Joining these to the fact table
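For anyone landing here later, the first two steps above could look roughly like this in M. This is only a sketch under assumptions, not the exact query I used: the inline tables just reuse the example names from the question, the column names (Name1, Name2, Matches, NameID) are made up for illustration, and the Threshold of 0.5 is a guess that you would need to tune against your own data.

```powerquery
let
    // Example rows copied from the question; in a real model these would be
    // references to the two existing source queries.
    Source1 = Table.FromColumns(
        {{"Daniel L Jones", "Meredith Anne Summer", "Chloe Lemaire-Trudeau", "Martin van Hubert"}},
        {"Name1"}
    ),
    Source2 = Table.FromColumns(
        {{"Daniel Jones", "Anne Summer", "Chloe Lemaire", "Martin van der Hubert"}},
        {"Name2"}
    ),
    // Fuzzy left outer join on the name columns.
    // Threshold is an assumed value (the default is 0.8); lower values match more loosely.
    // NumberOfMatches = 1 keeps at most one candidate per Source1 row.
    Merged = Table.FuzzyNestedJoin(
        Source1, {"Name1"},
        Source2, {"Name2"},
        "Matches",
        JoinKind.LeftOuter,
        [IgnoreCase = true, IgnoreSpace = true, Threshold = 0.5, NumberOfMatches = 1]
    ),
    // Bring the matched Source2 name next to the Source1 name (null when nothing matched).
    Expanded = Table.ExpandTableColumn(Merged, "Matches", {"Name2"}),
    // Surrogate ID for each matched pair, to be joined to the fact table later.
    WithId = Table.AddIndexColumn(Expanded, "NameID", 1, 1, Int64.Type)
in
    WithId
```

The resulting table works as a mapping: merge it to the fact table on whichever name column the fact table carries and keep NameID as the key. It is worth reviewing the matched pairs by eye before relying on them, since a loose threshold can pair the wrong names.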