MiltonKeynes
New Member

Replacing missing values based on two columns

Hello everyone!

 

I'm having trouble preparing my data using Power Query in Power BI. My dataset has the following columns: CustomerID, Street Name, Zip Code and Location Code. Location Code has some missing values. I can fill in some of them by merging another query (customer info), but I am still left with some missing values. In Excel I solved the problem by looking up the street name and zip code and returning the corresponding location code (the location code is determined by this pair).

 

However, I am a literal noob in Power Query M and, even though I have given this a lot of thought, I have not been able to solve the problem. The number of missing values is not that significant, but I would prefer not to lose data in this case.

 

My data looks something like this:

CustomerID | Street name | Zip Code | Location code
123 | Street 1 | 123 | 5000
456 | Street 2 | 456 | 5002
789 | Street 1 | 123 |
987 | Street 3 | 987 | 5003
654 | Street 3 | 987 |

 

Obviously, the end result should look like this:

CustomerID | Street name | Zip Code | Location code
123 | Street 1 | 123 | 5000
456 | Street 2 | 456 | 5002
789 | Street 1 | 123 | 5000
987 | Street 3 | 987 | 5003
654 | Street 3 | 987 | 5003

 

I would appreciate any help and suggestions. I am not even sure if this is possible or practical in the first place.

1 ACCEPTED SOLUTION
HotChilli
Super User

In Power Query:

Duplicate the query

Remove the CustomerID and Street Name columns.

Filter out the null entries in Location Code column (using the dropdown in the column heading)

This leaves you a kind of master table which matches Zip codes with Locations.

 

Using 'Merge Queries->Merge as New'

Merge the original and duplicated queries using an Inner Join on the Zip Code.

This gets you a table like this:

[Screenshot: MergeZips.PNG]

Expand the column with 'Table' in each row (using the icon in the column heading)

You are really only interested in the new Location Code column. It should be fully populated.

Tidy up your data (by removing the old Location Code column).
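
For anyone who prefers the Advanced Editor, the steps above can be sketched in M roughly as follows. This is a minimal sketch, not the exact code the UI generates. It assumes the column names from the question (CustomerID, Street Name, Zip Code, Location Code) and uses a small inline sample table in place of the real query. It also makes three changes to the steps above, which are assumptions rather than part of the accepted answer: it matches on both Street Name and Zip Code (the pair the question says determines the code), it removes duplicates from the lookup table so the merge cannot multiply rows, and it uses a Left Outer join so rows with no match are kept with a null instead of being dropped. The step names and the helper column names ("Looked Up Code", "Location Code Filled") are made up for illustration.

```powerquery
let
    // Sample data copied from the question; replace with the real query.
    Source = #table(
        {"CustomerID", "Street Name", "Zip Code", "Location Code"},
        {
            {123, "Street 1", 123, 5000},
            {456, "Street 2", 456, 5002},
            {789, "Street 1", 123, null},
            {987, "Street 3", 987, 5003},
            {654, "Street 3", 987, null}
        }
    ),

    // Lookup ("master") table: rows that already have a Location Code,
    // reduced to one row per Street Name / Zip Code pair.
    Known = Table.SelectRows(Source, each [Location Code] <> null),
    KeyAndCode = Table.SelectColumns(Known, {"Street Name", "Zip Code", "Location Code"}),
    Lookup = Table.Distinct(KeyAndCode, {"Street Name", "Zip Code"}),
    LookupRenamed = Table.RenameColumns(Lookup, {{"Location Code", "Looked Up Code"}}),

    // Merge the original rows with the lookup on both key columns.
    // Left Outer keeps every original row, matched or not.
    Merged = Table.NestedJoin(
        Source, {"Street Name", "Zip Code"},
        LookupRenamed, {"Street Name", "Zip Code"},
        "Lookup", JoinKind.LeftOuter
    ),
    Expanded = Table.ExpandTableColumn(Merged, "Lookup", {"Looked Up Code"}),

    // Keep the existing code where present, otherwise use the looked-up one.
    Filled = Table.AddColumn(
        Expanded, "Location Code Filled",
        each if [Location Code] <> null then [Location Code] else [Looked Up Code]
    ),

    // Tidy up: drop the old and helper columns, restore the original name.
    Removed = Table.RemoveColumns(Filled, {"Location Code", "Looked Up Code"}),
    Result = Table.RenameColumns(Removed, {{"Location Code Filled", "Location Code"}})
in
    Result
```

With the sample rows, customers 789 and 654 should pick up 5000 and 5003 from the rows that share their street and zip code. If the same pair ever appears with two different codes in the real data, Table.Distinct keeps only one of them, so it is worth checking the lookup table for conflicts first.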


3 REPLIES

I got it working, thank you very much!

 

Apologies for the elementary question.

Don't worry. It's not an elementary question.
