MiltonKeynes Frequent Visitor

Replacing missing values based on two columns

Hello everyone!

 

I'm having trouble preparing my data using Power Query in Power BI. I have a dataset that has the following columns: CustomerID, Street Name, Zip Code and Location Code. Location Code has some missing values, but I am able to fill in some of those by merging another query (customer info). Still, I am left with some missing values. In Excel I solved the problem by looking up street name and zip code, and returning corresponding location code (location code is based on this pair).

 

However, I am a literal noob in Power Query M and, even though I have given this a lot of thought, I have not been able to solve the problem. The number of missing values is not that significant, but I would prefer not to lose data in this case.

 

My data looks something like this:

CustomerID  Street name  Zip Code  Location code
123         Street 1     123       5000
456         Street 2     456       5002
789         Street 1     123       (blank)
987         Street 3     987       5003
654         Street 3     987       (blank)

 

Obviously, the end result should look like this:

CustomerID  Street name  Zip Code  Location code
123         Street 1     123       5000
456         Street 2     456       5002
789         Street 1     123       5000
987         Street 3     987       5003
654         Street 3     987       5003

 

I would appreciate any help and suggestions. I am not even sure if this is possible or practical in the first place.

Accepted Solutions
Super User I

Re: Replacing missing values based on two columns

In Power Query:

Duplicate the query

Remove the CustomerID and Street Name columns.

Filter out the null entries in the Location Code column (using the dropdown in the column heading).

This leaves you with a kind of master table that matches Zip Codes with Location Codes.

 

Using 'Merge Queries -> Merge Queries as New', merge the original and duplicated queries with an Inner Join on the Zip Code column.

This gets you a table like this:

[Image: MergeZips.PNG]

Expand the column that shows 'Table' in each row (using the icon in the column heading).

You are really only interested in the new Location Code column. It should be fully populated.

Tidy up your data (by removing the old Location Code column).
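
For anyone who prefers the Advanced Editor, the steps above can be sketched in M roughly as follows. This is only a sketch under assumptions: the inline `#table` stands in for the real source, the step names and the temporary column names `Lookup` and `Filled Location Code` are made up for illustration, and the `Table.Distinct` call is an addition that the steps above do not mention (without it, a Zip Code that appears several times in the master table would multiply rows in the merge).

```powerquery
let
    // Hypothetical sample data mirroring the table in the question
    Source = #table(
        {"CustomerID", "Street Name", "Zip Code", "Location Code"},
        {
            {123, "Street 1", 123, 5000},
            {456, "Street 2", 456, 5002},
            {789, "Street 1", 123, null},
            {987, "Street 3", 987, 5003},
            {654, "Street 3", 987, null}
        }
    ),

    // Key columns for the merge. The steps above use Zip Code alone;
    // {"Street Name", "Zip Code"} would match on the pair instead.
    Keys = {"Zip Code"},

    // Master table: keep the keys and Location Code, drop rows with a null code,
    // then de-duplicate (an addition, see the note above)
    KeyAndCode = Table.SelectColumns(Source, Keys & {"Location Code"}),
    NonNull = Table.SelectRows(KeyAndCode, each [Location Code] <> null),
    Lookup = Table.Distinct(NonNull),

    // Inner join as described above. JoinKind.LeftOuter would instead keep
    // rows whose key has no known Location Code.
    Merged = Table.NestedJoin(Source, Keys, Lookup, Keys, "Lookup", JoinKind.Inner),

    // Expand only the looked-up code, under a temporary name to avoid a clash
    Expanded = Table.ExpandTableColumn(
        Merged, "Lookup", {"Location Code"}, {"Filled Location Code"}
    ),

    // Tidy up: drop the old column and give the new one its name
    Removed = Table.RemoveColumns(Expanded, {"Location Code"}),
    Result = Table.RenameColumns(Removed, {{"Filled Location Code", "Location Code"}})
in
    Result
```

Keeping the key columns in one `Keys` list means that switching from Zip Code alone to the street name and zip code pair is a one-line change. Note that a merge does not necessarily preserve the original row order, so sort afterwards if the order matters.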


MiltonKeynes Frequent Visitor

Re: Replacing missing values based on two columns

I got it working, thank you very much!

 

Apologies for the elementary question.

Super User I

Re: Replacing missing values based on two columns

Don't worry. It's not an elementary question.
