Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
robertDTS
New Member

Data Refresh Fails, Error message claims duplicate ID value in table

I have a PBIX file where when I go to refresh my data from a SQL server the refresh eventually fails with an error message that says one of my tables contains duplicate IDs. I went to my actual SQL DB and ran a query on the table for the exact ID provided by the error message. Only one row is returned by the straight query, therefore there are no duplicate values in my table.

 

I'm thinking this is some sort of red herring and that the real issue lies either in a relationship or somewhere else in the data. Any suggestions on how to troubleshoot? I've updated PowerBI to the latest released version.

19 REPLIES 19
knammer
Regular Visitor

Hi, I was having this issue as well.  In my case, the Query is doing a 'groupby' on order numbers , and capturing the minimum dates associated with the order number.    It would refresh on desktop, but gave the duplicate value error when refreshing on the server.    By the way, it was refreshing fine on the server for months, but the error started recently.   Further investigation into the raw data, I found a few cases when the order numbers had a trailing space that shouldn't be there.   So it seems the group by function will result in 2 different rows/orders ( with and without the space), but when you look at the resulting groupby results , the trailing space is not there, so the uniqueness is lost.    Adding a 'trim' function to my query to eliminate the trailing spaces on my order numbers, before the Groupby step,  resolved my issue.   Hope this helps someone.  cheers  

gernot
Frequent Visitor

Hello!

I am having the same issue where I query data from app insights with

 

Web.Contents("https://api.applicationinsights.io....

 

Data is fetched into a tables which contain one row per day having the date as a unique column set in the model. All tables also have 1:1 relation to a date-table which is the date master table.

When refreshing with Power BI desktop, everything works as expected. When doing a scheduled (incremental) refresh in the cloud, it fails with error:

Column '<pii>Date</pii>' in Table '<pii>SomeTable</pii>' contains a duplicate value '<pii>10/1/2021</pii>' and this is not allowed for columns on the one side of a many-to-one relationship or for columns that are used as the primary key of a table.

This is 100% deterministic and it fails always on the same date (10/1/2021).

I tried to make the Date column non-unique and changed the relation to be 1:*. This made the scheduled update work but the data was wrong since I had to apply a sum() function to get 1 row per day. 10/1/2021 was x2 then.

 

One observation I made:

When setting incremental refreshs to.

refresh: 1 month

detect changes: Date column

The error is deterministic with 10/1/2021 occuring twice.

 

If I change to

refresh: 2 weeks

detect changes: Date column

The error occurs with other dates (more than 1). In fact a series of 5 days is x2.

 

I, of course, checked the result by querying directly in app insights. The data is correct with no duplictate rows.

 

Regards,

Gernot

gernot
Frequent Visitor

It turned out to be my own fault. The way, I wrote the query was inclusive on both sides so on a new refresh, it also returned the last record from the previous iteration.

GrahamMWade
New Member

I have also had this issue when I am pulling data from an API.  I assumed it was tempormentality of the API requests but I am consistently able to manually refresh or refresh in the desktop version whereas the PowerBI scheduled refresh seems to fail very regularly so I am now convinced there is a flaw in the way the cacheing of some queries works.  My primary key is a integer ID column and whilst I don't personally manage the database I know that it is the primary key of an SQL table so it is almost inconceivable that there could be a genuine duplicate.

I am using the staging table suggested by mahidharsridhar below.  Microsoft do need to look into this.

Anonymous
Not applicable

On many occasions I've also come across and been able to clear this issue via issuing an additional manual refresh but today a manual refresh wouldn't clear it. 

Finally found my issue was caused by a Upper/Lower CASE SENSITIVITY ISSUE...

I use a bridge table to create a usable relationship between two *:* tables (instead creates a 1:* in both directions) and it seems I had a repeated value (only differing in case) when merging the two sources for the bridge. Power query would properly perform the merge when I was in the "Transform Data" window but I think it became case sensitive when running a refresh of the entire file...

Weird right? Anyway; hope this helps someone out there.

Anonymous
Not applicable

Hi,

I had this issue with my SSAS Tabular, and have solved by "Processing it in Full"

I'm sure you can apply this with your PBI refresh.

Cheers,

Basia

reference:https://social.msdn.microsoft.com/Forums/sqlserver/en-US/68c219b1-c9c4-43ca-8a3c-f3c3bb4e84c9/delete...

 

Alternatively remove the Unique Key constraint within PBI (SSAS has this feature)

Anonymous
Not applicable

We have about 20 Power BI app workspaces scheduled to refresh on a regular bases... we get the false duplicate error message about 1 in every 100 refreshes. For those with the false duplicate error reported, we perform a manual on-demand refresh from the service and this solves the false reported duplicate 100% of the times. 

 

It has been confirmed as a "false" duplicate error, but must be some other error.

 

Maybe its something to do with not clearing the table completely before reloading.

 

We have also seen the same behavior with PBI desktop, and again just doing another immediate refresh the reported duplicate is gone.    Weird

Anonymous
Not applicable

This "false duplicate" error is still occurring for us. However it is increasing in frequency, we are now seeing this in more and more PBI apps, and the false error is happening about 20% of the time during scheduled refreshes. We are now seeing it reported as a problem experienced by an increased number of content producers too.  

 

We can't reproduce the false error on-demand which limits our ability to troubleshoot.  We are sharing this information here in hopes of keeping MS and the community aware of the problem.

 

We currently ignore the false error messages as it disappears with the next refresh cycle.   However it appears the symptom is much more broad, varied, and not localized to our tenant or premium capacity.

Can you give more information on "we perform a manual on-demand refresh from the service and this solves the false reported duplicate 100% of the times." 

 

I am having the same issue and need to try this, but I am not sure what to do. Thanks.

Anonymous
Not applicable

I should NOT have said it solves the problem... it just proves that there is not really a duplicate ID 100% of the time.

 

The problem is still happening for us and how at a much greater frequency.

Can confirm this is still an issue, started facing this recently. Also confirmed the source data has no duplicates at all, when this error pops up, in 99% of cases just re-doing the refresh makes it go away. Definitely a bug where the string matching routine within PBI thinks something is a dup, when in reality it isn't

 

For a workaround, please see below

https://exceleratorbi.com.au/preventing-managing-refresh-failures-in-power-bi/

Anonymous
Not applicable

I am experiencing the same issue when importing a file. The file format has not changed for months and so has relationships to other files. 

The column that returns the error only contians unique values in form of ID's, validated that using excel. For somereason powerbi claims that the column contains ,,,None,Functional which are values from another column in the data set. the columns are 9 columns appart in the data set. 

 

Error Message is below.

Column 'ID' in Table 'ReportExport' contains a duplicate value ',,,None,Functional' and this is not allowed for columns on the one side of a many-to-one relationship or for columns that are used as the primary key of a table. Table: ReportExport.Cluster URI:WABI-US-NORTH-CENTRAL-redirect.analysis.windows.netActivity ID:debdaefc-f0cb-4ae0-b698-b77851d4e999Request ID:2305575e-f6be-c22d-0bb4-549b25ac04f4Time:2019-01-25 15:59:14Z

 

I have no idea how to solve the suddenly introduced issue.

 

Anonymous
Not applicable

Hi, did you ever find out why this was happening? I encountered the same error message when I was trying to load the query to the model so I removed the column containing the duplicate values -->successfully loaded to the model-->went back to the query and deleted the step for removing the column---> successfully loaded back to the model containing the column with duplicates. 

 

I have since tested several dashboards and gone through the same step of loading the model with the product ID (which continues to contain duplicates) and the query has loaded successfully without me having to go through the additional step of removing the column and adding back in. I'm not able to understand why it happened or why I couldn't replicate the error message. I am worried that any auto-refreshes once I publish might cause the query to fail at some point.

Greg_Deckler
Super User
Super User

Can you share the exact error message that you are getting? Also, duplicates in DAX are not necessarily the same as in SQL so can you share the format of some of your ID's? DAX may be considering things to be duplicates that are not duplicates in SQL.


@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
The Definitive Guide to Power Query (M)

DAX is easy, CALCULATE makes DAX hard...

PowerBI Refresh ErrorPowerBI Refresh ErrorSQL QuerySQL QuerySQL ResultSQL Result

Hi, here are images from PowerBI and SQL.

 

There is not much DAX in this data model. There are a series of Measures to cacluate the "Latest" SWC since each row has a unique data time stamp. The SWC table has some modifications in the Query editor, some look-up columns are expanded and the date column is duplicated so that I can have the date in multiple user readable formats.

 

Try doing a Trim and Clean on that field in Power Query. If you have it in there with a trailing space, for example, or maybe a leading zero I believe the model will consider that a duplicate but your SQL will not find that.


@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
The Definitive Guide to Power Query (M)

DAX is easy, CALCULATE makes DAX hard...

No dice, which I can't say I'm suprised. We authored the application that is filling the SQL DB and while our app is not perfect, one part that is rock solid is the SWC ID. Its the unique key for that table in SQL too, so SQL will not allow duplicates (though I agree there can be situtations where something might not be seen as a dupe by SQL).

Here's what I would do. Create a new Power BI file and copy over your query for this table. Run the import/refresh. Since it is not part of a relationship, you will not get that error and then you will be able to better assess what is causing the duplicate issue. I can't imagine you are getting that error on a lark, something is causing it.


@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
The Definitive Guide to Power Query (M)

DAX is easy, CALCULATE makes DAX hard...

Yeah, that is more or less where I've gotten to. Made a copy of the PBIX and started to isolate and delete stuff to see what came of it. Looks like I found a relationship causing the issue. We'll see.

 

Thanks for replying though!

 

-R

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.

Top Solution Authors