Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
matheus_peppers
Frequent Visitor

What is the best way not to have duplicate data and to be productive when importing data

Hi guys, what's up?

 

I would like to take a doubt a little less technical, as I believe this is more like good practice.

 

I created a report where the database, was exported from Facebook Ads, where I pulled the values from the maximum period.

 

I did the data retrieval through the folder path, instead of choosing the file.

 

However, I am afraid, and even here is my doubt.

 

In this report, I pulled the values from a maximum date period, and for example, if next week, I also pull the updated report, but from the maximum period as well, instead of just this last week.

 

Do I run the risk of having duplicate data? (When I import the base into Power BI, I always Transform the Data and choose to combine the tables, but it is a doubt that sticks in my mind).

 

And another thing, instead of importing the database by folder path so I don't need to mess with the Excel file, but only download and put it there in the folder, is it not better if I choose to import by Excel file itself, instead of folder, and every day update the data in the file with the new data downloaded?

 

These are just questions I have about good practices to:

 

1) Not run the risk of having duplicate data in my final report...

 

2) And to optimize my time as much as possible, without having to manually update the data in the base spreadsheet, copying and pasting the updated data.

 

If someone can help me with this question, I will be extremely grateful.

2 REPLIES 2
Adel
Helper III
Helper III

and just another point, why are you loading the files through import or through file path, why not use use a power bi connector, set it up with a future date and you will get the data you need with no headaches or wasted time.

Greg_Deckler
Super User
Super User

@matheus_peppers So, if you are using import mode and not using incremental refresh, Power BI basically truncates the tables in the data model and reloads the data in its entirety. Thus, in your situation you should not run into duplicate rows in this circumstance. You can make absolutely certain of this by doing a "Remove duplicates" step in Power Query Editor.


@ me in replies or I'll lose your thread!!!
Instead of a Kudo, please vote for this idea
Become an expert!: Enterprise DNA
External Tools: MSHGQM
YouTube Channel!: Microsoft Hates Greg
Latest book!:
The Definitive Guide to Power Query (M)

DAX is easy, CALCULATE makes DAX hard...

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.