Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.

Reply
Anonymous
Not applicable

Processing of huge file in power bi

Hello 

I have muliple csv in sharepoint ( about 100 files).  I want to combine all cvs in one file and generate report in powerbi. 

In all their are about 1000 million rows

How should I process such huge dataset in power bi with low latency?  

Your suggestions/ comments would be valuable.
Thank you in advance.

1 ACCEPTED SOLUTION
selimovd
Super User
Super User

Hello @Anonymous ,

 

so, we're talking about 1 billion rows in 100 files, right?

From my experience that seems to be too much to load it directly to Power BI.

 

I personally would store the csv in a BLOB storage and try Azure Synapse:

Instead of ETL, design ELT - Azure Synapse Analytics | Microsoft Docs

 

Or if you want to stay in the SQL world, I would store the CSV in a BLOB storage and then load it directly or with data factory to an Azure SQL Server:

Bulk access to data in Azure Blob storage - SQL Server | Microsoft Docs

Loading a csv file into Azure SQL Database from Azure Storage | by Mayank Srivastava | Towards Data Science

 

If you need any help please let me know.
If I answered your question I would be happy if you could mark my post as a solution ✔️ and give it a thumbs up 👍
 
Best regards
Denis
 

View solution in original post

5 REPLIES 5
selimovd
Super User
Super User

Hello @Anonymous ,

 

so, we're talking about 1 billion rows in 100 files, right?

From my experience that seems to be too much to load it directly to Power BI.

 

I personally would store the csv in a BLOB storage and try Azure Synapse:

Instead of ETL, design ELT - Azure Synapse Analytics | Microsoft Docs

 

Or if you want to stay in the SQL world, I would store the CSV in a BLOB storage and then load it directly or with data factory to an Azure SQL Server:

Bulk access to data in Azure Blob storage - SQL Server | Microsoft Docs

Loading a csv file into Azure SQL Database from Azure Storage | by Mayank Srivastava | Towards Data Science

 

If you need any help please let me know.
If I answered your question I would be happy if you could mark my post as a solution ✔️ and give it a thumbs up 👍
 
Best regards
Denis
 
Anonymous
Not applicable

Hi @selimovd 

Thank you for a quick reply.
Can I use dataflow? If yes then how should I be using it


Anonymous
Not applicable

Hello 

I have muliple csv in sharepoint ( about 100 files).  I want to combine all cvs in one file and generate report in powerbi. 

In all their are about 1000 million rows. 

How should I process such huge dataset in power bi with low latency?  

Your suggestions/ comments would be valuable.
Thank you in advance.

Hey @Anonymous ,

 

I guess that was a double post:

https://community.powerbi.com/t5/Desktop/Processing-of-huge-file-in-power-bi/m-p/1846991

 

Best regards

Denis

Hey @Anonymous ,

 

you can definitely give it a try.

Be aware there are some limitations. Check the documentation:

Dataflows Limitations, restrictions and supported connectors and features - Power BI | Microsoft Docs

 

For example:

"Shared dataflows have a refresh limit of two hours per entity, and three per dataflow. So if you have two entities, and each takes two hours, you shouldn't put them in the same dataflow".

 

I think data flows have some throttling and from my experience Sharepoint tends to be kind of slow. But give it a try. In the best case it works, in the worst case it doesn't.

 

If you need any help please let me know.
If I answered your question I would be happy if you could mark my post as a solution ✔️ and give it a thumbs up 👍
 
Best regards
Denis
 

Helpful resources

Announcements
LearnSurvey

Fabric certifications survey

Certification feedback opportunity for the community.

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.