Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
Anonymous
Not applicable

Processing of huge file in power bi

Hello 

I have muliple csv in sharepoint ( about 100 files).  I want to combine all cvs in one file and generate report in powerbi. 

In all their are about 1000 million rows

How should I process such huge dataset in power bi with low latency?  

Your suggestions/ comments would be valuable.
Thank you in advance.

1 ACCEPTED SOLUTION
selimovd
Super User
Super User

Hello @Anonymous ,

 

so, we're talking about 1 billion rows in 100 files, right?

From my experience that seems to be too much to load it directly to Power BI.

 

I personally would store the csv in a BLOB storage and try Azure Synapse:

Instead of ETL, design ELT - Azure Synapse Analytics | Microsoft Docs

 

Or if you want to stay in the SQL world, I would store the CSV in a BLOB storage and then load it directly or with data factory to an Azure SQL Server:

Bulk access to data in Azure Blob storage - SQL Server | Microsoft Docs

Loading a csv file into Azure SQL Database from Azure Storage | by Mayank Srivastava | Towards Data Science

 

If you need any help please let me know.
If I answered your question I would be happy if you could mark my post as a solution ✔️ and give it a thumbs up 👍
 
Best regards
Denis
 

View solution in original post

5 REPLIES 5
selimovd
Super User
Super User

Hello @Anonymous ,

 

so, we're talking about 1 billion rows in 100 files, right?

From my experience that seems to be too much to load it directly to Power BI.

 

I personally would store the csv in a BLOB storage and try Azure Synapse:

Instead of ETL, design ELT - Azure Synapse Analytics | Microsoft Docs

 

Or if you want to stay in the SQL world, I would store the CSV in a BLOB storage and then load it directly or with data factory to an Azure SQL Server:

Bulk access to data in Azure Blob storage - SQL Server | Microsoft Docs

Loading a csv file into Azure SQL Database from Azure Storage | by Mayank Srivastava | Towards Data Science

 

If you need any help please let me know.
If I answered your question I would be happy if you could mark my post as a solution ✔️ and give it a thumbs up 👍
 
Best regards
Denis
 
Anonymous
Not applicable

Hi @selimovd 

Thank you for a quick reply.
Can I use dataflow? If yes then how should I be using it


Anonymous
Not applicable

Hello 

I have muliple csv in sharepoint ( about 100 files).  I want to combine all cvs in one file and generate report in powerbi. 

In all their are about 1000 million rows. 

How should I process such huge dataset in power bi with low latency?  

Your suggestions/ comments would be valuable.
Thank you in advance.

Hey @Anonymous ,

 

I guess that was a double post:

https://community.powerbi.com/t5/Desktop/Processing-of-huge-file-in-power-bi/m-p/1846991

 

Best regards

Denis

Hey @Anonymous ,

 

you can definitely give it a try.

Be aware there are some limitations. Check the documentation:

Dataflows Limitations, restrictions and supported connectors and features - Power BI | Microsoft Docs

 

For example:

"Shared dataflows have a refresh limit of two hours per entity, and three per dataflow. So if you have two entities, and each takes two hours, you shouldn't put them in the same dataflow".

 

I think data flows have some throttling and from my experience Sharepoint tends to be kind of slow. But give it a try. In the best case it works, in the worst case it doesn't.

 

If you need any help please let me know.
If I answered your question I would be happy if you could mark my post as a solution ✔️ and give it a thumbs up 👍
 
Best regards
Denis
 

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.