
Anonymous

Filter CSV data within the loading process

Hi

I have to load and combine txt files that each have multiple CSV formatted blocks. Something like:

 

# Some header info1
# Some header info2
#@startFirstBlock
A;B;C;D
E;F;G;H
I;L;M;N
#@endFirstBlock

#@startSecondBlock
1;2;3
4;5;6
#@endSecondBlock

 

As I only need the "header" rows and the rows between "#@startFirstBlock" and "#@endFirstBlock", is there a way to load only these rows and skip the rest during the loading process?

My question is essentially performance-driven: I have hundreds of files with thousands of rows in each "block", and each file can contain 10 or more of these blocks. Loading everything takes very long, and once it is loaded I have to delete the rows from the blocks I don't need (which also takes time). Is there a way to load only what I need from the start?

 

Thanks for helping.

1 ACCEPTED SOLUTION
ImkeF
Super User

Hi @Anonymous ,
No, that's not possible. These kinds of files have to be ingested fully before anything can be extracted from them.
But it could be that you are applying expensive transformations to them. Check out my performance tips for traps to avoid: Speed/Performance aspects – The BIccountant

Imke Feldmann (The BIccountant)
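
As an illustration (not part of the original reply), here is a minimal sketch of how the wanted rows could be cut out cheaply once a file has been read in full. It assumes the marker lines match exactly "#@startFirstBlock" and "#@endFirstBlock" with no trailing spaces, that header rows start with "# ", that the files are UTF-8, and that the first block has four columns. The column names Col1 to Col4 and the record fields Header and Block are hypothetical. Whether this is actually faster than an existing query has not been measured here.

```powerquery
// Hypothetical helper: takes the binary content of one file and returns
// the "# " header rows plus the first block as a table.
(content as binary) as record =>
let
    // Read the whole file once as a list of text lines (65001 = UTF-8, an assumption)
    AllLines = List.Buffer(Lines.FromBinary(content, null, null, 65001)),
    // Header rows start with "# ", marker rows start with "#@"
    HeaderLines = List.Select(AllLines, each Text.StartsWith(_, "# ")),
    // Locate the two markers; List.PositionOf returns -1 when a marker is missing
    StartPos = List.PositionOf(AllLines, "#@startFirstBlock"),
    EndPos = List.PositionOf(AllLines, "#@endFirstBlock"),
    // Keep only the lines strictly between the markers
    BlockLines =
        if StartPos >= 0 and EndPos > StartPos
        then List.Range(AllLines, StartPos + 1, EndPos - StartPos - 1)
        else {},
    // Split on ";" only for the rows that are kept
    Block = Table.FromList(
        BlockLines,
        Splitter.SplitTextByDelimiter(";"),
        {"Col1", "Col2", "Col3", "Col4"}
    )
in
    [Header = HeaderLines, Block = Block]
```

If this function were saved as a query named, say, ExtractFirstBlock (a hypothetical name), it could be invoked on the Content column returned by Folder.Files, for example with Table.AddColumn(Files, "Parsed", each ExtractFirstBlock([Content])). The idea is to slice by list position before any splitting or type conversion, so the unwanted blocks never go through the more expensive table steps.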


