Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Grow your Fabric skills and prepare for the DP-600 certification exam by completing the latest Microsoft Fabric challenge.

Reply
Angel_S_M
New Member

Need Advice: Optimise Datasheet Storage Space (Reduce No. of Rows)

I have a datasheet on Power BI, reading test parameters of various devices. However, there are so many values taken that the number of rows blows up to 6 digits. For example, the sample data extract below is only for one test, for two serial numbers, over a limited frequency range.

 

Is there a method of optimising the raw data to keep the row/column number as low as possible with ability to be plotted as an Output vs. Frequency curve, without aggregating the datapoints or otherwise changing the raw data values? Specifically I want to decrease the number of cells used so that storage space and processing speed is reduced. 

 

For example, would it be possible to plot a curve from two columns with comma separated values within a cell? E.g. plotting if all the Frequency values for SerialNumber ABC01 were in csv format in one cell, and the same for Output.

Otherwise, is there a manual transformation method such as using Pivoting that can reduce the space used?

Thank you, looking forward to any solutions! 😃 

 

SerialNumberFrequencyOutput
ABC012000109.07
ABC011900107.84
ABC011800106.93
ABC011700106.2
ABC011600105.03
ABC011500104.39
ABC011400103.97
ABC011300103.76
ABC011200103.67
ABC011100103.41
ABC011000103.14
ABC01900102.85
ABC01800102.46
ABC01700102.44
ABC01600102.18
ABC01500102.23
ABC01400102.27
ABC01300102.75
ABC01200102.71
ABC022000109.76
ABC021900108.51
ABC021800107.68
ABC021700106.9
ABC021600105.73
ABC021500105.08
ABC021400104.65
ABC021300104.42
ABC021200104.41
ABC021100104.09
ABC021000103.83
ABC02900103.6
ABC02800103.15
ABC02700103.09
ABC02600102.87
ABC02500102.92
ABC02400102.9
ABC02300103.43
ABC02200103.46
1 ACCEPTED SOLUTION
vs_7
Responsive Resident
Responsive Resident

HI @Angel_S_M ,

check below points:

 

  1. Data Aggregation: Instead of storing each individual data point, you can aggregate the data at a higher level. For example, you can calculate the average, minimum, maximum, or any other relevant summary statistics for each frequency range or time period. This can significantly reduce the number of rows and still provide meaningful information for plotting the curve.

  2. Data Compression: Power BI supports data compression techniques, which can help reduce the storage space and improve processing speed. Power BI automatically applies compression algorithms to minimize the size of your dataset. You can also manually compress the data by utilizing data types with lower precision or by rounding off values.

  3. Data Transformation: You can use Power Query Editor in Power BI to transform your data before loading it into the dataset. For example, you can unpivot the columns so that each data point is represented in a single row, rather than having multiple columns for each frequency value. This can help reduce the number of columns and make the dataset more efficient.

  4. Data Modeling Techniques: Power BI offers various modeling techniques, such as creating calculated columns or measures, to perform calculations or aggregations on the fly. By utilizing these techniques, you can avoid duplicating data and reduce the overall storage space.

View solution in original post

1 REPLY 1
vs_7
Responsive Resident
Responsive Resident

HI @Angel_S_M ,

check below points:

 

  1. Data Aggregation: Instead of storing each individual data point, you can aggregate the data at a higher level. For example, you can calculate the average, minimum, maximum, or any other relevant summary statistics for each frequency range or time period. This can significantly reduce the number of rows and still provide meaningful information for plotting the curve.

  2. Data Compression: Power BI supports data compression techniques, which can help reduce the storage space and improve processing speed. Power BI automatically applies compression algorithms to minimize the size of your dataset. You can also manually compress the data by utilizing data types with lower precision or by rounding off values.

  3. Data Transformation: You can use Power Query Editor in Power BI to transform your data before loading it into the dataset. For example, you can unpivot the columns so that each data point is represented in a single row, rather than having multiple columns for each frequency value. This can help reduce the number of columns and make the dataset more efficient.

  4. Data Modeling Techniques: Power BI offers various modeling techniques, such as creating calculated columns or measures, to perform calculations or aggregations on the fly. By utilizing these techniques, you can avoid duplicating data and reduce the overall storage space.

Helpful resources

Announcements
Europe Fabric Conference

Europe’s largest Microsoft Fabric Community Conference

Join the community in Stockholm for expert Microsoft Fabric learning including a very exciting keynote from Arun Ulag, Corporate Vice President, Azure Data.

RTI Forums Carousel3

New forum boards available in Real-Time Intelligence.

Ask questions in Eventhouse and KQL, Eventstream, and Reflex.

MayPowerBICarousel1

Power BI Monthly Update - May 2024

Check out the May 2024 Power BI update to learn about new features.