Earn the coveted Fabric Analytics Engineer certification. 100% off your exam for a limited time only!
I have a friend he works on Power Bi we have a common database that contains data of at least four years of 1.8 million clients per month which means about 24 millions rows in one year obviusly every month we have new clients making greater the database everymonth. this database contains at least 10 columns so if i wanted to upload the data of one year means 24 millions rows which means at least 100 Mb of storage but somehow this guy uploaded the data and make it refreshsable in one clic, his data is 14 Mb of storage and contains the whole same data. I asked him how he did that and didnt tell me how. So i came up here if everyone here have some idea. We work on sql and usually (me of course) upload the query into power by to generate the table. CSV doesnt workforetelling any further answer from now. Any idea?
Hi @jorgesantin28,
Have you made a test with 24 minllions rows data? I have made a test, it is about 70M.
Actually, You could write the select query under SQL statement to get the required data when you get data from SQL database.
Besides, you need to optimize your data model, such as remove the data you don't need.
Best Regards,
Cherry
I tried to made it through a csv file 3 columns the weight is obout 24Mb of size on disk maybe the optimized way is grouping by the whole data.
So Power BI makes use of Vertipaq which does columnar storage and it makes columnstore indexes very efficient especially since it has a better and unique way of indexing opposed to how it is usually done therefore Power BI compresses the data greatly and stores data in much smaller size on the Power BI WorkSpace/Service.
User | Count |
---|---|
141 | |
113 | |
104 | |
77 | |
64 |
User | Count |
---|---|
135 | |
123 | |
101 | |
71 | |
61 |