Hi. I'm just curious about the graphic representation of distinct vs unique values in Power Query Editor, specifically in the Column distribution view and the Column profile view. My understanding is the distinct count is how many values are in the column that appear more than once and the unique count is how many values are in the column that only appear once.
But the charts don't always seem to reflect this, so I'm guessing the chart is truncated, although I haven't read that anywhere. I put some images below to help explain. Thank you.
So in this image it says there are 11 distinct values, 0 unique. The green bars and their labels correspond perfectly to the actual column.
But in this image (even when I scroll to the end of the window) there are only 49 green bars with labels, not 87.
But you can choose "Column profiling based on top 1000 rows" or "Column profiling based on entire data set", and all the other profiling information you enable from the View tab will change based on that choice.
Hi @bmcminn ,
As far as I know, the graph is based on the first 1000 rows only.
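On the definitions in the question: as I understand the Power Query documentation, the distinct count is the number of different values in the column (including values that appear only once), and the unique count is the number of values that appear exactly once. That fits the first example, where 11 distinct and 0 unique would mean every one of the 11 values repeats. Here is a rough Python sketch of that counting logic. It is not how Power Query is implemented, and `profile_counts` is just a name I made up for illustration.

```python
from collections import Counter

def profile_counts(values):
    """Return (distinct, unique) for a list of column values.

    distinct: number of different values in the column
    unique:   number of values that occur exactly once
    (Hypothetical helper; mirrors the definitions as I understand them.)
    """
    freq = Counter(values)                       # value -> occurrence count
    distinct = len(freq)                         # every different value counts once
    unique = sum(1 for n in freq.values() if n == 1)
    return distinct, unique

# Made-up sample column: "a" and "c" repeat, "b" appears once.
sample = ["a", "a", "b", "c", "c", "c"]
print(profile_counts(sample))
```

With these definitions the unique count can never exceed the distinct count, since every unique value is also a distinct value.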