Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Register now to learn Fabric in free live sessions led by the best Microsoft experts. From Apr 16 to May 9, in English and Spanish.

Reply
Anonymous
Not applicable

Try to create an estimate of a value per day

Hi All! Apologies for the vague title and length of this post but I'm trying to achieve something quite tricky and not sure how to exactly tackle it.

 

I am trying to create an estimate for faults that occur based on previous ratios. In the table below, if we assume a factory with two different types of machines, a daily Production figure that is always available, Unscheduled Deferment when something unexpectedly causes production to drop (e.g. a machine gets stuck or breaks down), and Scheduled Deferment (e.g. routine maintenance), and a Theoretical Prod which is the sum of Production+Unscheduled Deferment+Scheduled Deferment.

 

DateUnscheduled DefermentScheduled DefermentTheoretical ProdProductionMachine Type
31/12/20192455728A
31/12/20191214229B
01/01/20202325530A
01/01/20201254528B
02/01/2020124643B
02/01/20202396937A
03/01/2020  1212A
03/01/2020  4040B
04/01/2020  2323B
04/01/2020  1111A

 

My initial logic is as follows, assuming we've selected a slicer where only Machine Type "A" is selected:

  1. Find the dates where both the Unscheduled Deferment & Scheduled Deferment columns are BLANK() (i.e. 03/01/2020, 04/01/2020).
  2. Then, assuming we are using the past two days to draw our estimates from (i.e. 01/01/2020, 02/01/2020) I would like to get the average Theoretical Production: (30+37)/2 = 33.5 
  3. Then, I would like to do the same for the Unscheduled/Scheduled Ratio to allow me to distribute the change in production to Unscheduled & Scheduled deferments in the same ratio as shown in the past two days. So: (23+23)/(23+23+9+2) = 0.807
  4. Now, I would use the average Theoretical Production from Step 2. and the daily Production figure to estimate the deferments. For 03/01/2020, I would do: 33.5-12 = 21.5  So 21.5 is the difference in Production and essentially what we expect to be theoretically producing. The difference being due to Unscheduled & Scheduled Deferment. Using the previous ratio, I would estimate Unscheduled Deferment to be 21.5 * 0.807 = 17.35 and Scheduled Deferment to be 21.5 - 17.35 = 4.15
  5. I would then repeat the same for 04/01/2020

In theory, I would end up with a table similar to the one below which would allow me to visualise on a daily basis the estimates:

DateUnscheduledScheduledTheoretical ProdProductionMachine Type
31/12/20192455728A
31/12/20191214229B
01/01/20202325530A
01/01/20201254528B
02/01/2020124643B
02/01/20202396937A
03/01/202017.354.1533.512A
03/01/20203.581.9245.540B
04/01/202014.637.8745.523B
04/01/202018.164.3433.511A

 

Any help or pointers in how to achieve this steps would be much appreciated as I am quite lost! I am able to find the last day that contains deferment data but then I am getting confused using calculated columns and measures.

2 REPLIES 2
lbendlin
Super User
Super User

you will want to consider statistical significance.  In other words - you don't have enough data points for reliable predictions.

 

Once you have (many) more data points  you can then simplify your formulas and treat both unscheduled and maintenance as regular parts of your data.

Anonymous
Not applicable

Hi @lbendlin, I understand it's not the soundest of methods of generating estimates but it was a method accepted by the business. 

Helpful resources

Announcements
Microsoft Fabric Learn Together

Microsoft Fabric Learn Together

Covering the world! 9:00-10:30 AM Sydney, 4:00-5:30 PM CET (Paris/Berlin), 7:00-8:30 PM Mexico City

PBI_APRIL_CAROUSEL1

Power BI Monthly Update - April 2024

Check out the April 2024 Power BI update to learn about new features.

April Fabric Community Update

Fabric Community Update - April 2024

Find out what's new and trending in the Fabric Community.