The true test – predicting the unknown (Motivation Decay Part 7)

  • by

Looking backwards is always easy – it is looking forward and making predictions which show if your algorithm does work or might be fundamentally flawed. I discarded all my original data when I revisited the motivation decay as I couldn’t tell how accurate they have been. So now is a good time to revisit this ‘discarded data’ and seen how well we do. We could have a philosophical discussion how well this qualifies as independent test set – but maybe have a look the data first:

My generation 1 data (see also GamePress)

Max CPDecay Percent Per HourDecay calc.T(max)T(max) calc
1041.37%1.11%58.3972
7411.89%1.83%42.3343.72
10102.38%2.35%33.6134.04
12812.97%3.01%26.9426.61
12822.96%3.01%27.0326.58
16114.10%4.12%19.5119.44
16444.26%4.25%18.7818.81
16514.29%4.28%18.6518.67
17374.68%4.68%17.0917.08
18305.03%5.18%15.9015.45
18505.24%5.29%15.2715.11
19986.31%6.30%12.6812.71
20126.39%6.41%12.5212.49
22068.32%8.32%9.629.61
22288.62%8.60%9.289.30
231510.10%9.87%7.928.10
23359.98%10.00%8.028
25979.97%10.00%8.028
28749.96%10.00%8.038

The calculated values actually are a lot closer as I thought. We have an average difference of just 0.05% between measured and calculated values. Only 3 data points have a difference > 0.06%. Unfortunately these three contain two of the most influential data points. The point for 104 CP is worst. This isn’t surprising as it is the most difficult one to get right.

Nearly as bad is CP 2315 – I guess my Pokemon got kicked early and I didn’t redo this data point. But more importantly – there is a large gap in data between 104 and 741 CP – therefore there is no chance to determine that the algorithm is discontinuous.

Here is the match for my generation 2 data.

CPDecay/hourDecay calc.T(max)T(max) calc
1031.01%1.11%79.2172.00
1491.08%1.11%74.0772.00
5011.46%1.43%54.6555.96
7411.89%1.83%42.3443.72
10102.38%2.35%33.6734.04
12693.02%2.97%26.5026.90
12812.97%3.01%26.9726.61
12822.96%3.01%27.0326.58
13853.37%3.31%23.7624.17
16114.10%4.12%19.5319.44
16444.26%4.25%18.7918.81
16514.29%4.28%18.6718.67
17374.68%4.68%17.0917.08
18305.03%5.18%15.9115.45
18425.35%5.25%14.9515.25
18505.24%5.29%15.2615.11
19986.31%6.30%12.6912.71
20126.39%6.41%12.5212.49
21287.61%7.45%10.5110.74
22068.32%8.32%9.619.61
22288.62%8.60%9.289.30
22719.24%9.19%8.668.70
23049.54%9.70%8.398.25

And in graphic format

Looking back is interesting. It seems I reused some generation 1 data while removing the two most misleading values as I must have been aware of the limited quality of them. I also add a high quality data point around 500 CP and try to get accurate data for 149 CP and 103 CP. Unfortunately the latter seems deceivingly like 1% – and I never bothered to fill the gap between 300 CP and 500 CP.

The average difference between calculated and measured is 0.05% again. I don’t think we can expect any more – knowing how I generated my original data.