Fantasy Football Forecasting

Predicting the top performing teams

The absolute predictions of these algorithms are not the most important metric to judge them by, it is the relative performance of the teams that matters. We really need to have the ranking of performances correctly, so we will test these algorithms by seeing how accurately we can predict the top 3 teams.

For a selection of three teams out of 32 in the NFL, the likelihood of randomly drawing one of the top three in a given metric is 9.4%. To randomly guess all three correctly, the likelihood is 0.020%. These serve as a baseline to compare our precision and recall metric with, though there is not a 1 to 1 comparison we can make between the values. There are more than three teams that can appear in the top teams, as say, if no team fumbles the ball in a given week, you would have 32 teams in the "top 3" for fumbles for the week. We will however use the precision and recall metric for "team in the top three" for a week, for our test data set over 2020-2022.

Field Name	N in top 3	N predicted in top 3	N correct	Precision	Recall
Complete Passes	254	965	220	0.228	0.866
Defensive Points Allowed	628	1354	628	0.464	1.000
Field goal successes close	810	1354	810	0.598	1.000
Field goal successes far	1013	1354	1013	0.748	1.000
Sacks	364	1354	364	0.269	1.000
Touchdown Passing	584	1257	546	0.434	0.934
Touchdown Rushing	705	1354	705	0.521	1.000
Touchdown Extra Points	470	1175	395	0.336	0.840
Yards Passing/Receiving	189	372	82	0.220	0.434
Yards Rushing	184	656	128	0.195	0.696

Some of the fields only make sense to compare the top 1, or best performing. In cases like fumbles, where many teams may only be at 0, this category will be crowded and useless as a top 3. We provide that result below, and recommend comparing against a simple 3.12% chance of guessing randomly the top performing team.

Field Name	N top	N predicted in top	N correct	Precision	Recall
Fumbles	443	471	165	0.350	0.372
Fumbles Forced	443	471	165	0.350	0.372
Interceptions caught	108	1207	89	0.074	0.824
Interceptions thrown	634	1349	632	0.469	0.997

Comparing our metrics against the random samples, it's clear most of our forecasts at least approximate better than random guessing. We are capturing a signal, and able to predict based on it.

However, there is a big difference between being able to predict values, and the predictions being useful. In some cases the top performers are predicted at twice the rate as they should be. While not a terrible performance on paper, someone having a pen and pencil following players and teams can likely perform on a similar or better level than this algorithm with simple assumptions.

This project is an ongoing one, and I've already outlined places for improvements. I will update these pages when appropriate, and when results improve.