2024
Optimal Predictor Selection for High-dimensional Nonparametric Forecasting
Date: September 26, 2024
Event: NUMBAT Group Meeting
Sparse Multiple Index (SMI) Models for High-dimensional Nonparametric Forecasting
Date: July 01, 2024
Event: 44th International Symposium on Forecasting, Dijon, France
Better Forecasts through Better Choices: Optimal Predictor Selection for High-dimensional Nonparametric Forecasting
Date: June 18, 2024
Event: Three Minute Thesis (3MT) Competition 2024, Faculty of Business and Economics, Monash University
Result: Runner-up (Also competed as a Wildcard in the Monash University 3MT Competition 2024.)
Static Slide: Script: On a scorching summer day, every moment outside feels like a battle against the heat. After running errands, you finally make it home. Rushing into the kitchen, you grab a glass, filled to the brim with icy-cold water, and switch on the AC. But, even before you fully enjoy that cooling breeze, “Click”… the power goes out. In an instant, you are drenched in sweat all over again.
Doesn’t sound great, right?
Power cuts can occur due to bad weather, damage to infrastructure, or even regular maintenance. But did you know that electricity regulators sometimes deliberately cut power, simply because they are unprepared to meet the demand? This unpreparedness directly falls on to not having accurate predictions for the future.
Electricity demand depends on several factors such as prevailing weather conditions, calendar effects, demographics, and even economic factors. Temperature, for example, is obviously related to electricity demand. Not only current temperature, but at least a week of historical temperatures, can influence electricity demand at a given time.
In this case, statisticians develop a model combining such current and historical information to produce forecasts. However, feeding all the available information into a model does not produce accurate forecasts. Hence, it is important to determine which information is relevant, and which is not. This is where expert knowledge comes in. As this is very subjective, having a principled and a more objective method is a pressing requirement.
My research addresses this problem. I’m developing an optimisation algorithm that automatically identifies the most important information and discards unimportant information without the need for expert knowledge. This algorithm takes all the historical information that we provide as inputs, uses algorithmic reasoning to combine the inputs into groups, and explores the possibly complicated relationships between the past and the future. The result is an optimised model that can produce forecasts with better predictive accuracy.
Now you might wonder, does this algorithm only solves the electricity demand problem? Not really! I’m developing a general algorithm that you can use in almost any forecasting problem: you may try to forecast mortality, sales, or groundwater level. I got you covered!
I’ve used this algorithm to forecast mortality and solar intensity. It has shown promising accuracy improvements compared to benchmark methods. Therefore, once completed, my research will be another step forward in statistical forecasting!