Earth Science News
EARTH OBSERVATION
Validation technique could help scientists make more accurate forecasts
illustration only
Validation technique could help scientists make more accurate forecasts
by Adam Zewe | MIT News
Boston MA (SPX) Feb 10, 2025

Should you grab your umbrella before you walk out the door? Checking the weather forecast beforehand will only be helpful if that forecast is accurate.

Spatial prediction problems, like weather forecasting or air pollution estimation, involve predicting the value of a variable in a new location based on known values at other locations. Scientists typically use tried-and-true validation methods to determine how much to trust these predictions.

But MIT researchers have shown that these popular validation methods can fail quite badly for spatial prediction tasks. This might lead someone to believe that a forecast is accurate or that a new prediction method is effective, when in reality that is not the case.

The researchers developed a technique to assess prediction-validation methods and used it to prove that two classical methods can be substantively wrong on spatial problems. They then determined why these methods can fail and created a new method designed to handle the types of data used for spatial predictions.

In experiments with real and simulated data, their new method provided more accurate validations than the two most common techniques. The researchers evaluated each method using realistic spatial problems, including predicting the wind speed at the Chicago O-Hare Airport and forecasting the air temperature at five U.S. metro locations.

Their validation method could be applied to a range of problems, from helping climate scientists predict sea surface temperatures to aiding epidemiologists in estimating the effects of air pollution on certain diseases.

"Hopefully, this will lead to more reliable evaluations when people are coming up with new predictive methods and a better understanding of how well methods are performing," says Tamara Broderick, an associate professor in MIT's Department of Electrical Engineering and Computer Science (EECS), a member of the Laboratory for Information and Decision Systems and the Institute for Data, Systems, and Society, and an affiliate of the Computer Science and Artificial Intelligence Laboratory (CSAIL).

Broderick is joined on the paper by lead author and MIT postdoc David R. Burt and EECS graduate student Yunyi Shen. The research will be presented at the International Conference on Artificial Intelligence and Statistics.

Evaluating validations

Broderick's group has recently collaborated with oceanographers and atmospheric scientists to develop machine-learning prediction models that can be used for problems with a strong spatial component.

Through this work, they noticed that traditional validation methods can be inaccurate in spatial settings. These methods hold out a small amount of training data, called validation data, and use it to assess the accuracy of the predictor.

To find the root of the problem, they conducted a thorough analysis and determined that traditional methods make assumptions that are inappropriate for spatial data. Evaluation methods rely on assumptions about how validation data and the data one wants to predict, called test data, are related.

Traditional methods assume that validation data and test data are independent and identically distributed, which implies that the value of any data point does not depend on the other data points. But in a spatial application, this is often not the case.

For instance, a scientist may be using validation data from EPA air pollution sensors to test the accuracy of a method that predicts air pollution in conservation areas. However, the EPA sensors are not independent - they were sited based on the location of other sensors.

In addition, perhaps the validation data are from EPA sensors near cities while the conservation sites are in rural areas. Because these data are from different locations, they likely have different statistical properties, so they are not identically distributed.

"Our experiments showed that you get some really wrong answers in the spatial case when these assumptions made by the validation method break down," Broderick says.

The researchers needed to come up with a new assumption.

Specifically spatial

Thinking specifically about a spatial context, where data are gathered from different locations, they designed a method that assumes validation data and test data vary smoothly in space.

For instance, air pollution levels are unlikely to change dramatically between two neighboring houses.

"This regularity assumption is appropriate for many spatial processes, and it allows us to create a way to evaluate spatial predictors in the spatial domain. To the best of our knowledge, no one has done a systematic theoretical evaluation of what went wrong to come up with a better approach," says Broderick.

To use their evaluation technique, one would input their predictor, the locations they want to predict, and their validation data, then it automatically does the rest. In the end, it estimates how accurate the predictor's forecast will be for the location in question. However, effectively assessing their validation technique proved to be a challenge.

"We are not evaluating a method, instead we are evaluating an evaluation. So, we had to step back, think carefully, and get creative about the appropriate experiments we could use," Broderick explains.

First, they designed several tests using simulated data, which had unrealistic aspects but allowed them to carefully control key parameters. Then, they created more realistic, semi-simulated data by modifying real data. Finally, they used real data for several experiments.

Using three types of data from realistic problems, like predicting the price of a flat in England based on its location and forecasting wind speed, enabled them to conduct a comprehensive evaluation. In most experiments, their technique was more accurate than either traditional method they compared it to.

In the future, the researchers plan to apply these techniques to improve uncertainty quantification in spatial settings. They also want to find other areas where the regularity assumption could improve the performance of predictors, such as with time-series data.

Research Report:Consistent Validation for Predictive Methods in Spatial Settings

Related Links
MIT CSAIL
Earth Observation News - Suppiliers, Technology and Application

Subscribe Free To Our Daily Newsletters
Tweet

RELATED CONTENT
The following news reports may link to other Space Media Network websites.
EARTH OBSERVATION
Trump taps 'Sharpiegate' meteorologist to lead top science agency
Washington (AFP) Feb 4, 2025
A meteorologist who caved to political pressure during Donald Trump's first administration to mislead the public about a hurricane forecast was nominated by the president Tuesday to once more lead the National Oceanic and Atmospheric Administration (NOAA). Neil Jacobs, who previously helmed the renowned science agency from 2018 to 2021, was officially censured for his role in the infamous "Sharpiegate" scandal - one of the more bizarre episodes of Trump's first term. Despite this, he has now be ... read more

EARTH OBSERVATION
One dead, dozens missing in China landslide

Fukushima nuclear plant operator to dismantle water tanks next week

El Salvador offers to jail violent U.S. criminals in 'unprecedented' deal

Israel defence minister orders army to plan for 'voluntary' departures from Gaza

EARTH OBSERVATION
Alloy discovered that barely changes with temperature

Big Tech's AI spending rattles markets

Orbex lands D-Orbit deal prior to first mission this year

EdgeCortix unveils SAKURA-I with proven radiation immunity for orbital and lunar ventures

EARTH OBSERVATION
Seeking climate connections among the oceans' smallest organisms

Marine Prosperity Areas introduce a fresh approach to ocean conservation

New Zealand chides Cook Islands for 'lack of transparency'

New Zealand says 'blindsided' by Cook Islands' China overture

EARTH OBSERVATION
Greenland ice crevasses escalate fueling further rise in sea levels

Arctic sea ice levels second lowest on record for January: US data

Ice streams move due to tiny ice quakes

Greenland glacier accelerates each day with weather and tide changes

EARTH OBSERVATION
Drying and rewetting cycles amplify soil CO2 emissions

Hong Kong scientists fight to save fragrant incense trees

French cognac exports to China slump as tariffs bite; Scottish whisky makers fear return of Trump tariffs

Study examines how African farmers are adapting to mountain climate change

EARTH OBSERVATION
Fresh quake barrage hits Greek island Santorini

'We're not afraid': Santorini residents brave tremors to stay put

Pain, anger as Turkey marks two years since quake disaster

Greek PM insists no danger from Santorini quake swarm

EARTH OBSERVATION
80 dead in southern Sudan violence: UN

Niger orders Red Cross to leave country

France to pull troops from I.Coast in February;Kenya urges DRC 'immediate ceasefire'

At least 56 killed as fighting grips Sudan's capital

EARTH OBSERVATION
New play takes on OpenAI drama and AI's existential questions

Trump signs order to get 'transgender ideology' out of military

How to Design Humane Autonomous Systems

Three million years ago our ancestors relied on plant-based diets

Subscribe Free To Our Daily Newsletters




The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.