Demonstration and Mitigation of Spatial Sampling Bias for Machine-Learning Predictions

Liu, Wendi; Ikonnikova, Svetlana; Scott Hamlin, H.; Sivila, Livia; Pyrcz, Michael J.

doi:10.2118/203838-PA

User: Guest

Lehrstuhl für Resource Economics (Prof. Ikonnikova)

Back
Back to start of result list
Permanent link for displayed object

Title:: Demonstration and Mitigation of Spatial Sampling Bias for Machine-Learning Predictions
Document type:: Zeitschriftenaufsatz
Author(s):: Liu, Wendi; Ikonnikova, Svetlana; Scott Hamlin, H.; Sivila, Livia; Pyrcz, Michael J.
Non-TUM Co-author(s):: ja
Cooperation:: international
Abstract:: Summary Machine learning provides powerful methods for inferential and predictive modeling of complicated multivariate relationships to support decision-making for spatial problems such as optimization of unconventional reservoir development. Current machine-learning methods have been widely used in exhaustive spatial data sets like satellite images. However, geological subsurface characterization is significantly different because it is conditioned by sparse, nonrepresentative sampling. These sparse spatial data sets are generally not sampled in a representative manner; therefore, they are biased. The critical questions are: first, does spatial bias in training data result in a bias for machine-learning-based predictive models; and if there is a bias, how can we mitigate the bias in these spatial machine-learning-based predictions? The presence and mitigation of prediction with spatial sampling bias is demonstrated with tree-based machine learning due to its high degree of interpretability. In expectation, training data bias imposes bias in machine-learning predictions over a wide variety of spatial data configurations and degrees of bias, even when the model is applied to make predictions with unbiased testing and real-world data. We reduce the bias in prediction with a novel spatial weighted tree method over a variety of spatial data configurations and degrees of spatial sampling bias. The proposed method is able to improve the accuracy for reservoir evaluation. We recommend modeling checking and bias mitigation for all machine-learning prediction models with sparse, spatial data sets, because bias in, bias out. «
Summary Machine learning provides powerful methods for inferential and predictive modeling of complicated multivariate relationships to support decision-making for spatial problems such as optimization of unconventional reservoir development. Current machine-learning methods have been widely used in exhaustive spatial data sets like satellite images. However, geological subsurface characterization is significantly different because it is conditioned by sparse, nonrepresentative sampling. These... »
Intellectual Contribution:: Contribution to Practice
Journal title:: SPE Reservoir Evaluation & Engineering
Year:: 2021
Journal volume:: 24
Month:: February
Journal issue:: 01
Pages contribution:: 262--274
Language:: en
Fulltext / DOI:: doi:10.2118/203838-PA
WWW:: https://onepetro.org/REE/article/24/01/262/448271/Demonstration-and-Mitigation-of-Spatial-Sampling
Print-ISSN:: 1094-6470, 1930-0212
Judgement review:: 0
Peer reviewed:: Ja
Commissioned:: not commissioned
Technology:: Ja
Interdisciplinarity:: Ja
Mission statement:: ;
Ethics and Sustainability:: Nein
BibTeX

versions

Occurrences:

mediaTUM Gesamtbestand Einrichtungen Schools TUM School of Management Departments Economics and Policy Lehrstuhl für Resource Economics (Prof. Ikonnikova)

mediaTUM Gesamtbestand Hochschulbibliographie 2021 Fakultäten Wirtschaftswissenschaften Lehrstuhl für Resource Economics (Prof. Ikonnikova)