Predict-and-Optimize Techniques for Data-Driven Optimization Problems: A ReviewShow others and affiliations
2025 (English)In: Neural Processing Letters, ISSN 1370-4621, E-ISSN 1573-773X, Vol. 57, no 2, article id 40Article, review/survey (Refereed) Published
Abstract [en]
Machine learning predictive models rely on data to make predictions for new input data. However, accurate predictions are not always the end goal; practitioners often aim to make informed decisions through optimization problems (OPs) based on these predictions. While the idea that better predictions lead to better decisions was widely accepted, the latest literature highlights that even small inaccuracies in predictions can lead to poor decisions depending on the structure of the OP. Therefore, recent research has been focused on end-to-end learning approaches that directly improve decision quality without considering prediction accuracy when solving data-driven OPs. Some of these end-to-end learning approaches are mainly called "predict-and-optimize" (PaO), and they aim to learn a predictor based on the quality of the downstream task decisions by incorporating mathematical programming into the learning process. This literature review discusses the variations of and approaches to PaO problems by proposing a unified notation and a taxonomy for them. Throughout the paper, we aim to provide a valuable roadmap for researchers and practitioners in the field, guiding them to choose data-driven methods to solve their decision problems effectively.
Place, publisher, year, edition, pages
Springer, 2025. Vol. 57, no 2, article id 40
Keywords [en]
Neural networks, Constrained optimization, Decision making, Machine learning, Optimization problems
National Category
Computer Sciences Artificial Intelligence
Identifiers
URN: urn:nbn:se:oru:diva-120675DOI: 10.1007/s11063-025-11746-wISI: 001465322200002OAI: oai:DiVA.org:oru-120675DiVA, id: diva2:1953861
Funder
Knowledge Foundation, 20190128Wallenberg AI, Autonomous Systems and Software Program (WASP)Knut and Alice Wallenberg FoundationÖrebro University
Note
This work has been supported by the Industrial Graduate School Collaborative AI & Robotics funded by the Swedish Knowledge Foundation Dnr:20190128, and the Knut and Alice Wallenberg Foundation through Wallenberg AI, Autonomous Systems and Software Program (WASP). Open access funding provided by Örebro University.
2025-04-232025-04-232025-04-23Bibliographically approved