Authors
Wlodzimierz Ogryczak, Patrice Perny, Paul Weng
Publication date
2013/9
Journal
International Journal of Information Technology & Decision Making
Volume
12
Issue
05
Pages
1021-1053
Publisher
World Scientific Publishing Company
Description
A Markov decision process (MDP) is a general model for solving planning problems under uncertainty. It has been extended to multiobjective MDPs to address multicriteria or multiagent problems in which the value of a decision must be evaluated according to several, sometimes conflicting, viewpoints. Although most studies concentrate on determining the set of Pareto-optimal policies, we focus here on a more specialized problem: the direct determination of policies achieving well-balanced tradeoffs. To this end, we introduce a reference point method based on the optimization of a weighted ordered weighted average (WOWA) of individual disachievements. We show that the resulting notion of optimal policy does not satisfy the Bellman principle and depends on the initial state. To overcome these difficulties, we propose a solution method based on a linear programming (LP) reformulation …
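The WOWA aggregation named in the description can be illustrated with a minimal sketch. It follows the general textbook definition of WOWA (values sorted from largest to smallest, with a piecewise-linear interpolation of the cumulative OWA weights), which is an assumption here: the record does not state the paper's exact formulation, and the LP reformulation is not shown. The function name `wowa` and its parameters are hypothetical.

```python
def wowa(values, importance, owa_weights):
    """Weighted ordered weighted average of `values` (illustrative sketch).

    values      -- one score per criterion (e.g. disachievements, larger = worse)
    importance  -- one non-negative importance weight per criterion
    owa_weights -- one non-negative weight per rank (rank 1 = largest value)
    """
    n = len(values)
    if not (len(importance) == len(owa_weights) == n) or n == 0:
        raise ValueError("all three inputs must be non-empty and of equal length")

    total_p = sum(importance)

    # Cumulative OWA weights at the knots k/n, k = 0..n.
    cum_w = [0.0]
    for w in owa_weights:
        cum_w.append(cum_w[-1] + w)
    total_w = cum_w[-1]

    def w_star(x):
        # Piecewise-linear interpolation through the points (k/n, cum_w[k] / total_w).
        pos = min(max(x, 0.0), 1.0) * n
        k = min(int(pos), n - 1)
        frac = pos - k
        return (cum_w[k] + frac * owa_weights[k]) / total_w

    # Visit criteria from the largest value to the smallest.
    order = sorted(range(n), key=lambda i: values[i], reverse=True)

    result = 0.0
    acc = 0.0  # cumulative normalized importance of the criteria visited so far
    for i in order:
        prev = w_star(acc)
        acc += importance[i] / total_p
        result += (w_star(acc) - prev) * values[i]
    return result


if __name__ == "__main__":
    # Decreasing OWA weights put more emphasis on the worst (largest) values.
    print(wowa([3.0, 1.0, 2.0], [0.5, 0.25, 0.25], [0.5, 0.3, 0.2]))
```

Under this definition, uniform importance weights reduce the sketch to a plain OWA, and uniform OWA weights reduce it to a weighted mean, which is a convenient sanity check.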
Total citations
[Chart: citations per year, 2012-2024]
Scholar articles
W Ogryczak, P Perny, P Weng - International Journal of Information Technology & …, 2013