Information relaxations and duality in stochastic dynamic programs

Authors

David B Brown, James E Smith, Peng Sun

Publication date

2010/8

Journal

Operations research

Volume

Issue

4-part-1

Pages

785-801

Publisher

INFORMS

Description

We describe a general technique for determining upper bounds on maximal values (or lower bounds on minimal costs) in stochastic dynamic programs. In this approach, we relax the nonanticipativity constraints that require decisions to depend only on the information available at the time a decision is made and impose a “penalty” that punishes violations of nonanticipativity. In applications, the hope is that this relaxed version of the problem will be simpler to solve than the original dynamic program. The upper bounds provided by this dual approach complement lower bounds on values that may be found by simulating with heuristic policies. We describe the theory underlying this dual approach and establish weak duality, strong duality, and complementary slackness results that are analogous to the duality results of linear programming. We also study properties of good penalties. Finally, we demonstrate the use of …
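As a rough illustration of the bounds described above, the following sketch works through a toy optimal stopping problem that is not taken from the paper. Three offers arrive one at a time, each equally likely to be 1, 2, or 3, and the last offer must be accepted. The problem, the threshold heuristic, and all names (`PRICES`, `T`, `relaxed_reward`, and so on) are assumptions made for this example. It enumerates every scenario exactly and compares a heuristic lower bound, the optimal value from backward induction, and two perfect-information upper bounds: one with a zero penalty and one with a penalty built from the optimal value function, which is one natural reading of what a "good" penalty looks like.

```python
from fractions import Fraction
from itertools import product

PRICES = (1, 2, 3)  # hypothetical offer values, each equally likely
T = 3               # hypothetical horizon; the last offer must be accepted


def continuation_values():
    """Backward induction. c[t] is the expected value of rejecting offer t."""
    c = [Fraction(0)] * T  # c[T-1] is never used: the last offer is forced
    nxt = Fraction(sum(PRICES), len(PRICES))  # expected value at the last period
    for t in range(T - 2, -1, -1):
        c[t] = nxt
        nxt = Fraction(sum(max(Fraction(p), c[t]) for p in PRICES), len(PRICES))
    return c, nxt  # nxt is now the optimal expected value at time 0


def value(t, p, c):
    """Optimal value at time t when the current offer is p."""
    return Fraction(p) if t == T - 1 else max(Fraction(p), c[t])


def heuristic_reward(path, threshold=2):
    """Nonanticipative heuristic: accept the first offer >= threshold."""
    for p in path[:-1]:
        if p >= threshold:
            return p
    return path[-1]


def relaxed_reward(path, c, use_penalty):
    """Best stopping time chosen with full knowledge of the whole path.

    With use_penalty=True, each decision to continue at time t is charged
    value(t+1, next offer) - c[t], which has zero mean for any policy that
    does not look ahead.
    """
    best = None
    penalty = Fraction(0)  # total penalty accumulated by continuing so far
    for t, p in enumerate(path):
        payoff = Fraction(p) - penalty
        best = payoff if best is None else max(best, payoff)
        if use_penalty and t < T - 1:
            penalty += value(t + 1, path[t + 1], c) - c[t]
    return best


def expectation(f):
    """Exact expectation of f over all equally likely offer paths."""
    paths = list(product(PRICES, repeat=T))
    return sum(Fraction(f(path)) for path in paths) / len(paths)


c, optimal = continuation_values()
lower = expectation(heuristic_reward)
upper_zero = expectation(lambda w: relaxed_reward(w, c, use_penalty=False))
upper_ideal = expectation(lambda w: relaxed_reward(w, c, use_penalty=True))

print(lower, optimal, upper_ideal, upper_zero)  # prints: 22/9 23/9 23/9 8/3
```

In this toy case the heuristic and the zero-penalty relaxation bracket the optimal value (weak duality), and the value-function penalty closes the gap entirely (strong duality). In realistic problems the optimal value function is unknown, so a penalty would have to be built from an approximation and the expectations estimated by simulation.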

Total citations

Cited by 288

Citations per year: 2010: 3, 2011: 13, 2012: 15, 2013: 17, 2014: 21, 2015: 22, 2016: 29, 2017: 24, 2018: 22, 2019: 21, 2020: 29, 2021: 17, 2022: 21, 2023: 22, 2024: 12
