IDEAS home Printed from https://ideas.repec.org/a/spr/joptap/v136y2008i2d10.1007_s10957-007-9297-7.html
   My bibliography  Save this article

Measure-Valued Differentiation for Markov Chains

Author

Listed:
  • B. Heidergott

    (Vrije Universiteit and Tinbergen Institute)

  • F. J. Vázquez-Abad

    (University of Melbourne)

Abstract

This paper addresses the problem of sensitivity analysis for finite-horizon performance measures of general Markov chains. We derive closed-form expressions and associated unbiased gradient estimators for the derivatives of finite products of Markov kernels by measure-valued differentiation (MVD). In the MVD setting, the derivatives of Markov kernels, called $\mathcal{D}$ -derivatives, are defined with respect to a class of performance functions $\mathcal{D}$ such that, for any performance measure $g\in\mathcal{D}$ , the derivative of the integral of g with respect to the one-step transition probability of the Markov chain exists. The MVD approach (i) yields results that can be applied to performance functions out of a predefined class, (ii) allows for a product rule of differentiation, that is, analyzing the derivative of the transition kernel immediately yields finite-horizon results, (iii) provides an operator language approach to the differentiation of Markov chains and (iv) clearly identifies the trade-off between the generality of the performance classes that can be analyzed and the generality of the classes of measures (Markov kernels). The $\mathcal{D}$ -derivative of a measure can be interpreted in terms of various (unbiased) gradient estimators and the product rule for $\mathcal {D}$ -differentiation yields a product-rule for various gradient estimators.

Suggested Citation

  • B. Heidergott & F. J. Vázquez-Abad, 2008. "Measure-Valued Differentiation for Markov Chains," Journal of Optimization Theory and Applications, Springer, vol. 136(2), pages 187-209, February.
  • Handle: RePEc:spr:joptap:v:136:y:2008:i:2:d:10.1007_s10957-007-9297-7
    DOI: 10.1007/s10957-007-9297-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10957-007-9297-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10957-007-9297-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Paul Glasserman & David D. Yao, 1992. "Some Guidelines and Guarantees for Common Random Numbers," Management Science, INFORMS, vol. 38(6), pages 884-908, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kloeden Peter E. & Sanz-Chacón Carlos, 2011. "Efficient price sensitivity estimation of financial derivatives by weak derivatives," Monte Carlo Methods and Applications, De Gruyter, vol. 17(1), pages 47-75, January.
    2. Sandjai Bhulai & Taoying Farenhorst-Yuan & Bernd Heidergott & Dinard Laan, 2012. "Optimal balanced control for call centers," Annals of Operations Research, Springer, vol. 201(1), pages 39-62, December.
    3. Bernd Heidergott & Haralambie Leahu, 2010. "Weak Differentiability of Product Measures," Mathematics of Operations Research, INFORMS, vol. 35(1), pages 27-51, February.
    4. Bernd Heidergott & Taoying Farenhorst-Yuan, 2010. "Gradient Estimation for Multicomponent Maintenance Systems with Age-Replacement Policy," Operations Research, INFORMS, vol. 58(3), pages 706-718, June.
    5. Koch, Erwan & Robert, Christian Y., 2022. "Stochastic derivative estimation for max-stable random fields," European Journal of Operational Research, Elsevier, vol. 302(2), pages 575-588.
    6. Thomas Flynn & Felisa Vázquez-Abad, 2019. "A simultaneous perturbation weak derivative estimator for stochastic neural networks," Computational Management Science, Springer, vol. 16(4), pages 715-738, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michael C. Fu & Jian-Qiang Hu & Chun-Hung Chen & Xiaoping Xiong, 2007. "Simulation Allocation for Determining the Best Design in the Presence of Correlated Sampling," INFORMS Journal on Computing, INFORMS, vol. 19(1), pages 101-111, February.
    2. N. Hilber & N. Reich & C. Schwab & C. Winter, 2009. "Numerical methods for Lévy processes," Finance and Stochastics, Springer, vol. 13(4), pages 471-500, September.
    3. Nathan L. Kleinman & James C. Spall & Daniel Q. Naiman, 1999. "Simulation-Based Optimization with Stochastic Approximation Using Common Random Numbers," Management Science, INFORMS, vol. 45(11), pages 1570-1578, November.
    4. Boyle, Phelim & Broadie, Mark & Glasserman, Paul, 1997. "Monte Carlo methods for security pricing," Journal of Economic Dynamics and Control, Elsevier, vol. 21(8-9), pages 1267-1321, June.
    5. Dag Kolsrud, 2008. "Stochastic Ceteris Paribus Simulations," Computational Economics, Springer;Society for Computational Economics, vol. 31(1), pages 21-43, February.
    6. Guillaume Bernis & Emmanuel Gobet & Arturo Kohatsu‐Higa, 2003. "Monte Carlo Evaluation of Greeks for Multidimensional Barrier and Lookback Options," Mathematical Finance, Wiley Blackwell, vol. 13(1), pages 99-113, January.
    7. Phillip M LaCasse & Lance E Champagne & Jonathan M Escamilla, 2024. "Simulation analysis of applicant scheduling and processing alternatives at a military entrance processing station," The Journal of Defense Modeling and Simulation, , vol. 21(2), pages 229-243, April.
    8. Benhamou, Eric, 2000. "A generalisation of Malliavin weighted scheme for fast computation of the Greeks," LSE Research Online Documents on Economics 119105, London School of Economics and Political Science, LSE Library.
    9. Eric Benhamou, 2000. "A Generalisation of Malliavin Weighted Scheme for Fast Computation of the Greeks," FMG Discussion Papers dp350, Financial Markets Group.
    10. Shuowen Chen, 2022. "Indirect Inference for Nonlinear Panel Models with Fixed Effects," Papers 2203.10683, arXiv.org, revised Apr 2022.
    11. Davis, Mark H.A. & Johansson, Martin P., 2006. "Malliavin Monte Carlo Greeks for jump diffusions," Stochastic Processes and their Applications, Elsevier, vol. 116(1), pages 101-129, January.
    12. Sheldon H. Jacobson & Enver Yücesan, 1999. "On the Complexity of Verifying Structural Properties of Discrete Event Simulation Models," Operations Research, INFORMS, vol. 47(3), pages 476-481, June.
    13. Andrea Maria Zanchettin, 2022. "Robust scheduling and dispatching rules for high-mix collaborative manufacturing systems," Flexible Services and Manufacturing Journal, Springer, vol. 34(2), pages 293-316, June.
    14. Nowak, Maciek & Szufel, Przemysław, 2024. "Technician routing and scheduling for the sharing economy," European Journal of Operational Research, Elsevier, vol. 314(1), pages 15-31.
    15. Saif, Ahmed & Elhedhli, Samir, 2016. "Cold supply chain design with environmental considerations: A simulation-optimization approach," European Journal of Operational Research, Elsevier, vol. 251(1), pages 274-287.
    16. Marvin K. Nakayama & Perwez Shahabuddin, 1998. "Likelihood Ratio Derivative Estimation for Finite-Time Performance Measures in Generalized Semi-Markov Processes," Management Science, INFORMS, vol. 44(10), pages 1426-1441, October.
    17. Jiajie Kong & Robert Lund, 2023. "Seasonal count time series," Journal of Time Series Analysis, Wiley Blackwell, vol. 44(1), pages 93-124, January.
    18. Chih, Mingchang, 2023. "Stochastic stability analysis of particle swarm optimization with pseudo random number assignment strategy," European Journal of Operational Research, Elsevier, vol. 305(2), pages 562-593.
    19. Chaudhuri, Anirban & Kramer, Boris & Willcox, Karen E., 2020. "Information Reuse for Importance Sampling in Reliability-Based Design Optimization," Reliability Engineering and System Safety, Elsevier, vol. 201(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joptap:v:136:y:2008:i:2:d:10.1007_s10957-007-9297-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.