Approximate dynamic programming via direct search in the space of value function approximations
Author
Abstract
Suggested Citation
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Sox, Charles R. & Jackson, Peter L. & Bowman, Alan & Muckstadt, John A., 1999. "A review of the stochastic lot scheduling problem," International Journal of Production Economics, Elsevier, vol. 62(3), pages 181-200, September.
- Benjamin Van Roy, 2006. "Performance Loss Bounds for Approximate Value Iteration with State Aggregation," Mathematics of Operations Research, INFORMS, vol. 31(2), pages 234-244, May.
- Ishai Menache & Shie Mannor & Nahum Shimkin, 2005. "Basis Function Adaptation in Temporal Difference Reinforcement Learning," Annals of Operations Research, Springer, vol. 134(1), pages 215-238, February.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Arruda, E.F. & Fragoso, M.D., 2015. "Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm," European Journal of Operational Research, Elsevier, vol. 240(3), pages 697-705.
- Arruda, Edilson F. & Ourique, Fabrício O. & LaCombe, Jason & Almudevar, Anthony, 2013. "Accelerating the convergence of value iteration by using partial transition functions," European Journal of Operational Research, Elsevier, vol. 229(1), pages 190-198.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Beck, Fabian G. & Biel, Konstantin & Glock, Christoph H., 2019. "Integration of energy aspects into the economic lot scheduling problem," International Journal of Production Economics, Elsevier, vol. 209(C), pages 399-410.
- Löhndorf, Nils & Riel, Manuel & Minner, Stefan, 2014. "Simulation optimization for the stochastic economic lot scheduling problem with sequence-dependent setup times," International Journal of Production Economics, Elsevier, vol. 157(C), pages 170-176.
- Tiacci, Lorenzo & Saetta, Stefano, 2012. "Demand forecasting, lot sizing and scheduling on a rolling horizon basis," International Journal of Production Economics, Elsevier, vol. 140(2), pages 803-814.
- Chevalier, Philippe & Lamas, Alejandro & Lu, Liang & Mlinar, Tanja, 2015.
"Revenue management for operations with urgent orders,"
European Journal of Operational Research, Elsevier, vol. 240(2), pages 476-487.
- LAMAS, Alejandro & MLINAR, Tanja & CHEVALIER, Philippe, 2013. "Revenue management for operations with urgent orders," LIDAM Discussion Papers CORE 2013046, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- CHEVALIER, Philippe & LAMAS, Alejandro & LU, Lian & MLINAR, Tanja, 2015. "Revenue management for operations with urgent orders," LIDAM Reprints CORE 2628, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Rokhforoz, Pegah & Montazeri, Mina & Fink, Olga, 2023. "Safe multi-agent deep reinforcement learning for joint bidding and maintenance scheduling of generation units," Reliability Engineering and System Safety, Elsevier, vol. 232(C).
- Kerkkanen, Annastiina, 2007. "Determining semi-finished products to be stocked when changing the MTS-MTO policy: Case of a steel mill," International Journal of Production Economics, Elsevier, vol. 108(1-2), pages 111-118, July.
- Prasenjit Karmakar & Shalabh Bhatnagar, 2018. "Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning," Mathematics of Operations Research, INFORMS, vol. 43(1), pages 130-151, February.
- Beemsterboer, Bart & Land, Martin & Teunter, Ruud, 2016. "Hybrid MTO-MTS production planning: An explorative study," European Journal of Operational Research, Elsevier, vol. 248(2), pages 453-461.
- Manuel Castejón-Limas & Joaquín Ordieres-Meré & Ana González-Marcos & Víctor González-Castro, 2011. "Effort estimates through project complexity," Annals of Operations Research, Springer, vol. 186(1), pages 395-406, June.
- Azaron, Amir & Tang, Ou & Tavakkoli-Moghaddam, Reza, 2009. "Dynamic lot sizing problem with continuous-time Markovian production cost," International Journal of Production Economics, Elsevier, vol. 120(2), pages 607-612, August.
- Soman, Chetan Anil & Pieter van Donk, Dirk & Gaalman, Gerard, 2006. "Comparison of dynamic scheduling policies for hybrid make-to-order and make-to-stock production systems with stochastic demand," International Journal of Production Economics, Elsevier, vol. 104(2), pages 441-453, December.
- Brander, Par & Forsberg, Rolf, 2006. "Determination of safety stocks for cyclic schedules with stochastic demands," International Journal of Production Economics, Elsevier, vol. 104(2), pages 271-295, December.
- Lopez de Haro, Santiago & Gershwin, Stanley B. & Rosenfield, Donald B., 2009. "Schedule evaluation in unstable manufacturing environments," International Journal of Production Economics, Elsevier, vol. 121(1), pages 183-194, September.
- Garn, Wolfgang & Aitken, James, 2015. "Agile factorial production for a single manufacturing line with multiple products," European Journal of Operational Research, Elsevier, vol. 245(3), pages 754-766.
- Serge M. Karalli & A. Dale Flowers, 2006. "The Multiple-Family ELSP with Safety Stocks," Operations Research, INFORMS, vol. 54(3), pages 523-531, June.
- Vaughan, Timothy S., 2007. "Cyclical schedules vs. dynamic sequencing: Replenishment dynamics and inventory efficiency," International Journal of Production Economics, Elsevier, vol. 107(2), pages 518-527, June.
- George Liberopoulos & Dimitrios Pandelis & Olympia Hatzikonstantinou, 2013. "The stochastic economic lot sizing problem for non-stop multi-grade production with sequence-restricted setup changeovers," Annals of Operations Research, Springer, vol. 209(1), pages 179-205, October.
- Beraldi, Patrizia & Ghiani, Gianpaolo & Guerriero, Emanuela & Grieco, Antonio, 2006. "Scenario-based planning for lot-sizing and scheduling with uncertain processing times," International Journal of Production Economics, Elsevier, vol. 101(1), pages 140-149, May.
- Dimitri P. Bertsekas & Huizhen Yu, 2012. "Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming," Mathematics of Operations Research, INFORMS, vol. 37(1), pages 66-94, February.
- Ashayeri, J. & Heuts, R.J.M. & Lansdaal, H.G.L. & Strijbosch, L.W.G., 2006. "Cyclic production-inventory planning and control in the pre-Deco industry: A case study," International Journal of Production Economics, Elsevier, vol. 103(2), pages 715-725, October.
More about this item
Keywords
Dynamic programming Markov decision processes Convex optimization Direct search methods;Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:211:y:2011:i:2:p:343-351. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.