Reinforcement learning-based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction

My bibliography Save this article

Reinforcement learning-based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction

Author

Listed:

Bonneau, Mathieu
Gaba, Sabrina
Peyrard, Nathalie
Sabbadin, Régis

Registered:

Abstract

Weeds are responsible for yield losses in arable fields, whereas the role of weeds in agro-ecosystem food webs and in providing ecological services has been well established. Innovative weed management policies have to be designed to handle this trade-off between production and regulation services. As a consequence, there has been a growing interest in the study of the spatial distribution of weeds in crops, as a prerequisite to management. Such studies are usually based on maps of weed species. The issues involved in building probabilistic models of spatial processes as well as plausible maps of the process on the basis of models and observed data are frequently encountered and important. As important is the question of designing optimal sampling policies that make it possible to build maps of high probability when the model is known. This optimization problem is more complex to solve than the pure reconstruction problem and cannot generally be solved exactly. A generic approach to spatial sampling for optimizing map construction, based on Markov Random Fields (MRF), is provided and applied to the problem of weed sampling for mapping. MRF offer a powerful representation for reasoning on large sets of random variables in interaction. In the field of spatial statistics, the design of sampling policies has been largely studied in the case of continuous variables, using tools from the geostatistics domain. In the MRF case with finite state space variables, some heuristics have been proposed for the design problem but no universally accepted solution exists, particularly when considering adaptive policies as opposed to static ones. The problem of designing an adaptive sampling policy in an MRF can be formalized as an optimization problem. By combining tools from the fields of Artificial Intelligence (AI) and Computational Statistics, an original algorithm is then proposed for approximate resolution. This generic procedure, referred to as Least-Squares Dynamic Programming (LSDP), combines an approximation of the value of a sampling policy based on a linear regression, the construction of a batch of MRF realizations and a backwards induction algorithm. Based on an empirical comparison of the performance of LSDP with existing one-step-look-ahead sampling heuristics and solutions provided by classical AI algorithms, the following conclusions can be derived: (i) a naïve heuristic consisting of sampling sites where marginals are the most uncertain is already an efficient sampling approach; (ii) LSDP outperforms all the classical approaches we have tested; and (iii) LSDP outperforms the naïve heuristic approach in cases where sampling costs are not uniform over the set of variables or where sampling actions are constrained.

Suggested Citation

Bonneau, Mathieu & Gaba, Sabrina & Peyrard, Nathalie & Sabbadin, Régis, 2014. "Reinforcement learning-based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction," Computational Statistics & Data Analysis, Elsevier, vol. 72(C), pages 30-44.

Handle: RePEc:eee:csdana:v:72:y:2014:i:c:p:30-44
DOI: 10.1016/j.csda.2013.10.002

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Lafratta, Giovanni, 2006. "Efficiency evaluation of MEV spatial sampling strategies: a scenario analysis," Computational Statistics & Data Analysis, Elsevier, vol. 50(3), pages 878-890, February.
Martinelli, Gabriele & Eidsvik, Jo & Hauge, Ragnar, 2013. "Dynamic decision making for graphical models applied to oil exploration," European Journal of Operational Research, Elsevier, vol. 230(3), pages 688-702.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Sharma, Anjali & Singh, Param Vir & Musunur, Laxmi P., 2020. "Artificial Intelligence and Robotics for Reducing Waste in the Food Supply Chain: Systematic Literature Review, Theoretical Framework, and Research Agenda," OSF Preprints h3jgb, Center for Open Science.
Sharma, Anjali & Singh, Param Vir & Musunur, Laxmi P., 2020. "Artificial Intelligence and Robotics for Reducing Waste in the Food Supply Chain: Systematic Literature Review, Theoretical Framework, and Research Agenda," OSF Preprints h3jgb_v1, Center for Open Science.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Debarun Bhattacharjya & Jo Eidsvik & Tapan Mukerji, 2013. "The Value of Information in Portfolio Problems with Dependent Projects," Decision Analysis, INFORMS, vol. 10(4), pages 341-351, December.
Maarten J. Punt & Brooks A. Kaiser, 2021. "Seismic Shifts from Regulations: Spatial Trade-offs in Marine Mammals and the Value of Information from Hydrocarbon Seismic Surveying," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 80(3), pages 553-585, November.
- Maarten J. Punt & Brooks A. Kaiser, 2019. "Seismic shifts from regulations: Spatial trade-offs in marine mammals and the value of information from hydrocarbon seismic surveying," Working Papers 2/19, University of Southern Denmark, Department of Sociology, Environmental and Business Economics.
Hobæk Haff, Ingrid & Aas, Kjersti & Frigessi, Arnoldo & Lacal, Virginia, 2016. "Structure learning in Bayesian Networks using regular vines," Computational Statistics & Data Analysis, Elsevier, vol. 101(C), pages 186-208.
Finn Olesen, 1999. "Monetær integration i EU," Working Papers 2/99, University of Southern Denmark, Department of Sociology, Environmental and Business Economics.
Walde, Janette F., 2007. "Valid hypothesis testing in face of spatially dependent data using multi-layer perceptrons and sub-sampling techniques," Computational Statistics & Data Analysis, Elsevier, vol. 51(5), pages 2701-2719, February.
Arbia, Giuseppe & Lafratta, Giovanni & Simeoni, Carla, 2007. "Spatial sampling plans to monitor the 3-D spatial distribution of extremes in soil pollution surveys," Computational Statistics & Data Analysis, Elsevier, vol. 51(8), pages 4069-4082, May.
Naresh Kumar, 2007. "Spatial Sampling Design for a Demographic and Health Survey," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 26(5), pages 581-599, December.
Yu, Shiwei & Gao, Siwei & sun, Han, 2016. "A dynamic programming model for environmental investment decision-making in coal mining," Applied Energy, Elsevier, vol. 166(C), pages 273-281.

More about this item

Keywords

Sampling design; Markov decision process; Dynamic programming; Gibbs sampling; Least-squares linear regression; Weed mapping;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:72:y:2014:i:c:p:30-44. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Reinforcement learning-based design of sampling policies under cost constraints in Markov random fields: Application to weed map reconstruction

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data