Cross validation for the classical model of structured expert judgment

My bibliography Save this article

Cross validation for the classical model of structured expert judgment

Author

Listed:

Colson, Abigail R.
Cooke, Roger M.

Registered:

Roger M. Cooke

Abstract

We update the 2008 TU Delft structured expert judgment database with data from 33 professionally contracted Classical Model studies conducted between 2006 and March 2015 to evaluate its performance relative to other expert aggregation models. We briefly review alternative mathematical aggregation schemes, including harmonic weighting, before focusing on linear pooling of expert judgments with equal weights and performance-based weights. Performance weighting outperforms equal weighting in all but 1 of the 33 studies in-sample. True out-of-sample validation is rarely possible for Classical Model studies, and cross validation techniques that split calibration questions into a training and test set are used instead. Performance weighting incurs an â€œout-of-sample penaltyâ€ and its statistical accuracy out-of-sample is lower than that of equal weighting. However, as a function of training set size, the statistical accuracy of performance-based combinations reaches 75% of the equal weight value when the training set includes 80% of calibration variables. At this point the training set is sufficiently powerful to resolve differences in individual expert performance. The information of performance-based combinations is double that of equal weighting when the training set is at least 50% of the set of calibration variables. Previous out-of-sample validation work used a Total Out-of-Sample Validity Index based on all splits of the calibration questions into training and test subsets, which is expensive to compute and includes small training sets of dubious value. As an alternative, we propose an Out-of-Sample Validity Index based on averaging the product of statistical accuracy and information over all training sets sized at 80% of the calibration set. Performance weighting outperforms equal weighting on this Out-of-Sample Validity Index in 26 of the 33 post-2006 studies; the probability of 26 or more successes on 33 trials if there were no difference between performance weighting and equal weighting is 0.001.

Suggested Citation

Colson, Abigail R. & Cooke, Roger M., 2017. "Cross validation for the classical model of structured expert judgment," Reliability Engineering and System Safety, Elsevier, vol. 163(C), pages 109-120.

Handle: RePEc:eee:reensy:v:163:y:2017:i:c:p:109-120
DOI: 10.1016/j.ress.2017.02.003

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Kenneth Gillingham & William D. Nordhaus & David Anthoff & Geoffrey Blanford & Valentina Bosetti & Peter Christensen & Haewon McJeon & John Reilly & Paul Sztorc, 2015. "Modeling Uncertainty in Climate Change: A Multi-Model Comparison," NBER Working Papers 21637, National Bureau of Economic Research, Inc.
- Gillingham, Kenneth & Nordhaus, William & Anthoff, David & Blanford, Geoffrey & Bosetti, Valentina & Christensen, Peter & McJeon, Haewon & Reilly, John & Sztorc, Paul, 2016. "Modeling Uncertainty in Climate Change: A Multi-Model Comparison," Conference papers 332720, Purdue University, Center for Global Trade Analysis, Global Trade Analysis Project.
- Kenneth Gillingham & William Nordhaus & David Anthoff & Geoffrey Blanford & Valentina Bosetti & Peter Christensen & Haewon McJeon & John Reilly & Paul Sztorc, 2015. "Modeling Uncertainty in Climate Change: A Multi-Model Comparison," CESifo Working Paper Series 5538, CESifo.
- Gillingham, Kenneth & Nordhaus, William & Anthoff, David & Bosetti, Valentina & McJeon, Haewon & Blanford, Geoffrey & Christensen, Peter & Reilly, John & Sztorc, Paul, 2016. "Modeling Uncertainty in Climate Change: A Multi-Model Comparison," MITP: Mitigation, Innovation and Transformation Pathways 232219, Fondazione Eni Enrico Mattei (FEEM).
- Kenneth Gillingham & William Nordhaus & David Anthoff & Valentina Bosetti & Haewon McJeon & Geoffrey Blanford & Peter Christensen & John Reilly & Paul Sztorc, 2016. "Modeling Uncertainty in Climate Change: A Multi-Model Comparison," Working Papers 2016.13, Fondazione Eni Enrico Mattei.
- Kenneth Gillingham & William D. Nordhaus & David Anthoff & Geoffrey Blanford & Valentina Bosetti & Peter Christensen & Haewan McJeon & John Reilly & Paul Sztorc, 2015. "Modeling Uncertainty in Climate Change: A Multi-Model Comparison," Cowles Foundation Discussion Papers 2022, Cowles Foundation for Research in Economics, Yale University.
JL Bamber & WP Aspinall & RM Cooke, 2016. "A commentary on “how to interpret expert judgment assessments of twenty-first century sea-level rise” by Hylke de Vries and Roderik SW van de Wal," Climatic Change, Springer, vol. 137(3), pages 321-328, August.
Flandoli, F. & Giorgi, E. & Aspinall, W.P. & Neri, A., 2011. "Comparison of a new expert elicitation model with the Classical Model, equal weights and single experts, using a cross-validation technique," Reliability Engineering and System Safety, Elsevier, vol. 96(10), pages 1292-1310.
Willy Aspinall, 2010. "A route to more tractable expert advice," Nature, Nature, vol. 463(7279), pages 294-295, January.
Eggstaff, Justin W. & Mazzuchi, Thomas A. & Sarkani, Shahram, 2014. "The effect of the number of seed variables on the performance of Cookeâ€²s classical model," Reliability Engineering and System Safety, Elsevier, vol. 121(C), pages 72-82.
W P Aspinall & R M Cooke & A H Havelaar & S Hoffmann & T Hald, 2016. "Evaluation of a Performance-Based Expert Elicitation: WHO Global Attribution of Foodborne Diseases," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-14, March.
Cooke, Roger M. & Goossens, Louis L.H.J., 2008. "TU Delft expert judgment data base," Reliability Engineering and System Safety, Elsevier, vol. 93(5), pages 657-674.
Kenneth C. Lichtendahl & Yael Grushka-Cockayne & Robert L. Winkler, 2013. "Is It Better to Average Probabilities or Quantiles?," Management Science, INFORMS, vol. 59(7), pages 1594-1611, July.
Roger M. Cooke, 2015. "Messaging climate change uncertainty," Nature Climate Change, Nature, vol. 5(1), pages 8-10, January.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Hoffmann, Sandra & Ashton, Lydia & Todd, Jessica E. & Ahn, Jae-Wan & Berck, Peter, 2021. "Attributing U.S. Campylobacteriosis Cases to Food Sources, Season, and Temperature," USDA Miscellaneous 309617, United States Department of Agriculture.
- Hoffman, Sandra & Ashton, Lydia & Todd, Jessica E & Ahn, Jae-Wan & Berck, Peter, 2021. "Attributing U.S. Campylobacteriosis Cases to Food Sources, Season, and Temperature," Economic Research Report 327200, United States Department of Agriculture, Economic Research Service.
- Hoffmann, Sandra & Ashton, Lydia & Todd, Jessica E. & Ahn, Jae-wan & Berck, Peter, 2021. "Attributing U.S. Campylobacteriosis Cases to Food Sources, Season, and Temperature," USDA Miscellaneous 309620, United States Department of Agriculture.
Kevin Rennert & Brian C. Prest & William A. Pizer & Richard G. Newell & David Anthoff & Cora Kingdon & Lisa Rennels & Roger Cooke & Adrian E. Raftery & Hana Sevcikova & Frank Errickson, 2021. "The Social Cost of Carbon: Advances in Long-Term Probabilistic Projections of Population, GDP, Emissions, and Discount Rates," Brookings Papers on Economic Activity, Economic Studies Program, The Brookings Institution, vol. 52(2 (Fall)), pages 223-305.
- Rennert, Kevin & Prest, Brian C. & Pizer, William & Newell, Richard G. & Anthoff, David & Kingdon, Cora & Rennels, Lisa & Cooke, Roger & Raftery, Adrian E. & Ševčíková, Hana & Errickson, Frank, 2021. "The Social Cost of Carbon: Advances in Long-Term Probabilistic Projections of Population, GDP, Emissions, and Discount Rates," RFF Working Paper Series 21-28, Resources for the Future.
Patrick Afflerbach & Christopher Dun & Henner Gimpel & Dominik Parak & Johannes Seyfried, 2021. "A Simulation-Based Approach to Understanding the Wisdom of Crowds Phenomenon in Aggregating Expert Judgment," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 63(4), pages 329-348, August.
Abigail R Colson & Roger M Cooke, 2018. "Expert Elicitation: Using the Classical Model to Validate Experts’ Judgments," Review of Environmental Economics and Policy, Association of Environmental and Resource Economists, vol. 12(1), pages 113-132.
Jeremy Rohmer & Eric Chojnacki, 2021. "Forecast of environment systems using expert judgements: performance comparison between the possibilistic and the classical model," Environment Systems and Decisions, Springer, vol. 41(1), pages 131-146, March.
Timothy McDaniels, 2021. "Four Decades of Transformation in Decision Analytic Practice for Societal Risk Management," Risk Analysis, John Wiley & Sons, vol. 41(3), pages 491-502, March.
Mohammad Yazdi, 2019. "A review paper to examine the validity of Bayesian network to build rational consensus in subjective probabilistic failure analysis," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 10(1), pages 1-18, February.
Despoina Makariou & Pauline Barrieu & George Tzougas, 2021. "A Finite Mixture Modelling Perspective for Combining Experts’ Opinions with an Application to Quantile-Based Risk Measures," Risks, MDPI, vol. 9(6), pages 1-25, June.
Ren, Xin & Nane, Gabriela F. & Terwel, Karel C. & van Gelder, Pieter H.A.J.M., 2024. "Measuring the impacts of human and organizational factors on human errors in the Dutch construction industry using structured expert judgement," Reliability Engineering and System Safety, Elsevier, vol. 244(C).
Mario P. Brito & Ian G. J. Dawson, 2020. "Predicting the Validity of Expert Judgments in Assessing the Impact of Risk Mitigation Through Failure Prevention and Correction," Risk Analysis, John Wiley & Sons, vol. 40(10), pages 1928-1943, October.
Rongen, G. & Morales-NÃ¡poles, O. & Kok, M., 2022. "Expert judgment-based reliability analysis of the Dutch flood defense system," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
Cooke, Roger M. & Marti, Deniz & Mazzuchi, Thomas, 2021. "Expert forecasting with and without uncertainty quantification and weighting: What do the data say?," International Journal of Forecasting, Elsevier, vol. 37(1), pages 378-387.
Fathy, Mohammad & Kazemzadeh Haghighi, Foojan & Ahmadi, Mohammad, 2024. "Uncertainty quantification of reservoir performance using machine learning algorithms and structured expert judgment," Energy, Elsevier, vol. 288(C).
Funk, Patrick & Davis, Alex & Vaishnav, Parth & Dewitt, Barry & Fuchs, Erica, 2020. "Individual inconsistency and aggregate rationality: Overcoming inconsistencies in expert judgment at the technical frontier," Technological Forecasting and Social Change, Elsevier, vol. 155(C).
Nogal, Maria & Morales Nápoles, Oswaldo & O’Connor, Alan, 2019. "Structured expert judgement to understand the intrinsic vulnerability of traffic networks," Transportation Research Part A: Policy and Practice, Elsevier, vol. 127(C), pages 136-152.
Hathout, Michel & Vuillet, Marc & Carvajal, Claudio & Peyras, Laurent & Diab, Youssef, 2019. "Expert judgments calibration and combination for assessment of river levee failure probability," Reliability Engineering and System Safety, Elsevier, vol. 188(C), pages 377-392.
Abigail R Colson & Itamar Megiddo & Gerardo Alvarez-Uria & Sumanth Gandra & Tim Bedford & Alec Morton & Roger M Cooke & Ramanan Laxminarayan, 2019. "Quantifying uncertainty about future antimicrobial resistance: Comparing structured expert judgment and statistical forecasting methods," PLOS ONE, Public Library of Science, vol. 14(7), pages 1-18, July.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Despoina Makariou & Pauline Barrieu & George Tzougas, 2021. "A Finite Mixture Modelling Perspective for Combining Experts’ Opinions with an Application to Quantile-Based Risk Measures," Risks, MDPI, vol. 9(6), pages 1-25, June.
Abigail R Colson & Itamar Megiddo & Gerardo Alvarez-Uria & Sumanth Gandra & Tim Bedford & Alec Morton & Roger M Cooke & Ramanan Laxminarayan, 2019. "Quantifying uncertainty about future antimicrobial resistance: Comparing structured expert judgment and statistical forecasting methods," PLOS ONE, Public Library of Science, vol. 14(7), pages 1-18, July.
Bolger, Donnacha & Houlding, Brett, 2017. "Deriving the probability of a linear opinion pooling method being superior to a set of alternatives," Reliability Engineering and System Safety, Elsevier, vol. 158(C), pages 41-49.
Cooke, Roger M. & Marti, Deniz & Mazzuchi, Thomas, 2021. "Expert forecasting with and without uncertainty quantification and weighting: What do the data say?," International Journal of Forecasting, Elsevier, vol. 37(1), pages 378-387.
Alexander M. R. Bakker & Domitille Louchard & Klaus Keller, 2017. "Sources and implications of deep uncertainties surrounding sea-level projections," Climatic Change, Springer, vol. 140(3), pages 339-347, February.
JL Bamber & WP Aspinall & RM Cooke, 2016. "A commentary on “how to interpret expert judgment assessments of twenty-first century sea-level rise” by Hylke de Vries and Roderik SW van de Wal," Climatic Change, Springer, vol. 137(3), pages 321-328, August.
Makariou, Despoina & Barrieu, Pauline & Tzougas, George, 2021. "A finite mixture modelling perspective for combining experts’ opinions with an application to quantile-based risk measures," LSE Research Online Documents on Economics 110763, London School of Economics and Political Science, LSE Library.
Abigail R Colson & Roger M Cooke, 2018. "Expert Elicitation: Using the Classical Model to Validate Experts’ Judgments," Review of Environmental Economics and Policy, Association of Environmental and Resource Economists, vol. 12(1), pages 113-132.
Hanea, A.M. & McBride, M.F. & Burgman, M.A. & Wintle, B.C. & Fidler, F. & Flander, L. & Twardy, C.R. & Manning, B. & Mascaro, S., 2017. "I nvestigate D iscuss E stimate A ggregate for structured expert judgement," International Journal of Forecasting, Elsevier, vol. 33(1), pages 267-279.
Laura Diaz Anadon & Erin Baker & Valentina Bosetti & Lara Aleluia Reis, 2016. "Expert views - and disagreements - about the potential of energy technology R&D," Climatic Change, Springer, vol. 136(3), pages 677-691, June.
Anca M. Hanea & Marissa F. McBride & Mark A. Burgman & Bonnie C. Wintle, 2018. "The Value of Performance Weights and Discussion in Aggregated Expert Judgments," Risk Analysis, John Wiley & Sons, vol. 38(9), pages 1781-1794, September.
Elena Verdolini & Laura Díaz Anadón & Erin Baker & Valentina Bosetti & Lara Aleluia Reis, 2018. "Future Prospects for Energy Technologies: Insights from Expert Elicitations," Review of Environmental Economics and Policy, Association of Environmental and Resource Economists, vol. 12(1), pages 133-153.
- Elena Verdolini & Laura Diaz Anadón & Erin Baker & Valentina Bosetti & Lara Aleluia Reis, 2016. "The Future Prospects of Energy Technologies: Insights from Expert Elicitations," Working Papers 2016.47, Fondazione Eni Enrico Mattei.
- Verdolini, Elena & Anadón, Laura Diaz & Baker, Erin & Bosetti, Valentina & Reis, Lara Aleluia, 2016. "The Future Prospects of Energy Technologies: Insights from Expert Elicitations," MITP: Mitigation, Innovation and Transformation Pathways 243148, Fondazione Eni Enrico Mattei (FEEM).
Bolger, Fergus & Wright, George, 2017. "Use of expert knowledge to anticipate the future: Issues, analysis and directions," International Journal of Forecasting, Elsevier, vol. 33(1), pages 230-243.
Alvarado-Valencia, Jorge & Barrero, Lope H. & Önkal, Dilek & Dennerlein, Jack T., 2017. "Expertise, credibility of system forecasts and integration methods in judgmental demand forecasting," International Journal of Forecasting, Elsevier, vol. 33(1), pages 298-313.
Flandoli, F. & Giorgi, E. & Aspinall, W.P. & Neri, A., 2011. "Comparison of a new expert elicitation model with the Classical Model, equal weights and single experts, using a cross-validation technique," Reliability Engineering and System Safety, Elsevier, vol. 96(10), pages 1292-1310.
Anil Gaba & Ilia Tsetlin & Robert L. Winkler, 2017. "Combining Interval Forecasts," Decision Analysis, INFORMS, vol. 14(1), pages 1-20, March.
Rongen, G. & Morales-NÃ¡poles, O. & Kok, M., 2022. "Expert judgment-based reliability analysis of the Dutch flood defense system," Reliability Engineering and System Safety, Elsevier, vol. 224(C).
Eggstaff, Justin W. & Mazzuchi, Thomas A. & Sarkani, Shahram, 2014. "The effect of the number of seed variables on the performance of Cookeâ€²s classical model," Reliability Engineering and System Safety, Elsevier, vol. 121(C), pages 72-82.
James K. Hammitt & Yifan Zhang, 2013. "Combining Experts’ Judgments: Comparison of Algorithmic Methods Using Synthetic Data," Risk Analysis, John Wiley & Sons, vol. 33(1), pages 109-120, January.
- Hammitt, James K. & Zhang, Yifan, 2012. "Combining Experts’ Judgments: Comparison of Algorithmic Methods using Synthetic Data," TSE Working Papers 12-293, Toulouse School of Economics (TSE).
Cooke, Roger M., 2014. "Deep and Shallow Uncertainty in Messaging Climate Change," RFF Working Paper Series dp-14-11, Resources for the Future.

More about this item

Keywords

Expert judgment; Calibration; Information; Classical model; Out-of-sample validation;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:reensy:v:163:y:2017:i:c:p:109-120. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/reliability-engineering-and-system-safety .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Cross validation for the classical model of structured expert judgment

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data