IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1911.00688.html

Model Specification Test with Unlabeled Data: Approach from Covariate Shift

Author

Listed:
  • Masahiro Kato
  • Hikaru Kawarazaki

Abstract

We propose a novel framework for model specification testing in regression using unlabeled test data. Statistical inference often rests on the assumption that the model is correctly specified, yet this assumption is difficult to confirm. To address this problem, existing works have devised statistical tests for model specification. These works define a correctly specified regression model as one whose error term has zero conditional mean over the training data only. Extending this conventional definition, we define a correctly specified model as one whose error term has zero conditional mean over any distribution of the explanatory variable. This definition is a natural consequence of the orthogonality of the explanatory variable and the error term. A model that does not satisfy this condition might lack robustness to distribution shift. The proposed method enables us to reject a model that is misspecified under our definition. By applying the proposed method, we can obtain a model that predicts the labels of the unlabeled test data well without losing interpretability. In experiments, we show how the proposed method works on synthetic and real-world datasets.
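
The distinction the abstract draws between zero-mean errors on the training distribution and zero-mean errors under any covariate distribution can be illustrated with a small simulation. The sketch below is not the authors' test: it assumes a synthetic quadratic data-generating process, a hypothetical mean shift in the explanatory variable, and known labels on the shifted sample (which the paper's setting does not have), purely to show why a misspecified model can look fine on training data and still fail under covariate shift. All function and variable names are illustrative.

```python
import numpy as np

def design(x, degree):
    """Polynomial design matrix with columns 1, x, ..., x**degree."""
    return np.column_stack([x**d for d in range(degree + 1)])

def fit(x, y, degree):
    """Ordinary least squares fit of a polynomial of the given degree."""
    beta, *_ = np.linalg.lstsq(design(x, degree), y, rcond=None)
    return beta

def mean_residual(beta, x, y):
    """Average error term y - f(x) over a sample."""
    return float(np.mean(y - design(x, len(beta) - 1) @ beta))

rng = np.random.default_rng(0)
n = 20_000

def true_outcome(x):
    # Assumed data-generating process: quadratic signal plus small noise.
    return x**2 + rng.normal(0.0, 0.1, size=x.shape)

# Training covariates centered at 0; shifted covariates centered at 2.
x_train = rng.normal(0.0, 1.0, n)
x_shift = rng.normal(2.0, 1.0, n)
y_train = true_outcome(x_train)
y_shift = true_outcome(x_shift)  # labels known here only because data are synthetic

linear = fit(x_train, y_train, degree=1)     # misspecified model
quadratic = fit(x_train, y_train, degree=2)  # correctly specified model

linear_train = mean_residual(linear, x_train, y_train)
linear_shift = mean_residual(linear, x_shift, y_shift)
quadratic_shift = mean_residual(quadratic, x_shift, y_shift)

print(f"linear, training distribution:   {linear_train:+.4f}")
print(f"linear, shifted distribution:    {linear_shift:+.4f}")
print(f"quadratic, shifted distribution: {quadratic_shift:+.4f}")
```

Because the linear fit includes an intercept, its average residual on the training sample is zero by construction, so a check restricted to the training distribution cannot flag it on that basis. Under the shifted covariates the same model's average error is far from zero, while the quadratic model's stays close to zero.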

Suggested Citation

  • Masahiro Kato & Hikaru Kawarazaki, 2019. "Model Specification Test with Unlabeled Data: Approach from Covariate Shift," Papers 1911.00688, arXiv.org, revised Feb 2020.
  • Handle: RePEc:arx:papers:1911.00688

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1911.00688
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Wooldridge, Jeffrey M., 1990. "An encompassing approach to conditional mean tests with applications to testing nonnested hypotheses," Journal of Econometrics, Elsevier, vol. 45(3), pages 331-350.
    2. Pesaran, M H & Deaton, Angus S, 1978. "Testing Non-Nested Nonlinear Regression Models," Econometrica, Econometric Society, vol. 46(3), pages 677-694, May.
    3. Davidson, Russell & MacKinnon, James G, 1981. "Several Tests for Model Specification in the Presence of Alternative Hypotheses," Econometrica, Econometric Society, vol. 49(3), pages 781-793, May.
    4. Smith, Richard J, 1992. "Non-nested Tests for Competing Models Estimated by Generalized Method of Moments," Econometrica, Econometric Society, vol. 60(4), pages 973-980, July.
    5. Masashi Sugiyama & Taiji Suzuki & Shinichi Nakajima & Hisashi Kashima & Paul Bünau & Motoaki Kawanabe, 2008. "Direct importance estimation for covariate shift adaptation," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 60(4), pages 699-746, December.
    6. Junpei Komiyama & Hajime Shimao, 2018. "Cross Validation Based Model Selection via Generalized Method of Moments," Papers 1807.06993, arXiv.org.
    7. Vuong, Quang H, 1989. "Likelihood Ratio Tests for Model Selection and Non-nested Hypotheses," Econometrica, Econometric Society, vol. 57(2), pages 307-333, March.
    8. Hausman, Jerry, 2015. "Specification tests in econometrics," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 38(2), pages 112-134.
    9. Godfrey, L G, 1984. "On the Uses of Misspecification Checks and Tests of Non-Nested Hypotheses in Empirical Econometrics," Economic Journal, Royal Economic Society, vol. 94(376a), pages 69-81, Supplement.
    10. Susan Athey & Guido Imbens, 2015. "A Measure of Robustness to Misspecification," American Economic Review, American Economic Association, vol. 105(5), pages 476-480, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Otsu, Taisuke & Whang, Yoon-Jae, 2011. "Testing For Nonnested Conditional Moment Restrictions Via Conditional Empirical Likelihood," Econometric Theory, Cambridge University Press, vol. 27(1), pages 114-153, February.
    2. Julia Campos & Neil R. Ericsson & David F. Hendry, 2005. "General-to-specific modeling: an overview and selected bibliography," International Finance Discussion Papers 838, Board of Governors of the Federal Reserve System (U.S.).
    3. Luc Anselin, 1988. "Model Validation in Spatial Econometrics: A Review and Evaluation of Alternative Approaches," International Regional Science Review, vol. 11(3), pages 279-316, December.
    4. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 29-46, January.
    5. Gordon Fisher & Michael McAleer, 1980. "Principles and Methods in the Testing of Alternative Models," Working Paper 400, Economics Department, Queen's University.
    6. McAleer, Michael, 1995. "The significance of testing empirical non-nested models," Journal of Econometrics, Elsevier, vol. 67(1), pages 149-171, May.
    7. Adrian C. Darnell, 1994. "A Dictionary Of Econometrics," Books, Edward Elgar Publishing, number 118.
    8. Chen, Yi-Ting & Kuan, Chung-Ming, 2002. "The pseudo-true score encompassing test for non-nested hypotheses," Journal of Econometrics, Elsevier, vol. 106(2), pages 271-295, February.
    9. Hafiz Akhand, 1998. "On income tax functions: an application of robust, regression-based diagnostics to models of conditional means," Applied Economics Letters, Taylor & Francis Journals, vol. 5(5), pages 317-320.
    10. Li, Tong, 2009. "Simulation based selection of competing structural econometric models," Journal of Econometrics, Elsevier, vol. 148(2), pages 114-123, February.
    11. Mizon, Grayham E & Richard, Jean-Francois, 1986. "The Encompassing Principle and Its Application to Testing Non-nested Hypotheses," Econometrica, Econometric Society, vol. 54(3), pages 657-678, May.
    12. Matthew Backus & Christopher Conlon & Michael Sinkinson, 2021. "Common Ownership and Competition in the Ready-to-Eat Cereal Industry," NBER Working Papers 28350, National Bureau of Economic Research, Inc.
    13. D. R. Cox, 2013. "A return to an old paper: ‘Tests of separate families of hypotheses’," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(2), pages 207-215, March.
    14. Bonnet, Céline, 2007. "Économétrie de la concurrence entre produits différenciés : théorie et méthodes empiriques," L'Actualité Economique, Société Canadienne de Science Economique, vol. 83(4), pages 555-580, décembre.
    15. McAleer, Michael, 1994. "Sherlock Holmes and the Search for Truth: A Diagnostic Tale," Journal of Economic Surveys, Wiley Blackwell, vol. 8(4), pages 317-370, December.
    16. Hnatkovska, Viktoria & Marmer, Vadim & Tang, Yao, 2009. "Supplement to 'Comparison of Misspecified Calibrated Models'," Microeconomics.ca working papers vadim_marmer-2009-58, Vancouver School of Economics, revised 03 Feb 2011.
    17. Christophe Bontemps & Grayham E. Mizon, 2008. "Encompassing: Concepts and Implementation," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 70(s1), pages 721-750, December.
    18. Kuan, Chung-Ming & Lin, Hsin-Yi, 2010. "An encompassing test for non-nested quantile regression models," Economics Letters, Elsevier, vol. 107(2), pages 257-260, May.
    19. Marmer, Vadim & Otsu, Taisuke, 2012. "Optimal comparison of misspecified moment restriction models under a chosen measure of fit," Journal of Econometrics, Elsevier, vol. 170(2), pages 538-550.
    20. MacKinnon, James G, 1992. "Model Specification Tests and Artificial Regressions," Journal of Economic Literature, American Economic Association, vol. 30(1), pages 102-146, March.
