IDEAS home Printed from https://ideas.repec.org/a/spr/astaws/v10y2016i2d10.1007_s11943-016-0190-4.html
   My bibliography  Save this article

The use of Twitter data to improve small area estimates of households’ share of food consumption expenditure in Italy
[Die Nutzung von Twitter Daten um die Small Area Schätzungen vom Ausgabenanteil für Lebensmittel von Haushalten in Italien zu verbessern]

Author

Listed:
  • Stefano Marchetti

    (University of Pisa)

  • Caterina Giusti

    (University of Pisa)

  • Monica Pratesi

    (University of Pisa)

Abstract

The use of big data in many socio-economic studies has received a growing interest in the last few years. In this work we use emotional data coming from Twitter as auxiliary variable in a small area model to estimate Italian households’ share of food consumption expenditure (the proportion of food consumption expenditure on the total consumption expenditure) at provincial level. We show that the use of Twitter data has a potential in predicting our target variable. Moreover, the use of these data as auxiliary variable in the small area working model reduces the estimated mean squared error in comparison with what obtained by the same working model without the Twitter data.

Suggested Citation

  • Stefano Marchetti & Caterina Giusti & Monica Pratesi, 2016. "The use of Twitter data to improve small area estimates of households’ share of food consumption expenditure in Italy [Die Nutzung von Twitter Daten um die Small Area Schätzungen vom Ausgabenanteil," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 10(2), pages 79-93, October.
  • Handle: RePEc:spr:astaws:v:10:y:2016:i:2:d:10.1007_s11943-016-0190-4
    DOI: 10.1007/s11943-016-0190-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11943-016-0190-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11943-016-0190-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gauri Sankar Datta & J. N. K. Rao & David Daniel Smith, 2005. "On measuring the variability of small area estimators under a basic area level model," Biometrika, Biometrika Trust, vol. 92(1), pages 183-196, March.
    2. Bruce D. Meyer & James X. Sullivan, 2003. "Measuring the Well-Being of the Poor Using Income and Consumption," NBER Working Papers 9760, National Bureau of Economic Research, Inc.
    3. Barigozzi, Matteo & Alessi, Lucia & Capasso, Marco & Fagiolo, Giorgio, 2012. "The distribution of household consumption-expenditure budget shares," Structural Change and Economic Dynamics, Elsevier, vol. 23(1), pages 69-91.
    4. Ray Chambers & Nikos Tzavidis, 2006. "M-quantile models for small area estimation," Biometrika, Biometrika Trust, vol. 93(2), pages 255-268, June.
    5. Luigi Curini & Stefano Iacus & Luciano Canova, 2015. "Measuring Idiosyncratic Happiness Through the Analysis of Twitter: An Application to the Italian Case," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 121(2), pages 525-542, April.
    6. Datta, Gauri S. & Hall, Peter & Mandal, Abhyuday, 2011. "Model Selection by Testing for the Presence of Small-Area Effects, and Application to Area-Level Data," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 362-374.
    7. Lynn M. R. Ybarra & Sharon L. Lohr, 2008. "Small area estimation when auxiliary information is measured with error," Biometrika, Biometrika Trust, vol. 95(4), pages 919-931.
    8. Angus Deaton, 2003. "Household Surveys, Consumption, and the Measurement of Poverty," Economic Systems Research, Taylor & Francis Journals, vol. 15(2), pages 135-159.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Iacus Stefano M. & Salini Silvia & Siletti Elena & Porro Giuseppe, 2020. "Controlling for Selection Bias in Social Media Indicators through Official Statistics: a Proposal," Journal of Official Statistics, Sciendo, vol. 36(2), pages 315-338, June.
    2. Sakshaug Joseph W. & Wiśniowski Arkadiusz & Ruiz Diego Andres Perez & Blom Annelies G., 2019. "Supplementing Small Probability Samples with Nonprobability Samples: A Bayesian Approach," Journal of Official Statistics, Sciendo, vol. 35(3), pages 653-681, September.
    3. Ann-Kristin Kreutzmann, 2018. "Estimation of sample quantiles: challenges and issues in the context of income and wealth distributions [Die Schätzung von Quantilen: Herausforderungen und Probleme im Kontext von Einkommens- und V," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 12(3), pages 245-270, December.
    4. Ralf Thomas Münnich & Markus Zwick, 2016. "Big Data und was nun? Neue Datenbestände und ihre Auswirkungen," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 10(2), pages 73-77, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Szymkowiak Marcin & Młodak Andrzej & Wawrowski Łukasz, 2017. "Mapping Poverty at the Level of Subregions in Poland Using Indirect Estimation," Statistics in Transition New Series, Statistics Poland, vol. 18(4), pages 609-635, December.
    2. Sugasawa, Shonosuke & Kubokawa, Tatsuya, 2017. "Transforming response values in small area prediction," Computational Statistics & Data Analysis, Elsevier, vol. 114(C), pages 47-60.
    3. Marcin Szymkowiak & Andrzej Młodak & Łukasz Wawrowski, 2017. "Mapping Poverty At The Level Of Subregions In Poland Using Indirect Estimation," Statistics in Transition New Series, Polish Statistical Association, vol. 18(4), pages 609-635, December.
    4. J. N. K. Rao, 2015. "Inferential issues in model-based small area estimation: some new developments," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(4), pages 491-510, December.
    5. repec:csb:stintr:v:17:y:2016:i:1:p:9-24 is not listed on IDEAS
    6. Erciulescu Andreea L. & Fuller Wayne A., 2016. "Small Area Prediction Under Alternative Model Specifications," Statistics in Transition New Series, Statistics Poland, vol. 17(1), pages 9-24, March.
    7. Stefano Marchetti & Luca Secondi, 2017. "Estimates of Household Consumption Expenditure at Provincial Level in Italy by Using Small Area Estimation Methods: “Real” Comparisons Using Purchasing Power Parities," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 131(1), pages 215-234, March.
    8. Iacus Stefano M. & Salini Silvia & Siletti Elena & Porro Giuseppe, 2020. "Controlling for Selection Bias in Social Media Indicators through Official Statistics: a Proposal," Journal of Official Statistics, Sciendo, vol. 36(2), pages 315-338, June.
    9. Andreea L. Erciulescu & Wayne A. Fuller, 2016. "Small Area Prediction Under Alternative Model Specifications," Statistics in Transition New Series, Polish Statistical Association, vol. 17(1), pages 9-24, March.
    10. Chakraborty Adrijo & Datta Gauri Sankar & Mandal Abhyuday, 2016. "A Two-Component Normal Mixture Alternative to the Fay-Herriot Model," Statistics in Transition New Series, Statistics Poland, vol. 17(1), pages 67-90, March.
    11. Stefano Marchetti & Caterina Giusti & Nicola Salvati & Monica Pratesi, 2017. "Small area estimation based on M-quantile models in presence of outliers in auxiliary variables," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(4), pages 531-555, November.
    12. J. N. K. Rao, 2015. "Inferential Issues In Model-Based Small Area Estimation: Some New Developments," Statistics in Transition New Series, Polish Statistical Association, vol. 16(4), pages 491-510, December.
    13. repec:csb:stintr:v:17:y:2016:i:1:p:67-90 is not listed on IDEAS
    14. Malay Ghosh, 2020. "Small area estimation: its evolution in five decades," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 1-22, August.
    15. Ralf Münnich & Jan Burgard & Martin Vogt, 2013. "Small Area-Statistik: Methoden und Anwendungen," AStA Wirtschafts- und Sozialstatistisches Archiv, Springer;Deutsche Statistische Gesellschaft - German Statistical Society, vol. 6(3), pages 149-191, March.
    16. Isabel Molina & Paul Corral & Minh Nguyen, 2022. "Estimation of poverty and inequality in small areas: review and discussion," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1143-1166, December.
    17. Schmid, Timo & Bruckschen, Fabian & Salvati, Nicola & Zbiranski, Till, 2016. "Constructing socio-demographic indicators for National Statistical Institutes using mobile phone data: Estimating literacy rates in Senegal," Discussion Papers 2016/9, Free University Berlin, School of Business & Economics.
    18. Shonosuke Sugasawa & Tatsuya Kubokawa & J. N. K. Rao, 2018. "Small area estimation via unmatched sampling and linking models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 407-427, June.
    19. Shonosuke Sugasawa & Tatsuya Kubokawa & Kota Ogasawara, 2017. "Empirical Uncertain Bayes Methods in Area-level Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 44(3), pages 684-706, September.
    20. Sanjoy K. Sinha, 2019. "Robust small area estimation in generalized linear mixed models," METRON, Springer;Sapienza Università di Roma, vol. 77(3), pages 201-225, December.
    21. Adrijo Chakraborty & Gauri Sankar Datta & Abhyuday Mandal, 2016. "A Two-Component Normal Mixture Alternative To The Fay-Herriot Model," Statistics in Transition New Series, Polish Statistical Association, vol. 17(1), pages 67-90, March.
    22. Ghosh Malay, 2020. "Small area estimation: its evolution in five decades," Statistics in Transition New Series, Statistics Poland, vol. 21(4), pages 1-22, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:astaws:v:10:y:2016:i:2:d:10.1007_s11943-016-0190-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.