IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v53y2007i9p1375-1388.html
   My bibliography  Save this article

Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web

Author

Listed:
  • Sanjiv R. Das

    (Department of Finance, Leavey School of Business, Santa Clara University, Santa Clara, California 95053)

  • Mike Y. Chen

    (Ludic Labs, San Mateo, California 94401)

Abstract

Extracting sentiment from text is a hard semantic problem. We develop a methodology for extracting small investor sentiment from stock message boards. The algorithm comprises different classifier algorithms coupled together by a voting scheme. Accuracy levels are similar to widely used Bayes classifiers, but false positives are lower and sentiment accuracy higher. Time series and cross-sectional aggregation of message information improves the quality of the resultant sentiment index, particularly in the presence of slang and ambiguity. Empirical applications evidence a relationship with stock values--tech-sector postings are related to stock index levels, and to volumes and volatility. The algorithms may be used to assess the impact on investor opinion of management announcements, press releases, third-party news, and regulatory changes.

Suggested Citation

  • Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
  • Handle: RePEc:inm:ormnsc:v:53:y:2007:i:9:p:1375-1388
    DOI: 10.1287/mnsc.1070.0704
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.1070.0704
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.1070.0704?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bagnoli, Mark & Beneish, Messod D. & Watts, Susan G., 1999. "Whisper forecasts of quarterly earnings per share," Journal of Accounting and Economics, Elsevier, vol. 28(1), pages 27-50, November.
    2. Lo, Andrew W & MacKinlay, A Craig, 1990. "When Are Contrarian Profits Due to Stock Market Overreaction?," The Review of Financial Studies, Society for Financial Studies, vol. 3(2), pages 175-205.
    3. David Godes & Dina Mayzlin, 2004. "Using Online Conversations to Study Word-of-Mouth Communication," Marketing Science, INFORMS, vol. 23(4), pages 545-560, June.
    4. N/A, 1996. "Note:," Foreign Trade Review, , vol. 31(1-2), pages 1-1, January.
    5. repec:bla:jfinan:v:59:y:2004:i:3:p:1259-1294 is not listed on IDEAS
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harrison Hong & Terence Lim & Jeremy C. Stein, 2000. "Bad News Travels Slowly: Size, Analyst Coverage, and the Profitability of Momentum Strategies," Journal of Finance, American Finance Association, vol. 55(1), pages 265-295, February.
    2. Dominique Guegan & Giovanni de Luca & Giorgia Rivieccio, 2017. "Three-stage estimation method for non-linear multiple time-series," Post-Print halshs-01439860, HAL.
    3. Diwanji, Vaibhav S. & Cortese, Juliann, 2020. "Contrasting user generated videos versus brand generated videos in ecommerce," Journal of Retailing and Consumer Services, Elsevier, vol. 54(C).
    4. Shijie Lu & Xin (Shane) Wang & Neil Bendle, 2020. "Does Piracy Create Online Word of Mouth? An Empirical Analysis in the Movie Industry," Management Science, INFORMS, vol. 66(5), pages 2140-2162, May.
    5. Kris James Mitchener & Matthew Jaremski, 2014. "The Evolution of Bank Supervision: Evidence from U.S. States," NBER Working Papers 20603, National Bureau of Economic Research, Inc.
    6. , G. & , & ,, 2008. "Non-Bayesian updating: A theoretical framework," Theoretical Economics, Econometric Society, vol. 3(2), June.
    7. Wolfgang Aussenegg & Andreas Grünbichler, 1999. "Der Size-Effekt am Österreichischen Aktienmarkt," Schmalenbach Journal of Business Research, Springer, vol. 51(7), pages 636-661, July.
    8. Allaudeen Hameed, 1997. "Time-Varying Factors And Cross-Autocorrelations In Short-Horizon Stock Returns," Journal of Financial Research, Southern Finance Association;Southwestern Finance Association, vol. 20(4), pages 435-458, December.
    9. Drakos, Anastassios A., 2016. "Does the relationship between small and large portfolios’ returns confirm the lead–lag effect? Evidence from the Athens Stock Exchange," Research in International Business and Finance, Elsevier, vol. 36(C), pages 546-561.
    10. Chris Stivers & Licheng Sun, 2013. "Market Cycles and the Performance of Relative Strength Strategies," Financial Management, Financial Management Association International, vol. 42(2), pages 263-290, June.
    11. Sjoo, Boo & Zhang, Jianhua, 2000. "Market segmentation and information diffusion in China's stock markets," Journal of Multinational Financial Management, Elsevier, vol. 10(3-4), pages 421-438, December.
    12. Semenov, Andrei, 2021. "Measuring the stock's factor beta and identifying risk factors under market inefficiency," The Quarterly Review of Economics and Finance, Elsevier, vol. 80(C), pages 635-649.
    13. Nicholas Apergis & Vasilios Plakandaras & Ioannis Pragidis, 2022. "Industry momentum and reversals in stock markets," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 27(3), pages 3093-3138, July.
    14. Andrei Kapaev, 2013. "Remark on repo and options," Papers 1311.5211, arXiv.org.
    15. Daniel Sanches, 2016. "On the Inherent Instability of Private Money," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 20, pages 198-214, April.
    16. Goyal, Sanjeev & Heidari, Hoda & Kearns, Michael, 2019. "Competitive contagion in networks," Games and Economic Behavior, Elsevier, vol. 113(C), pages 58-79.
    17. Inyoung Chae & Andrew T. Stephen & Yakov Bart & Dai Yao, 2017. "Spillover Effects in Seeded Word-of-Mouth Marketing Campaigns," Marketing Science, INFORMS, vol. 36(1), pages 89-104, January.
    18. James J. McAndrews & William Roberds, 1999. "Payment intermediation and the origins of banking," Staff Reports 85, Federal Reserve Bank of New York.
    19. Khim-Yong Goh & Cheng-Suang Heng & Zhijie Lin, 2013. "Social Media Brand Community and Consumer Behavior: Quantifying the Relative Impact of User- and Marketer-Generated Content," Information Systems Research, INFORMS, vol. 24(1), pages 88-107, March.
    20. Allen Head & Junfeng Qiu, 2007. "Elastic Money, Inflation, And Interest Rate Policy," Working Paper 1152, Economics Department, Queen's University.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:53:y:2007:i:9:p:1375-1388. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.