IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v115y2018i2d10.1007_s11192-018-2688-8.html
   My bibliography  Save this article

Writing styles in different scientific disciplines: a data science approach

Author

Listed:
  • Amnah Alluqmani

    (Lawrence Technological University)

  • Lior Shamir

    (Lawrence Technological University)

Abstract

We quantified several different elements that reflect writing styles of scientific papers in four related disciplines: physics, astrophysics, mathematics, and computer science. Text descriptors such as the use of punctuation characters, the use of upper case letters, use of quotations, and other descriptors that are not based on the words used in the papers were extracted from each document. Based on these features alone an automatic classifier was able to identify the discipline of the paper with accuracy much higher than mere chance, showing that different disciplines can be differentiated by their writing styles, and without using their content directly as reflected by common words used in the papers. The study showed statistically significant differences between the different disciplines such as use of acronyms, sentence length, word length, and more. Our findings also show changes in writing styles in specific disciplines over time. For instance, mathematicians and computer scientists began to use less acronyms starting from 2006, and there is a dramatic decrease of the average of punctuation characters in mathematics papers. These observations suggest that even in closely related disciplines there are differences in the scientific communication expressed through writing styles, demonstrating the existence of a “signature” writing style developed in each discipline. These findings should also be taken into account when a multidisciplinary group of collaborators assign writing duties on a joint scientific manuscript.

Suggested Citation

  • Amnah Alluqmani & Lior Shamir, 2018. "Writing styles in different scientific disciplines: a data science approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 1071-1085, May.
  • Handle: RePEc:spr:scient:v:115:y:2018:i:2:d:10.1007_s11192-018-2688-8
    DOI: 10.1007/s11192-018-2688-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-018-2688-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-018-2688-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Adam Okulicz-Kozaryn, 2013. "Cluttered writing: adjectives and adverbs in academia," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(3), pages 679-681, September.
    2. C. Sean Burns & Charles W. Fox, 2017. "Language and socioeconomics predict geographic variation in peer review outcomes at an ecology journal," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(2), pages 1113-1127, November.
    3. Lei Lei, 2016. "When science meets cluttered writing: adjectives and adverbs in academia revisited," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1361-1372, June.
    4. Shlomo Argamon & Jeff Dodick & Paul Chase, 2008. "Language use reflects scientific methodology: A corpus-based study of peer-reviewed journal articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 75(2), pages 203-238, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Song, Ningyuan & Chen, Kejun & Zhao, Yuehua, 2023. "Understanding writing styles of scientific papers in the IS-LS domain: Evidence from abstracts over the past three decades," Journal of Informetrics, Elsevier, vol. 17(1).
    2. Omar Mubin & Dhaval Tejlavwala & Mudassar Arsalan & Muneeb Ahmad & Simeon Simoff, 2018. "An assessment into the characteristics of award winning papers at CHI," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 1181-1201, August.
    3. Melissa A. Wheeler & Ekaterina Vylomova & Melanie J. McGrath & Nick Haslam, 2021. "More confident, less formal: stylistic changes in academic psychology writing from 1970 to 2016," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(12), pages 9603-9612, December.
    4. Xueying Liu & Haoran Zhu, 2023. "Linguistic positivity in soft and hard disciplines: temporal dynamics, disciplinary variation, and the relationship with research impact," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(5), pages 3107-3127, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ju Wen & Lei Lei, 2022. "Adjectives and adverbs in life sciences across 50 years: implications for emotions and readability in academic texts," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(8), pages 4731-4749, August.
    2. Lei Lei & Sheng Yan, 2016. "Readability and citations in information science: evidence from abstracts and articles of four journals (2003–2012)," Scientometrics, Springer;Akadémiai Kiadó, vol. 108(3), pages 1155-1169, September.
    3. Omar Mubin & Dhaval Tejlavwala & Mudassar Arsalan & Muneeb Ahmad & Simeon Simoff, 2018. "An assessment into the characteristics of award winning papers at CHI," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 1181-1201, August.
    4. Amon, Julian & Hornik, Kurt, 2022. "Is it all bafflegab? – Linguistic and meta characteristics of research articles in prestigious economics journals," Journal of Informetrics, Elsevier, vol. 16(2).
    5. Andrés Marroquín & Julio H. Cole, 2015. "Economical writing (or, “Think Hemingway”)," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(1), pages 251-259, April.
    6. Daoud, Adel & Kohl, Sebastian, 2016. "How much do sociologists write about economic topics? Using big data to test some conventional views in economic sociology, 1890 to 2014," MPIfG Discussion Paper 16/7, Max Planck Institute for the Study of Societies.
    7. Myrto-Panagiota Zacharof & Anna Charalambidou, 2018. "An Exploration of the Sub-Register of Chemical Engineering Research Papers Published in English," Publications, MDPI, vol. 6(3), pages 1-19, July.
    8. Balázs Győrffy & Andrea Magda Nagy & Péter Herman & Ádám Török, 2018. "Factors influencing the scientific performance of Momentum grant holders: an evaluation of the first 117 research groups," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 409-426, October.
    9. Edoardo Magnone, 2014. "A novel graphical representation of sentence complexity: the description and its application," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(2), pages 1301-1329, February.
    10. Pengfei Jia & Weixi Xie & Guangyao Zhang & Xianwen Wang, 2023. "Do reviewers get their deserved acknowledgments from the authors of manuscripts?," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(10), pages 5687-5703, October.
    11. Diego Marino Fages, 2020. "Write better, publish better," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(3), pages 1671-1681, March.
    12. Lei Lei, 2016. "When science meets cluttered writing: adjectives and adverbs in academia revisited," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(3), pages 1361-1372, June.
    13. Henry Small, 2011. "Interpreting maps of science using citation context sentiments: a preliminary investigation," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(2), pages 373-388, May.
    14. Peng, Wen & Yue, Mingliang & Sun, Mingyue & Ma, Tingcan, 2024. "Revision and academic impact: A case study of bioRxiv preprint papers," Journal of Informetrics, Elsevier, vol. 18(1).
    15. Adel Daoud & Sebastian Kohl, 2015. "Is there a New Economic Sociology Effect? A Topic Model on the Economic Orientation of Sociology, 1890 to 2014," Working Papers 1520, New School for Social Research, Department of Economics.
    16. Song, Ningyuan & Chen, Kejun & Zhao, Yuehua, 2023. "Understanding writing styles of scientific papers in the IS-LS domain: Evidence from abstracts over the past three decades," Journal of Informetrics, Elsevier, vol. 17(1).
    17. Marta Kowal & Piotr Sorokowski & Emanuel Kulczycki & Agnieszka Żelaźniewicz, 2022. "The impact of geographical bias when judging scientific studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 265-273, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:115:y:2018:i:2:d:10.1007_s11192-018-2688-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.