IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v50y2001i1d10.1023_a1005682102768.html
   My bibliography  Save this article

Data collection methods on the Web for infometric purposes — A review and analysis

Author

Listed:
  • Judit Bar-Ilan

    (The Hebrew University of Jerusalem)

Abstract

We present different methods of data collection from the Web for informetric purposes. For each method, some studies utilizing it are reviewed, and advantages and shortcomings of each technique are discussed. The paper emphasizes that data collection must be carried out with great care. Since the Web changes constantly, the findings of any study are valid only in the time frame in which it was carried out, and are dependent on the quality of the data collection tools, which are usually not under the control of the researcher. At the current time, the quality and the reliability of most of the available search tools are not satisfactory, thus informetric analyses of the Web mainly serve as demonstrations of the applicability of informetric methods to this medium, and not as a means for obtaining definite conclusions. A possible solution is for the scientific world to develop its own search and data collection tools.

Suggested Citation

  • Judit Bar-Ilan, 2001. "Data collection methods on the Web for infometric purposes — A review and analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 50(1), pages 7-32, January.
  • Handle: RePEc:spr:scient:v:50:y:2001:i:1:d:10.1023_a:1005682102768
    DOI: 10.1023/A:1005682102768
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1023/A:1005682102768
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1023/A:1005682102768?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Blaise Cronin & Herbert W. Snyder & Howard Rosenbaum & Anna Martinson & Ewa Callahan, 1998. "Invoked on the Web," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 49(14), pages 1319-1328.
    2. Nancy C. M. Ross & Dietmar Wolfram, 2000. "End user searching on the Internet: An analysis of term pair topics submitted to the Excite search engine," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 51(10), pages 949-958.
    3. Wallace Koehler, 1999. "An analysis of web page and web site constancy and permanence," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 50(2), pages 162-180.
    4. Judit Bar-Ilan & Bluma C. Peritz, 1999. "The life span of a specific topic on the web," Scientometrics, Springer;Akadémiai Kiadó, vol. 46(3), pages 371-382, November.
    5. Leo Katz, 1953. "A new status index derived from sociometric analysis," Psychometrika, Springer;The Psychometric Society, vol. 18(1), pages 39-43, March.
    6. Steve Lawrence & C. Lee Giles, 1999. "Accessibility of information on the web," Nature, Nature, vol. 400(6740), pages 107-107, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Enrique Orduña-Malea, 2021. "Dot-science top level domain: Academic websites or dumpsites?," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(4), pages 3565-3591, April.
    2. Rong Tang & Mike Thelwall, 2004. "Patterns of national and international Web inlinks to US academic departments: An analysis of disciplinary variations," Scientometrics, Springer;Akadémiai Kiadó, vol. 60(3), pages 475-485, August.
    3. Peter B. Musgrove & Ray Binns & Teresa Page-Kennedy & Mike Thelwall, 2003. "A method for identifying clusters in sets of interlinking Web spaces," Scientometrics, Springer;Akadémiai Kiadó, vol. 58(3), pages 657-672, November.
    4. José Luis Ortega & Viv Cothey & Isidro F. Aguillo, 2009. "How old is the Web? Characterizing the age and the currency of the European scientific Web," Scientometrics, Springer;Akadémiai Kiadó, vol. 81(1), pages 295-309, October.
    5. Simone Belli & Carlos Gonzalo-Penela, 2020. "Science, research, and innovation infospheres in Google results of the Ibero-American countries," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(2), pages 635-653, May.
    6. Enrique Orduña-Malea, 2020. "Crossing the academic ocean? Judit Bar-Ilan’s oeuvre on search engines studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(3), pages 1317-1340, June.
    7. Gali Halevi, 2020. "The scientific legacy of Judit Bar-Ilan," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(3), pages 1201-1209, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. José Luis Ortega & Viv Cothey & Isidro F. Aguillo, 2009. "How old is the Web? Characterizing the age and the currency of the European scientific Web," Scientometrics, Springer;Akadémiai Kiadó, vol. 81(1), pages 295-309, October.
    2. Enrique Orduna-Malea & Juan M. Ayllón & Alberto Martín-Martín & Emilio Delgado López-Cózar, 2015. "Methods for estimating the size of Google Scholar," Scientometrics, Springer;Akadémiai Kiadó, vol. 104(3), pages 931-949, September.
    3. Dangzhi Zhao & Elisabeth Logan, 2002. "Citation analysis using scientific publications on the Web as data source: A case study in the XML research area," Scientometrics, Springer;Akadémiai Kiadó, vol. 54(3), pages 449-472, July.
    4. Gandal, Neil, 2001. "The dynamics of competition in the internet search engine market," International Journal of Industrial Organization, Elsevier, vol. 19(7), pages 1103-1117, July.
    5. Thomas J. Sargent & John Stachurski, 2022. "Economic Networks: Theory and Computation," Papers 2203.11972, arXiv.org, revised Jul 2022.
    6. Karimi, Fatemeh & Lotfi, Shahriar & Izadkhah, Habib, 2021. "Community-guided link prediction in multiplex networks," Journal of Informetrics, Elsevier, vol. 15(4).
    7. D’Errico, Marco & Battiston, Stefano & Peltonen, Tuomas & Scheicher, Martin, 2018. "How does risk flow in the credit default swap market?," Journal of Financial Stability, Elsevier, vol. 35(C), pages 53-74.
    8. Liu, Xiaodong & Patacchini, Eleonora & Zenou, Yves & Lee, Lung-Fei, 2011. "Criminal Networks: Who is the Key Player?," Research Papers in Economics 2011:7, Stockholm University, Department of Economics.
    9. Agnieszka Rusinowska & Rudolf Berghammer & Harrie de Swart & Michel Grabisch, 2011. "Social networks: Prestige, centrality, and influence (Invited paper)," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) hal-00633859, HAL.
    10. Gabrielle Demange, 2018. "Contagion in Financial Networks: A Threat Index," Management Science, INFORMS, vol. 64(2), pages 955-970, February.
    11. Lin, Dan & Wu, Jiajing & Xuan, Qi & Tse, Chi K., 2022. "Ethereum transaction tracking: Inferring evolution of transaction networks via link prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 600(C).
    12. Yao Hongxing & Lu Yunxia, 2017. "Analyzing the Potential Influence of Shanghai Stock Market Based on Link Prediction Method," Journal of Systems Science and Information, De Gruyter, vol. 5(5), pages 446-461, October.
    13. Zhepeng Li & Xiao Fang & Xue Bai & Olivia R. Liu Sheng, 2017. "Utility-Based Link Recommendation for Online Social Networks," Management Science, INFORMS, vol. 63(6), pages 1938-1952, June.
    14. Sheikhahmadi, Amir & Nematbakhsh, Mohammad Ali & Shokrollahi, Arman, 2015. "Improving detection of influential nodes in complex networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 436(C), pages 833-845.
    15. Dequiedt, Vianney & Zenou, Yves, 2017. "Local and consistent centrality measures in parameterized networks," Mathematical Social Sciences, Elsevier, vol. 88(C), pages 28-36.
    16. Jianhua Hou, 2017. "Exploration into the evolution and historical roots of citation analysis by referenced publication year spectroscopy," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(3), pages 1437-1452, March.
    17. Mauleon, Ana & Nanumyan, Mariam & Vannetelbosch, Vincent, 2024. "Ideal efforts and consensus in a multi-layer network game," LIDAM Discussion Papers CORE 2024023, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    18. Isidro F. Aguillo, 2020. "Altmetrics of the Open Access Institutional Repositories: a webometrics approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(3), pages 1181-1192, June.
    19. Mike Thelwall, 2017. "Judit Bar-Ilan: information scientist, computer scientist, scientometrician," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1235-1244, December.
    20. ,, 2014. "A ranking method based on handicaps," Theoretical Economics, Econometric Society, vol. 9(3), September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:50:y:2001:i:1:d:10.1023_a:1005682102768. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.