IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0050651.html
   My bibliography  Save this article

Learning from Decoys to Improve the Sensitivity and Specificity of Proteomics Database Search Results

Author

Listed:
  • Amit Kumar Yadav
  • Dhirendra Kumar
  • Debasis Dash

Abstract

The statistical validation of database search results is a complex issue in bottom-up proteomics. The correct and incorrect peptide spectrum match (PSM) scores overlap significantly, making an accurate assessment of true peptide matches challenging. Since the complete separation between the true and false hits is practically never achieved, there is need for better methods and rescoring algorithms to improve upon the primary database search results. Here we describe the calibration and False Discovery Rate (FDR) estimation of database search scores through a dynamic FDR calculation method, FlexiFDR, which increases both the sensitivity and specificity of search results. Modelling a simple linear regression on the decoy hits for different charge states, the method maximized the number of true positives and reduced the number of false negatives in several standard datasets of varying complexity (18-mix, 49-mix, 200-mix) and few complex datasets (E. coli and Yeast) obtained from a wide variety of MS platforms. The net positive gain for correct spectral and peptide identifications was up to 14.81% and 6.2% respectively. The approach is applicable to different search methodologies- separate as well as concatenated database search, high mass accuracy, and semi-tryptic and modification searches. FlexiFDR was also applied to Mascot results and showed better performance than before. We have shown that appropriate threshold learnt from decoys, can be very effective in improving the database search results. FlexiFDR adapts itself to different instruments, data types and MS platforms. It learns from the decoy hits and sets a flexible threshold that automatically aligns itself to the underlying variables of data quality and size.

Suggested Citation

  • Amit Kumar Yadav & Dhirendra Kumar & Debasis Dash, 2012. "Learning from Decoys to Improve the Sensitivity and Specificity of Proteomics Database Search Results," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-10, November.
  • Handle: RePEc:plo:pone00:0050651
    DOI: 10.1371/journal.pone.0050651
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0050651
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0050651&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0050651?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. N/A, 2004. "Index for 2004," European Union Politics, , vol. 5(4), pages 511-512, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Olunifesi Adekunle Suraj, 2016. "Managing Telecommunications for Development: An Analysis of Intellectual Capital in Nigerian Telecommunication Industry," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 15(01), pages 1-30, March.
    2. Allais, Olivier & Etilé, Fabrice & Lecocq, Sébastien, 2015. "Mandatory labels, taxes and market forces: An empirical evaluation of fat policies," Journal of Health Economics, Elsevier, vol. 43(C), pages 27-44.
    3. Sakinah Mat Zin & Ahmad Azrin Adnan & Iskandar Hasan Abdullah, 2017. "How Can Ibn Khaldun’s Economic Philosophy Revive the Intellectual Capital of Entrepreneurs," Asian Social Science, Canadian Center of Science and Education, vol. 13(6), pages 164-164, June.
    4. Sandy Tubeuf & Marc Perronnin, 2008. "New prospects in the analysis of inequalities in health: a measurement of health encompassing several dimensions of health," Health, Econometrics and Data Group (HEDG) Working Papers 08/01, HEDG, c/o Department of Economics, University of York.
    5. Queiroz, Bernardo L & Gonzaga, Marcos Roberto & Nogales, Ana Maria & Torrente, Bruno & de Abreu, Daisy Maria Xavier, 2019. "Life expectancy, adult mortality and completeness of death counts in Brazil and regions: comparative analysis of IHME, IBGE and other researchers estimates of levels and trends," OSF Preprints pj3sx, Center for Open Science.
    6. Prakashan Veettil & Stijn Speelman & Guido Huylenbroeck, 2013. "Estimating the Impact of Water Pricing on Water Use Efficiency in Semi-arid Cropping System: An Application of Probabilistically Constrained Nonparametric Efficiency Analysis," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 27(1), pages 55-73, January.
    7. Barnabé Walheer, 2018. "Cost Malmquist productivity index: an output-specific approach for group comparison," Journal of Productivity Analysis, Springer, vol. 49(1), pages 79-94, February.
    8. Bushra Khalid & Bueh Cholaw & Débora Souza Alvim & Shumaila Javeed & Junaid Aziz Khan & Muhammad Asif Javed & Azmat Hayat Khan, 2018. "Riverine flood assessment in Jhang district in connection with ENSO and summer monsoon rainfall over Upper Indus Basin for 2010," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 92(2), pages 971-993, June.
    9. Alcaniz, Leire & Gomez-Bezares, Fernando & Roslender, Robin, 2011. "Theoretical perspectives on intellectual capital: A backward look and a proposal for going forward," Accounting forum, Elsevier, vol. 35(2), pages 104-117.
    10. Pradeep Kumar Vyas & Suresh V Rang & Jayant R Shah & Rajiv S Mathur & Gaurav Ghatavat & Priyanka Chaudhary, 2017. "Six Minute Walk Test as a Criteria for Evaluation of Functional Status and Disability by One Time Single Measurement of Distance Walked in Six Minutes for Breathless Patients," International Journal of Pulmonary & Respiratory Sciences, Juniper Publishers Inc., vol. 1(3), pages 1-9, March.
    11. Wolswijk, Guido & de Haan, Jakob, 2005. "Government debt management in the euro area - recent theoretical developments and changes in practices," Occasional Paper Series 25, European Central Bank.
    12. Kwaku Ohene-Asare & Jones Kofi Anyimadu Asare & Charles Turkson, 2019. "Dynamic cost productivity and economies of scale of Ghanaian insurers," The Geneva Papers on Risk and Insurance - Issues and Practice, Palgrave Macmillan;The Geneva Association, vol. 44(1), pages 148-177, January.
    13. Kaiwen Li & Ming Wang & Kai Liu, 2021. "The Study on Compound Drought and Heatwave Events in China Using Complex Networks," Sustainability, MDPI, vol. 13(22), pages 1-15, November.
    14. Robert Coats & Mariza Costa-Cabral & John Riverson & John Reuter & Goloka Sahoo & Geoffrey Schladow & Brent Wolfe, 2013. "Projected 21st century trends in hydroclimatology of the Tahoe basin," Climatic Change, Springer, vol. 116(1), pages 51-69, January.
    15. Ramesh P. Rudra & Balew A. Mekonnen & Rituraj Shukla & Narayan Kumar Shrestha & Pradeep K. Goel & Prasad Daggupati & Asim Biswas, 2020. "Currents Status, Challenges, and Future Directions in Identifying Critical Source Areas for Non-Point Source Pollution in Canadian Conditions," Agriculture, MDPI, vol. 10(10), pages 1-25, October.
    16. Philip B Mason & Frank M. Howell & Jeremy R. Porter, 2014. "Examining Rural-Urban Obesity Trends among Youth in the U.S.: Testing the Socioeconomic Gradient Hypothesis," International Journal of Business and Social Research, LAR Center Press, vol. 4(12), pages 27-42, December.
    17. Schmeling, Maik & Schrimpf, Andreas & Steffensen, Sigurd A.M., 2022. "Monetary policy expectation errors," Journal of Financial Economics, Elsevier, vol. 146(3), pages 841-858.
    18. Liliana Varela, 2018. "Reallocation, Competition, and Productivity: Evidence from a Financial Liberalization Episode," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 85(2), pages 1279-1313.
    19. Carlo Azzarri & Gero Carletto & Benjamin Davis & Alberto Zezza, 2006. "Monitoring Poverty Without Consumption Data : An Application Using the Albania Panel Survey," Eastern European Economics, Taylor & Francis Journals, vol. 44(1), pages 59-82, February.
    20. Guissoni, Leandro Angotti & Consoli, Matheus Alberto & Rodrigues, Jonny Mateus, 2013. "Gerenciamento por categorias em pequenos supermercados compensa o esforço?," RAE - Revista de Administração de Empresas, FGV-EAESP Escola de Administração de Empresas de São Paulo (Brazil), vol. 53(6), November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0050651. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.