IDEAS home Printed from https://ideas.repec.org/a/bla/jorssc/v69y2020i4p909-929.html
   My bibliography  Save this article

Global forensic geolocation with deep neural networks

Author

Listed:
  • Neal S. Grantham
  • Brian J. Reich
  • Eric B. Laber
  • Krishna Pacifici
  • Robert R. Dunn
  • Noah Fierer
  • Matthew Gebert
  • Julia S. Allwood
  • Seth A. Faith

Abstract

An important problem in modern forensic analyses is identifying the provenance of materials at a crime scene, such as biological material on a piece of clothing. This procedure, which is known as geolocation, is conventionally guided by expert knowledge of the biological evidence and therefore tends to be application specific, labour intensive and often subjective. Purely data‐driven methods have yet to be fully realized in this domain, because in part of the lack of a sufficiently rich source of data. However, high throughput sequencing technologies can identify tens of thousands of fungi and bacteria taxa by using DNA recovered from a single swab collected from nearly any object or surface. This microbial community, or microbiome, may be highly informative of the provenance of the sample, but data on the spatial variation of microbiomes are sparse and high dimensional and have a complex dependence structure that render them difficult to model with standard statistical tools. Deep learning algorithms have generated a tremendous amount of interest within the machine learning community for their predictive performance in high dimensional problems. We present DeepSpace: a new algorithm for geolocation that aggregates over an ensemble of deep neural network classifiers trained on randomly generated Voronoi partitions of a spatial domain. The DeepSpace algorithm makes remarkably good point predictions; for example, when applied to the microbiomes of over 1300 dust samples collected across continental USA, more than half of geolocation predictions produced by this model fall less than 100 km from their true origin, which is a 60% reduction in error from competing geolocation methods. Moreover, we apply DeepSpace to a novel data set of global dust samples collected from nearly 30 countries, finding that dust‐associated fungi alone predict a sample's country of origin with nearly 90% accuracy.

Suggested Citation

  • Neal S. Grantham & Brian J. Reich & Eric B. Laber & Krishna Pacifici & Robert R. Dunn & Noah Fierer & Matthew Gebert & Julia S. Allwood & Seth A. Faith, 2020. "Global forensic geolocation with deep neural networks," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 69(4), pages 909-929, August.
  • Handle: RePEc:bla:jorssc:v:69:y:2020:i:4:p:909-929
    DOI: 10.1111/rssc.12427
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssc.12427
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssc.12427?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Neal S Grantham & Brian J Reich & Krishna Pacifici & Eric B Laber & Holly L Menninger & Jessica B Henley & Albert Barberán & Jonathan W Leff & Noah Fierer & Robert R Dunn, 2015. "Fungi Identify the Geographic Origin of Dust Samples," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-13, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.

      More about this item

      Statistics

      Access and download statistics

      Corrections

      All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssc:v:69:y:2020:i:4:p:909-929. See general information about how to correct material in RePEc.

      If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

      If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

      If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

      For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

      Please note that corrections may take a couple of weeks to filter through the various RePEc services.

      IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.