Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/45476
Title: Automated location invariant animal detection in camera trap images using publicly available data sources
Contributor(s): Shepley, Andrew  (author)orcid ; Falzon, Greg  (author)orcid ; Meek, Paul  (author); Kwan, Paul (author)
Publication Date: 2021-05
Early Online Version: 2021-03-10
Open Access: Yes
DOI: 10.1002/ece3.7344
Handle Link: https://hdl.handle.net/1959.11/45476
Abstract: 
  1. A time-consuming challenge faced by camera trap practitioners is the extraction of meaningful data from images to inform ecological management. An increasingly popular solution is automated image classification software. However, most solutions are not sufficiently robust to be deployed on a large scale due to lack of location invariance when transferring models between sites. This prevents optimal use of ecological data resulting in significant expenditure of time and resources to annotate and retrain deep learning models.
  2. We present a method ecologists can use to develop optimized location invariant camera trap object detectors by (a) evaluating publicly available image datasets characterized by high intradataset variability in training deep learning models for camera trap object detection and (b) using small subsets of camera trap images to optimize models for high accuracy domain-specific applications.
  3. We collected and annotated three datasets of images of striped hyena, rhinoceros, and pigs, from the image-sharing websites FlickR and iNaturalist (FiN), to train three object detection models. We compared the performance of these models to that of three models trained on the Wildlife Conservation Society and Camera CATalogue datasets, when tested on out-of-sample Snapshot Serengeti datasets. We then increased FiN model robustness by infusing small subsets of camera trap images into training.
  4. In all experiments, the mean Average Precision (mAP) of the FiN trained models was significantly higher (82.33%-88.59%) than that achieved by the models trained only on camera trap datasets (38.5%-66.74%). Infusion further improved mAP by 1.78%-32.08%.
  5. Ecologists can use FiN images for training deep learning object detection solutions for camera trap image processing to develop location invariant, robust, out-of-the-box software. Models can be further optimized by infusion of 5%-10% camera trap images into training data. This would allow AI technologies to be deployed on a large scale in ecological applications. Datasets and code related to this study are open source and available on this repository: https://doi.org/10.5061/dryad.1c59zw3tx.
Publication Type: Journal Article
Source of Publication: Ecology and Evolution, 11(9), p. 4494-4506
Publisher: John Wiley & Sons Ltd
Place of Publication: United Kingdom
ISSN: 2045-7758
Fields of Research (FoR) 2020: 460304 Computer vision
460202 Autonomous agents and multiagent systems
460103 Applications in life sciences
Socio-Economic Objective (SEO) 2020: 220402 Applied computing
220403 Artificial intelligence
220404 Computer systems
Peer Reviewed: Yes
HERDC Category Description: C1 Refereed Article in a Scholarly Journal
Appears in Collections:Journal Article
School of Environmental and Rural Science
School of Science and Technology

Files in This Item:
2 files
File Description SizeFormat 
openpublished/AutomatedShepleyFalzonMeek2021JournalArticle.pdfPublished Version1.28 MBAdobe PDF
Download Adobe
View/Open
Show full item record

SCOPUSTM   
Citations

13
checked on Jul 20, 2024

Page view(s)

1,110
checked on Jun 18, 2023

Download(s)

6
checked on Jun 18, 2023
Google Media

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons