Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/58355
Full metadata record
DC Field | Value | Language
dc.contributor.author | Shahinfar, Saleh | en
dc.contributor.author | Meek, Paul | en
dc.contributor.author | Falzon, Gregory | en
dc.date.accessioned | 2024-04-15T05:28:04Z | -
dc.date.available | 2024-04-15T05:28:04Z | -
dc.date.issued | 2020 | -
dc.identifier.citation | Ecological Informatics, v.57, p. 1-16 | en
dc.identifier.issn | 1878-0512 | en
dc.identifier.issn | 1574-9541 | en
dc.identifier.uri | https://hdl.handle.net/1959.11/58355 | -
dc.description.abstract | <p>Deep learning (DL) algorithms are the state of the art in automated classification of wildlife camera trap images. The challenge is that the ecologist cannot know in advance how many images per species they need to collect for model training in order to achieve their desired classification accuracy. In fact, there is limited empirical evidence in the context of camera trapping to demonstrate that increasing sample size will lead to improved accuracy.</p> <p>In this study we explore in depth the issue of deep learning model performance for progressively increasing per-class (species) sample sizes. We also provide ecologists with an approximation formula to estimate a priori how many images per animal species they need for a given accuracy level. This will help ecologists allocate resources optimally and design efficient studies.</p> <p>In order to investigate the effect of the number of training images, seven training sets with 10, 20, 50, 150, 500, 1000 images per class were designed. Six deep learning architectures, namely ResNet-18, ResNet-50, ResNet-152, DenseNet-121, DenseNet-161, and DenseNet-201, were trained and tested on a common exclusive testing set of 250 images per class. The whole experiment was repeated on three similar datasets from Australia, Africa and North America, and the results were compared. Simple regression equations are provided for practitioners to approximate model performance metrics. Generalised additive models (GAMs) are shown to be effective in modelling DL performance metrics based on the number of training images per class, tuning scheme and dataset.</p> <p>Overall, our trained models classified images with 0.94 accuracy (ACC), 0.73 precision (PRC), 0.72 true positive rate (TPR), and 0.03 false positive rate (FPR). Variation in model performance metrics among datasets, species and deep learning architectures exists and is presented in detail in the discussion section. The ordinary least squares regression models explained 57%, 54%, 52%, and 34% of the expected variation in ACC, PRC, TPR, and FPR, respectively, according to the number of images available for training. Generalised additive models explained 77%, 69%, 70%, and 53% of deviance for ACC, PRC, TPR, and FPR, respectively.</p> <p>Predictive models were developed linking the number of training images per class, model architecture and dataset to performance metrics. The ordinary least squares regression and generalised additive models developed provide a practical toolbox to estimate model performance with respect to different numbers of training images.</p> | en
dc.language | en | en
dc.publisher | Elsevier BV | en
dc.relation.ispartof | Ecological Informatics | en
dc.title | “How many images do I need?” Understanding how sample size per class affects deep learning model performance metrics for balanced designs in autonomous wildlife monitoring | en
dc.type | Journal Article | en
dc.identifier.doi | 10.1016/j.ecoinf.2020.101085 | en
local.contributor.firstname | Saleh | en
local.contributor.firstname | Paul | en
local.contributor.firstname | Gregory | en
local.profile.school | School of Science and Technology | en
local.profile.school | School of Environmental and Rural Science | en
local.profile.school | School of Science and Technology | en
local.profile.email | sshahinf@une.edu.au | en
local.profile.email | pmeek5@une.edu.au | en
local.profile.email | gfalzon2@une.edu.au | en
local.output.category | C1 | en
local.record.place | au | en
local.record.institution | University of New England | en
local.publisher.place | The Netherlands | en
local.identifier.runningnumber | 101085 | en
local.format.startpage | 1 | en
local.format.endpage | 16 | en
local.peerreviewed | Yes | en
local.identifier.volume | 57 | en
local.contributor.lastname | Shahinfar | en
local.contributor.lastname | Meek | en
local.contributor.lastname | Falzon | en
dc.identifier.staff | une-id:sshahinf | en
dc.identifier.staff | une-id:pmeek5 | en
dc.identifier.staff | une-id:gfalzon2 | en
local.profile.orcid | 0000-0002-1989-9357 | en
local.profile.role | author | en
local.profile.role | author | en
local.profile.role | author | en
local.identifier.unepublicationid | une:1959.11/58355 | en
dc.identifier.academiclevel | Academic | en
dc.identifier.academiclevel | Academic | en
dc.identifier.academiclevel | Academic | en
local.title.maintitle | “How many images do I need?” Understanding how sample size per class affects deep learning model performance metrics for balanced designs in autonomous wildlife monitoring | en
local.relation.fundingsourcenote | Funding for this project was provided by the Australian Government Department of Agriculture and Water Resources through the eTechnology Hub – Utilising Technology to Improve Pest Management Effectiveness and Enhance Welfare Outcomes project. | en
local.output.categorydescription | C1 Refereed Article in a Scholarly Journal | en
local.search.author | Shahinfar, Saleh | en
local.search.author | Meek, Paul | en
local.search.author | Falzon, Gregory | en
local.uneassociation | Yes | en
local.atsiresearch | No | en
local.sensitive.cultural | No | en
local.year.published | 2020 | en
local.fileurl.closedpublished | https://rune.une.edu.au/web/retrieve/6072b8a8-881a-4262-99d1-9a506f2d9e96 | en
local.subject.for2020 | 3003 Animal production not elsewhere classified | en
local.subject.seo2020 | tbd | en
local.profile.affiliationtype | UNE Affiliation | en
local.profile.affiliationtype | UNE Affiliation | en
local.profile.affiliationtype | UNE Affiliation | en
local.date.moved | 2024-04-15 | en
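The sample-size approximation described in the abstract can be sketched in code. The following is a hypothetical illustration only: the accuracy values and the log-linear form are assumed for demonstration and are not the authors' published equations or data.

```python
import numpy as np

# Hypothetical illustration of the abstract's idea: fit an ordinary least
# squares regression of classification accuracy on log10(images per class),
# then invert it to approximate how many training images are needed for a
# target accuracy. The accuracy values below are synthetic, NOT paper results.
images_per_class = np.array([10, 20, 50, 150, 500, 1000])
accuracy = np.array([0.60, 0.68, 0.78, 0.85, 0.91, 0.94])  # synthetic

# OLS fit: accuracy ≈ intercept + slope * log10(n)
slope, intercept = np.polyfit(np.log10(images_per_class), accuracy, deg=1)

def predict_accuracy(n_images: float) -> float:
    """Approximate expected accuracy for n_images training images per class."""
    return intercept + slope * np.log10(n_images)

def images_needed(target_accuracy: float) -> float:
    """Invert the fit: images per class needed to reach target_accuracy."""
    return 10 ** ((target_accuracy - intercept) / slope)
```

Calling `images_needed(0.80)` on a fit like this gives a rough a priori sample-size estimate; a generalised additive model (e.g. via a GAM library) would relax the assumed log-linear form, which is consistent with the higher deviance explained that the abstract reports for GAMs.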
Appears in Collections:Journal Article
School of Environmental and Rural Science
School of Science and Technology
Files in This Item: 1 file
SCOPUS™ Citations: 101 (checked on Jul 6, 2024)
Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.