Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/19577
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAndrew, Roseen
dc.contributor.authorAlbert, Arianne YKen
dc.contributor.authorRenaut, Sebastienen
dc.contributor.authorRennison, Diana Jen
dc.contributor.authorBock, Dan Gen
dc.contributor.authorVines, Timen
dc.date.accessioned2016-10-13T16:23:00Z-
dc.date.issued2015-
dc.identifier.citationPeerJ, v.3, p. 1-22en
dc.identifier.issn2167-8359en
dc.identifier.urihttps://hdl.handle.net/1959.11/19577-
dc.description.abstractData are the foundation of empirical research, yet all too often the datasets underlying published papers are unavailable, incorrect, or poorly curated. This is a serious issue, because future researchers are then unable to validate published results or reuse data to explore new ideas and hypotheses. Even if data files are securely stored and accessible, they must also be accompanied by accurate labels and identifiers. To assess how often problems with metadata or data curation affect the reproducibility of published results, we attempted to reproduce Discriminant Function Analyses (DFAs) from the field of organismal biology. DFA is a commonly used statistical analysis that has changed little since its inception almost eight decades ago, and therefore provides an opportunity to test reproducibility among datasets of varying ages. Out of 100 papers we initially surveyed, fourteen were excluded because they did not present the common types of quantitative result from their DFA or gave insufficient details of their DFA. Of the remaining 86 datasets, there were 15 cases for which we were unable to confidently relate the dataset we received to the one used in the published analysis. The reasons ranged from incomprehensible or absent variable labels, the DFA being performed on an unspecified subset of the data, or the dataset we received being incomplete. We focused on reproducing three common summary statistics from DFAs: the percent variance explained, the percentage correctly assigned and the largest discriminant function coefficient. The reproducibility of the first two was fairly high (20 of 26, and 44 of 60 datasets, respectively), whereas our success rate with the discriminant function coefficients was lower (15 of 26 datasets). When considering all three summary statistics, we were able to completely reproduce 46 (65%) of 71 datasets. While our results show that a majority of studies are reproducible, they highlight the fact that many studies still are not the carefully curated research that the scientific community and public expects.en
dc.languageenen
dc.publisherPeerJ, Ltden
dc.relation.ispartofPeerJen
dc.titleAssessing the reproducibility of discriminant function analysesen
dc.typeJournal Articleen
dc.identifier.doi10.7717/peerj.1137en
dcterms.accessRightsGolden
dc.subject.keywordsEvolutionary Biologyen
dc.subject.keywordsEcologyen
local.contributor.firstnameRoseen
local.contributor.firstnameArianne YKen
local.contributor.firstnameSebastienen
local.contributor.firstnameDiana Jen
local.contributor.firstnameDan Gen
local.contributor.firstnameTimen
local.subject.for2008060299 Ecology not elsewhere classifieden
local.subject.for2008060399 Evolutionary Biology not elsewhere classifieden
local.subject.seo2008970106 Expanding Knowledge in the Biological Sciencesen
local.profile.schoolSchool of Environmental and Rural Scienceen
local.profile.emailrandre20@une.edu.auen
local.output.categoryC1en
local.record.placeauen
local.record.institutionUniversity of New Englanden
local.identifier.epublicationsrecordune-20160930-093255en
local.publisher.placeUnited Kingdomen
local.identifier.runningnumbere1137en
local.format.startpage1en
local.format.endpage22en
local.identifier.scopusid84940375536en
local.peerreviewedYesen
local.identifier.volume3en
local.access.fulltextYesen
local.contributor.lastnameAndrewen
local.contributor.lastnameAlberten
local.contributor.lastnameRenauten
local.contributor.lastnameRennisonen
local.contributor.lastnameBocken
local.contributor.lastnameVinesen
dc.identifier.staffune-id:randre20en
local.profile.orcid0000-0003-0099-8336en
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.identifier.unepublicationidune:19767en
dc.identifier.academiclevelAcademicen
local.title.maintitleAssessing the reproducibility of discriminant function analysesen
local.output.categorydescriptionC1 Refereed Article in a Scholarly Journalen
local.search.authorAndrew, Roseen
local.search.authorAlbert, Arianne YKen
local.search.authorRenaut, Sebastienen
local.search.authorRennison, Diana Jen
local.search.authorBock, Dan Gen
local.search.authorVines, Timen
local.uneassociationUnknownen
local.identifier.wosid000360844000009en
local.year.published2015en
local.subject.for2020310399 Ecology not elsewhere classifieden
local.subject.for2020310499 Evolutionary biology not elsewhere classifieden
local.subject.seo2020280102 Expanding knowledge in the biological sciencesen
Appears in Collections:Journal Article
Files in This Item:
2 files
File Description SizeFormat 
Show simple item record

SCOPUSTM   
Citations

7
checked on Jun 8, 2024

Page view(s)

1,324
checked on Apr 28, 2024
Google Media

Google ScholarTM

Check

Altmetric


Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.