Recommendations for utilizing and reporting population genetic analyses: the reproducibility of genetic clustering using the program STRUCTURE

Author(s)
Gilbert, Kimberly J
Andrew, Rose
Vines, Timothy H
Bock, Dan G
Franklin, Michelle T
Kane, Nolan C
Moore, Jean-Sebastien
Moyers, Brook T
Renaut, Sebastien
Rennison, Diana J
Veen, Thor
Publication Date
2012
Abstract
Reproducibility is the benchmark for results and conclusions drawn from scientific studies, but systematic studies on the reproducibility of scientific results are surprisingly rare. Moreover, many modern statistical methods make use of 'random walk' model fitting procedures, and these are inherently stochastic in their output. Does the combination of these statistical procedures and current standards of data archiving and method reporting permit the reproduction of the authors' results? To test this, we reanalysed data sets gathered from papers using the software package STRUCTURE to identify genetically similar clusters of individuals. We find that reproducing STRUCTURE results can be difficult despite the straightforward requirements of the program. Our results indicate that 30% of analyses were unable to reproduce the same number of population clusters. To improve this, we make recommendations for future use of the software and for reporting STRUCTURE analyses and results in published works.
Citation
Molecular Ecology, 21(20), p. 4925-4930
ISSN
1365-294X
0962-1083
Link
Language
en
Publisher
Blackwell Publishing Ltd
Title
Recommendations for utilizing and reporting population genetic analyses: the reproducibility of genetic clustering using the program STRUCTURE
Type of document
Journal Article
Entity Type
Publication

Files:

NameSizeformatDescriptionLink