The effects of number of reference individuals on the accuracy of imputation from low and medium densities to high density

Author(s)
Ferdosi, M H
Connors, N K
Khansefid, M
Publication Date
2021
Abstract
Paper presented by Mohammad Ferdosi
Abstract
<p>Imputation is a common approach to infer the missing markers for individuals with low marker density (i.e. target population) from a reference population genotyped with higher-density Single-Nucleotide Polymorphism (SNP) panels. Several factors affect the imputation accuracy of untyped, including the number of reference individuals, marker density and population structure. This paper investigates the effects of these factors on the accuracy of imputationby using individuals of a single cattle breed or multiple cattle breeds in the reference population with 600k marker density, as well as assuming the target population was genotyped with low (15k) or medium (30k) marker density. To achieve a within breed imputation accuracy of >90%, we required at least 500 individuals in the reference population when the target population was genotyped with 15k SNP panel. Whereas, if the reference population consisted of a mixture of purebred and multi-breed individuals, the SNP density must be at least 30k in the target population, and there must be more than 900 individuals in the reference population to achieve a similar level of accuracy.</p>
Citation
Proceedings of the Association for the Advancement of Animal Breeding and Genetics, v.24, p. 114-117
ISSN
1328-3227
Link
Publisher
Association for the Advancement of Animal Breeding and Genetics (AAABG)
Title
The effects of number of reference individuals on the accuracy of imputation from low and medium densities to high density
Type of document
Conference Publication
Entity Type
Publication

Files:

NameSizeformatDescriptionLink