Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/19653
Title: Efficient algorithms for using genotypic data
Contributor(s): Ferdosi, Mohammad Hossein (author); van der Werf, Julius  (supervisor)orcid ; Gondro, Cedric  (supervisor)orcid ; Tier, Bruce  (supervisor)
Conferred Date: 2016
Copyright Date: 2015
Open Access: Yes
Handle Link: https://hdl.handle.net/1959.11/19653
Abstract: The aim of this thesis is to explore the specific structure in livestock populations to unravel hidden information such as recombination events and parental origin of markers in the genomic data. This information then can be used to improve the accuracy of prediction of breeding values which is one of the main aims of animal breeding. In the first experimental chapter an efficient method for detecting opposing homozygotes was proposed. This method makes the detection of opposing homozygote for thousands of individuals and millions of markers feasible. An opposing homozygote matrix can be utilised to identify Mendelian inconsistency and to fix pedigree errors. The second experimental chapter used opposing homozygotes between individuals in a half-sib family to identify recombination events in the sire, to impute sire haplotype and to reconstruct haplotype of offspring. The algorithm was compared with other frequently used methods, using both simulated and real data. The accuracy of detecting recombination events and of haplotype reconstruction was higher with this algorithm than with other algorithms, especially when there were genotyping errors in the dataset. For example, the accuracy of haplotype reconstruction was around 0.97 for a half-sib family size of 4 and the accuracy of sire imputation was 0.75 and 1.00 for a half-sib family size of 4 and 40, respectively. In the third experimental chapter hsphase was developed which implements the algorithms used in the first two chapters into an efficient R package. In addition, an algorithm for grouping half-sib families utilising the opposing homozygote matrix was developed and verified with real datasets. The results show that the algorithm can group the half-sib families accurately, however the accuracy was depended on sample size and genetic diversity in the population. The package includes several diagnostic functions to visualise and check half-sib's pedigree, parentage assignments, and phased haplotypes of offspring in a half-sib family. The fourth experimental chapter utilised the half-sib population structure to fix switch errors. The switch error is a common problem in many haplotype reconstruction algorithms where the haplotype phase is locally correct but paternal and maternal strand are not consistently and correctly assigned across the longer segments (or across the entire genome). The algorithm partitions the genome into segments and creates a group matrix which is used to identify the switch points. Then the switches are fixed with a second algorithm. The results showed that this algorithm can fix the switch problems efficiently and increase the accuracy of genome-wide phasing. In chapter five relationship matrices generated from haplotype segments were used to improve the accuracy of predicting breeding values. The haplotypes were partitioned in three ways and with various size. The new relationship matrices were evaluated with three sets of real data and with simulated data. In all cases the accuracy of prediction and log-likelihood were significantly increased although the amount of increase was trait dependent.
Publication Type: Thesis Doctoral
Field of Research Codes: 070201 Animal Breeding
060412 Quantitative Genetics (incl Disease and Trait Mapping Genetics)
060408 Genomics
Socio-Economic Outcome Codes: 970107 Expanding Knowledge in the Agricultural and Veterinary Sciences
Rights Statement: Copyright 2015 - Mohammad Hossein Ferdosi
HERDC Category Description: T2 Thesis - Doctorate by Research
Statistics to Oct 2018: Visitors: 182
Views: 580
Downloads: 33
Appears in Collections:Animal Genetics and Breeding Unit (AGBU)
Thesis Doctoral

Files in This Item:
10 files
File Description SizeFormat 
open/MARCXML.xmlMARCXML.xml5.13 kBUnknownView/Open
open/SOURCE03.pdfAbstract471.32 kBAdobe PDF
Download Adobe
View/Open
open/SOURCE04.pdfThesis396.93 kBAdobe PDF
Download Adobe
View/Open
1 2 Next
Show full item record

Page view(s)

110
checked on Dec 29, 2018

Download(s)

44
checked on Dec 29, 2018
Google Media

Google ScholarTM

Check


Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.