Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/9233
Title: Haplotype metrics and functional data mining from high-throughput SNP genotyping: A bioinformatics approach
Contributor(s): Goodswen, Stephen James (author); Gondro, Cedric  (supervisor)orcid ; Werf, Julius Van Der  (supervisor)orcid ; Kadarmideen, Haja (supervisor)
Conferred Date: 2011
Copyright Date: 2010
Handle Link: https://hdl.handle.net/1959.11/9233
Abstract: The aftermath of the Human Genome Project has generated new revolutionary techniques and equipment such as high throughput measurement tools for collecting biological information. One notable tool is a microarray that can be used to genotype thousands of single nucleotide polymorphisms (SNPs) in one run. The main aim of the thesis was to implement a bioinformatics approach to transform biological data generated from high-throughput SNP genotyping into useful information. One of the main outcomes from whole genome association studies (WGAS) is a subset of statistically significant SNPs and a major challenge to a researcher is minimising false positive rates while maintaining the power to identify true positive associations. The need for a SNP annotation tool to assist a researcher in making informed judgments as to whether a significant SNP could be a causal variant or in linkage disequilibrium (LD) with a causal variant, provided the motivation to develop FunctSNP. The thesis describes the development of FunctSNP, which is an R package that provides the user interface to custom built species-specific databases. These local relational databases contain SNP data together with functional annotations extracted from online resources. The databases are scheduled for automatic creation or updated periodically by a suite of Perl scripts called dbAutoMaker. The thesis also describes the development of dbAutoMaker. The use of FunctSNP is illustrated with a livestock example. WGAS relies on a natural phenomenon of linkage disequilibrium between SNP markers and causal variants. For WGAS to be applied successfully there is a need to understand the extent and distribution of LD across the entire genome in a population. The need to know how LD (and haplotype diversity) varies from one region or population to another provided the motivation to develop SNPpattern. The thesis describes the development of SNPpattern, which is the collective name for a suite of Perl scripts essentially designed to group, count, and compare SNP allele patterns of various block sizes. Differences in SNP allele block frequency are used as a measure of haplotype diversity within and between groups. The use of SNPpattern is illustrated on sheep breeds.
Publication Type: Thesis Masters Research
Fields of Research (FoR) 2008: 060412 Quantitative Genetics (incl Disease and Trait Mapping Genetics)
Socio-Economic Objective (SEO) 2008: 830301 Beef Cattle
Rights Statement: Copyright 2010 - Stephen James Goodswen
HERDC Category Description: T1 Thesis - Masters Degree by Research
Publisher/associated links: http://www.aaabg.org/proceedings18/files/goodswen454.pdf
http://www.kongressband.de/wcgalp2010/assets/html/0947.htm
Appears in Collections:Thesis Masters Research

Files in This Item:
6 files
File Description SizeFormat 
Show full item record

Page view(s)

2,124
checked on Mar 7, 2023

Download(s)

2
checked on Mar 7, 2023
Google Media

Google ScholarTM

Check


Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.