Primer to Analysis of Genomic Data Using R

Title
Primer to Analysis of Genomic Data Using R
Publication Date
2015
Author(s)
Gondro, Cedric
( author )
OrcID: https://orcid.org/0000-0003-0666-656X
Email: cgondro2@une.edu.au
UNE Id une-id:cgondro2
Type of document
Book
Language
en
Entity Type
Publication
Publisher
Springer
Place of publication
Cham, Switzerland
Edition
1
Series
Use R!
DOI
10.1007/978-3-319-14475-7
UNE publication id
une:19905
Abstract
Through this book, researchers and students will learn to use R for analysis of large-scale genomic data and how to create routines to automate analytical steps. The philosophy behind the book is to start with real world raw datasets and perform all the analytical steps needed to reach final results. Though theory plays an important role, this is a practical book for graduate and undergraduate courses in bioinformatics and genomic analysis or for use in lab sessions. How to handle and manage high-throughput genomic data, create automated workflows and speed up analyses in R is also taught. A wide range of R packages useful for working with genomic data are illustrated with practical examples. The key topics covered are association studies, genomic prediction, estimation of population genetic parameters and diversity, gene expression analysis, functional annotation of results using publically available databases and how to work efficiently in R with large genomic datasets. Important principles are demonstrated and illustrated through engaging examples which invite the reader to work with the provided datasets. Some methods that are discussed in this volume include: signatures of selection, population parameters (LD, FST, FIS, etc); use of a genomic relationship matrix for population diversity studies; use of SNP data for parentage testing; snpBLUP and gBLUP for genomic prediction. Step-by-step, all the R code required for a genome-wide association study is shown: starting from raw SNP data, how to build databases to handle and manage the data, quality control and filtering measures, association testing and evaluation of results, through to identification and functional annotation of candidate genes. Similarly, gene expression analyses are shown using microarray and RNAseq data. At a time when genomic data is decidedly big, the skills from this book are critical. In recent years R has become the de facto.
Link
ISBN
9783319144740
9783319144757

Files:

NameSizeformatDescriptionLink