Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/60532
Title: Understanding sequencing data as compositions: an outlook and review
Contributor(s): Quinn, Thomas P (author); Erb, Ionas (author); Richardson, Mark F (author); Crowley, Tamsyn M  (author)
Publication Date: 2018-08
Early Online Version: 2018-03-28
Open Access: Yes
DOI: 10.1093/bioinformatics/bty175
Handle Link: https://hdl.handle.net/1959.11/60532
Abstract: 

Motivation: Although seldom acknowledged explicitly, count data generated by sequencing platforms exist as compositions for which the abundance of each component (e.g. gene or transcript) is only coherently interpretable relative to other components within that sample. This property arises from the assay technology itself, whereby the number of counts recorded for each sample is constrained by an arbitrary total sum (i.e. library size). Consequently, sequencing data, as compositional data, exist in a non-Euclidean space that, without normalization or transformation, renders invalid many conventional analyses, including distance measures, correlation coefficients and multivariate statistical models.

Results: The purpose of this review is to summarize the principles of compositional data analysis (CoDA), provide evidence for why sequencing data are compositional, discuss compositionally valid methods available for analyzing sequencing data, and highlight future directions with regard to this field of study.

Publication Type: Journal Article
Source of Publication: Bioinformatics, 34(16), p. 2870-2878
Publisher: ASFRA B V
Place of Publication: The Netherlands
ISSN: 0927-4588
Fields of Research (FoR) 2020: 3003 Animal production
Peer Reviewed: Yes
HERDC Category Description: C1 Refereed Article in a Scholarly Journal
Appears in Collections:Journal Article
PoultryHub Australia

Files in This Item:
2 files
File Description SizeFormat 
openpublished/UnderstandingCrowley2018JournalArticle.pdfPublished Version221.68 kBAdobe PDF
Download Adobe
View/Open
Show full item record

SCOPUSTM   
Citations

203
checked on Feb 15, 2025

Page view(s)

532
checked on Feb 16, 2025

Download(s)

98
checked on Feb 16, 2025
Google Media

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons