Please use this identifier to cite or link to this item:
https://hdl.handle.net/1959.11/2915
Title: | A simple genetic algorithm for multiple sequence alignment | Contributor(s): | Gondro, Cedric (author) ; Kinghorn, Brian (author) | Publication Date: | 2007 | Handle Link: | https://hdl.handle.net/1959.11/2915 | Abstract: | Multiple sequence alignment plays an important role in molecular sequence analysis. An alignment is the arrangement of two (pairwise alignment) or more (multiple alignment) sequences of 'residues' (nucleotides or amino acids) that maximizes the similarities between them. Algorithmically, the problem consists of opening and extending gaps in the sequences to maximize an objective function (measurement of similarity). A simple genetic algorithm was developed and implemented in the software MSA-GA. Genetic algorithms, a class of evolutionary algorithms, are well suited for problems of this nature since residues and gaps are discrete units. An evolutionary algorithm cannot compete in terms of speed with progressive alignment methods but it has the advantage of being able to correct for initially misaligned sequences; which is not possible with the progressive method. This was shown using the BaliBase benchmark, where Clustal-W alignments were used to seed the initial population in MSA-GA, improving outcome. Alignment scoring functions still constitute an open field of research, and it is important to develop methods that simplify the testing of new functions. A general evolutionary framework for testing and implementing different scoring functions was developed. The results show that a simple genetic algorithm is capable of optimizing an alignment without the need of the excessively complex operators used in prior study. The clear distinction between objective function and genetic algorithms used in MSA-GA makes extending and/or replacing objective functions a trivial task. | Publication Type: | Journal Article | Source of Publication: | Genetics and Molecular Research, 6(4), p. 964-982 | Publisher: | Fundacao de Pesquisas Cientificas de Ribeirao Preto | Place of Publication: | Brazil | ISSN: | 1676-5680 | Fields of Research (FoR) 2008: | 080108 Neural, Evolutionary and Fuzzy Computation | Socio-Economic Objective (SEO) 2008: | 970106 Expanding Knowledge in the Biological Sciences | Peer Reviewed: | Yes | HERDC Category Description: | C1 Refereed Article in a Scholarly Journal | Publisher/associated links: | http://www.funpecrp.com.br/GMR/year2007/vol4-6/xm0016_abstract.html |
---|---|
Appears in Collections: | Journal Article |
Files in This Item:
File | Description | Size | Format |
---|
Page view(s)
1,048
checked on Jun 11, 2023
Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.