Automated generation of heuristics for biological sequence comparison
- PMID: 15713233
- PMCID: PMC553969
- DOI: 10.1186/1471-2105-6-31
Automated generation of heuristics for biological sequence comparison
Abstract
Background: Exhaustive methods of sequence alignment are accurate but slow, whereas heuristic approaches run quickly, but their complexity makes them more difficult to implement. We introduce bounded sparse dynamic programming (BSDP) to allow rapid approximation to exhaustive alignment. This is used within a framework whereby the alignment algorithms are described in terms of their underlying model, to allow automated development of efficient heuristic implementations which may be applied to a general set of sequence comparison problems.
Results: The speed and accuracy of this approach compares favourably with existing methods. Examples of its use in the context of genome annotation are given.
Conclusions: This system allows rapid implementation of heuristics approximating to many complex alignment models, and has been incorporated into the freely available sequence alignment program, exonerate.
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/553969/bin/1471-2105-6-31-1.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/553969/bin/1471-2105-6-31-2.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/553969/bin/1471-2105-6-31-3.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/553969/bin/1471-2105-6-31-4.gif)
![Figure 5](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/553969/bin/1471-2105-6-31-5.gif)
![Figure 6](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/553969/bin/1471-2105-6-31-6.gif)
Similar articles
-
Heuristic reusable dynamic programming: efficient updates of local sequence alignment.IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):570-82. doi: 10.1109/TCBB.2009.30. IEEE/ACM Trans Comput Biol Bioinform. 2009. PMID: 19875856
-
Computation and analysis of genomic multi-sequence alignments.Annu Rev Genomics Hum Genet. 2007;8:193-213. doi: 10.1146/annurev.genom.8.080706.092300. Annu Rev Genomics Hum Genet. 2007. PMID: 17489682 Review.
-
transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156. BMC Bioinformatics. 2005. PMID: 15969769 Free PMC article.
-
ParAlign: a parallel sequence alignment algorithm for rapid and sensitive database searches.Nucleic Acids Res. 2001 Apr 1;29(7):1647-52. doi: 10.1093/nar/29.7.1647. Nucleic Acids Res. 2001. PMID: 11266569 Free PMC article.
-
Approaches to the automatic discovery of patterns in biosequences.J Comput Biol. 1998 Summer;5(2):279-305. doi: 10.1089/cmb.1998.5.279. J Comput Biol. 1998. PMID: 9672833 Review.
Cited by
-
Haplotype-resolved chromosome-level genome assembly of Ehretia macrophylla.Sci Data. 2024 Jun 5;11(1):589. doi: 10.1038/s41597-024-03431-9. Sci Data. 2024. PMID: 38839803 Free PMC article.
-
Mitochondrial genomic characteristics and phylogenetic analysis of a brewing fungus, Rhizopus microsporus Tiegh. 1875 (Mucorales: Rhizopodaceae).Mitochondrial DNA B Resour. 2024 May 20;9(5):657-662. doi: 10.1080/23802359.2024.2356133. eCollection 2024. Mitochondrial DNA B Resour. 2024. PMID: 38774188 Free PMC article.
-
Two telomere-to-telomere gapless genomes reveal insights into Capsicum evolution and capsaicinoid biosynthesis.Nat Commun. 2024 May 20;15(1):4295. doi: 10.1038/s41467-024-48643-0. Nat Commun. 2024. PMID: 38769327 Free PMC article.
-
Signatures of transposon-mediated genome inflation, host specialization, and photoentrainment in Entomophthora muscae and allied entomophthoralean fungi.Elife. 2024 May 20;12:RP92863. doi: 10.7554/eLife.92863. Elife. 2024. PMID: 38767950 Free PMC article.
-
The chromosome-level genome and functional database accelerate research about biosynthesis of secondary metabolites in Rosa roxburghii.BMC Plant Biol. 2024 May 17;24(1):410. doi: 10.1186/s12870-024-05109-1. BMC Plant Biol. 2024. PMID: 38760710 Free PMC article.
References
-
- Box GE. Robustness in the Strategy of Scientific Model Building. In: Launer R, Wilkinson G, editor. Robustness in Statistics. Academic Press New York; 1979.
-
- Smith T, Waterman M. Identification of Common Molecular Subsequences. Journal of Molecular Biology. 1981;147:195–197. - PubMed
-
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic Local Alignment Search Tool. Journal of Molecular Biology. 1990;215:403–410. - PubMed
-
- Searls DB, Murphy KP. Proceedings of the Third International Conference On Intelligent Systems for Molecular Biology. The AAAI Press; 1995. Automata-Theoretic Models of Mutation and Alignment; pp. 341–349. - PubMed
-
- Searls DB. Sequence alignment through pictures. Trends in Genetics. 1996;12:35–37. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources