Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W309-12.
doi: 10.1093/nar/gkh379.

AUGUSTUS: a web server for gene finding in eukaryotes

Mario Stanke et al. Nucleic Acids Res.

Abstract

We present a web server for AUGUSTUS, a novel software program for ab initio gene prediction in eukaryotic genomic sequences. Our method is based on a generalized Hidden Markov Model with a new method for modeling the intron length distribution. This method approximates the true intron length distribution more accurately than existing programs do. For genomic sequence data from human and Drosophila melanogaster, the accuracy of AUGUSTUS is superior to existing gene-finding approaches. The advantage of our program becomes especially apparent for larger input sequences containing more than one gene. The server is available at http://augustus.gobics.de.
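As background for why the intron length distribution needs explicit modeling, the sketch below shows the length distribution that an ordinary (non-generalized) HMM state implies. A state with a self-transition probability can only produce geometrically distributed lengths, whose most likely value is always 1. This is a generic illustration and not the AUGUSTUS model; the function names and the value `stay_prob = 0.99` are assumptions chosen for the example.

```python
import random


def geometric_pmf(length, stay_prob):
    """P(L = length) for a plain HMM state that loops on itself with
    probability stay_prob and leaves with probability 1 - stay_prob."""
    return (stay_prob ** (length - 1)) * (1.0 - stay_prob)


def mean_duration(stay_prob):
    """Expected number of positions spent in the state."""
    return 1.0 / (1.0 - stay_prob)


def sample_state_duration(stay_prob, rng):
    """Simulate how long the chain stays in the state before leaving."""
    length = 1
    while rng.random() < stay_prob:
        length += 1
    return length


# Hypothetical self-transition probability, chosen only for illustration.
stay_prob = 0.99
rng = random.Random(0)
samples = [sample_state_duration(stay_prob, rng) for _ in range(20000)]
empirical_mean = sum(samples) / len(samples)

print("implied mean length:", mean_duration(stay_prob))
print("simulated mean length:", round(empirical_mean, 1))
# The implied distribution decreases monotonically from length 1,
# whatever value stay_prob takes.
print("P(L=1) > P(L=50):", geometric_pmf(1, stay_prob) > geometric_pmf(50, stay_prob))
```

A generalized HMM removes this restriction by letting a state carry its own explicit length distribution, which is the setting in which the abstract's intron length model operates.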

Figures

Figure 1
An example where the option ‘ignore conflicts with other strand’ helps. The lines in (a) show two nested Drosophila genes as annotated in FlyBase (12). The nine-exon gene on the forward strand includes a two-exon gene on the reverse strand within a long intron. The lines in (b) show the prediction with the default parameters. The gene on the forward strand is split into two genes by introducing two very short false positive exons so that the three predicted genes do not overlap. The lines in (c) show the prediction with the option ‘ignore conflicts with other strand’, which is identical to the annotation except for a short missed exon. This graphic has been obtained using gff2ps (13) from http://genome.imim.es/software/gfftools/GFF2PS.html.

References

    1. Reese,M.G., Kulp,D., Tammana,H. and Haussler,D. (2000) Gene finding in Drosophila melanogaster. Genome Res., 10, 529–538.
    2. Burge,C.B. (1997) Identification of genes in human genomic DNA. Ph.D. Thesis, Stanford University, Stanford, CA, USA.
    3. Parra,G., Blanco,E. and Guigó,R. (2000) GeneID in Drosophila. Genome Res., 10, 511–515.
    4. Rogic,S., Mackworth,A.K. and Ouellette,F.B.F. (2001) Evaluation of gene-finding programs on mammalian sequences. Genome Res., 11, 817–832.
    5. Claverie,J.-M. (1997) Computational methods for the identification of genes in vertebrate genomic sequences. Hum. Mol. Genet., 6, 1735–1744.