PseudoPipe: an automated pseudogene identification pipeline
- PMID: 16574694
- DOI: 10.1093/bioinformatics/btl116
Abstract
Motivation: Mammalian genomes contain many 'genomic fossils', i.e. pseudogenes. These are disabled copies of functional genes that arose through gene duplication or retrotransposition events and have been retained in the genome. Pseudogenes are important resources for understanding the evolutionary history of genes and genomes.
Results: We have developed a homology-based computational pipeline ('PseudoPipe') that can search a mammalian genome and identify pseudogene sequences in a comprehensive and consistent manner. The key steps in the pipeline involve using BLAST to rapidly cross-reference potential 'parent' proteins against the intergenic regions of the genome and then processing the resulting 'raw hits': eliminating redundant ones, clustering together neighbors, and associating and aligning clusters with a unique parent. Finally, pseudogenes are classified based on a combination of criteria including homology, intron-exon structure, and existence of stop codons and frameshifts.
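The hit-processing steps named in the Results (eliminating redundant hits, clustering neighbors, associating each cluster with a unique parent) can be illustrated with a minimal Python sketch. This is not the PseudoPipe implementation: the `Hit` record, the greedy overlap filter, the `max_gap` threshold of 60 bases, and the "lowest E-value wins" parent rule are all assumptions chosen for illustration, and the alignment and classification stages are omitted.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Hit:
    """One raw BLAST hit of a parent protein on intergenic DNA (hypothetical record)."""
    chrom: str
    strand: str
    start: int      # genomic start, inclusive
    end: int        # genomic end, inclusive
    parent: str     # ID of the query ("parent") protein
    evalue: float


def overlaps(a: Hit, b: Hit) -> bool:
    """True if two hits share at least one base on the same chromosome and strand."""
    return (a.chrom == b.chrom and a.strand == b.strand
            and a.start <= b.end and b.start <= a.end)


def remove_redundant(hits: List[Hit]) -> List[Hit]:
    """Greedy filter (assumed rule): visit hits from best to worst E-value and
    drop any hit that overlaps one already kept."""
    kept: List[Hit] = []
    for h in sorted(hits, key=lambda x: x.evalue):
        if not any(overlaps(h, k) for k in kept):
            kept.append(h)
    return kept


def cluster_neighbors(hits: List[Hit], max_gap: int = 60) -> List[List[Hit]]:
    """Group non-overlapping hits on the same chromosome and strand that lie
    within max_gap bases of each other (max_gap is an assumed threshold)."""
    clusters: List[List[Hit]] = []
    for h in sorted(hits, key=lambda x: (x.chrom, x.strand, x.start)):
        if clusters:
            last = clusters[-1][-1]
            if (last.chrom == h.chrom and last.strand == h.strand
                    and h.start - last.end <= max_gap):
                clusters[-1].append(h)
                continue
        clusters.append([h])
    return clusters


def assign_parent(cluster: List[Hit]) -> str:
    """Pick a single parent per cluster: the one with the lowest E-value (assumed rule)."""
    return min(cluster, key=lambda x: x.evalue).parent


# Toy input: made-up coordinates and protein IDs, for illustration only.
raw_hits = [
    Hit("chr1", "+", 100, 200, "P1", 1e-30),
    Hit("chr1", "+", 150, 220, "P2", 1e-10),    # overlaps the first hit, weaker
    Hit("chr1", "+", 230, 300, "P1", 1e-20),    # 30 bases downstream of the first
    Hit("chr1", "+", 5000, 5100, "P3", 1e-5),   # far away, forms its own cluster
]

kept = remove_redundant(raw_hits)
clusters = cluster_neighbors(kept)
parents = [assign_parent(c) for c in clusters]
print(parents)
```

On this toy input the weaker overlapping hit is dropped, the two nearby hits merge into one cluster, and each cluster is assigned one parent. A fuller treatment would also check that merged hits are consistent with the same parent protein before realigning the cluster, which this sketch does not attempt.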