The Pfam protein families database: towards a more sustainable future
- PMID: 26673716
- PMCID: PMC4702930
- DOI: 10.1093/nar/gkv1344
The Pfam protein families database: towards a more sustainable future
Abstract
In the last two years the Pfam database (http://pfam.xfam.org) has undergone a substantial reorganisation to reduce the effort involved in making a release, thereby permitting more frequent releases. Arguably the most significant of these changes is that Pfam is now primarily based on the UniProtKB reference proteomes, with the counts of matched sequences and species reported on the website restricted to this smaller set. Building families on reference proteomes sequences brings greater stability, which decreases the amount of manual curation required to maintain them. It also reduces the number of sequences displayed on the website, whilst still providing access to many important model organisms. Matches to the full UniProtKB database are, however, still available and Pfam annotations for individual UniProtKB sequences can still be retrieved. Some Pfam entries (1.6%) which have no matches to reference proteomes remain; we are working with UniProt to see if sequences from them can be incorporated into reference proteomes. Pfam-B, the automatically-generated supplement to Pfam, has been removed. The current release (Pfam 29.0) includes 16 295 entries and 559 clans. The facility to view the relationship between families within a clan has been improved by the introduction of a new tool.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
![Figure 1.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/4702930/bin/gkv1344fig1.gif)
Similar articles
-
UniProt and Mass Spectrometry-Based Proteomics-A 2-Way Working Relationship.Mol Cell Proteomics. 2023 Aug;22(8):100591. doi: 10.1016/j.mcpro.2023.100591. Epub 2023 Jun 8. Mol Cell Proteomics. 2023. PMID: 37301379 Free PMC article. Review.
-
The Pfam protein families database in 2019.Nucleic Acids Res. 2019 Jan 8;47(D1):D427-D432. doi: 10.1093/nar/gky995. Nucleic Acids Res. 2019. PMID: 30357350 Free PMC article.
-
Pfam: the protein families database.Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30. doi: 10.1093/nar/gkt1223. Epub 2013 Nov 27. Nucleic Acids Res. 2014. PMID: 24288371 Free PMC article.
-
Pfam 10 years on: 10,000 families and still growing.Brief Bioinform. 2008 May;9(3):210-9. doi: 10.1093/bib/bbn010. Epub 2008 Mar 15. Brief Bioinform. 2008. PMID: 18344544 Review.
-
The Pfam protein families database.Nucleic Acids Res. 2008 Jan;36(Database issue):D281-8. doi: 10.1093/nar/gkm960. Epub 2007 Nov 26. Nucleic Acids Res. 2008. PMID: 18039703 Free PMC article.
Cited by
-
Genome-wide identification and investigation of monosaccharide transporter gene family based on their evolution and expression analysis under abiotic stress and hormone treatments in maize (Zea mays L.).BMC Plant Biol. 2024 Jun 4;24(1):496. doi: 10.1186/s12870-024-05186-2. BMC Plant Biol. 2024. PMID: 38831278 Free PMC article.
-
Comparative RNA Genomics.Methods Mol Biol. 2024;2802:347-393. doi: 10.1007/978-1-0716-3838-5_12. Methods Mol Biol. 2024. PMID: 38819565
-
UBC Gene Family Analysis in Salvia castanea and Roles of ScUBC2/5 Genes under Abiotic Stress.Plants (Basel). 2024 May 14;13(10):1353. doi: 10.3390/plants13101353. Plants (Basel). 2024. PMID: 38794424 Free PMC article.
-
Genome-Wide Identification and Characterization of Homeobox Transcription Factors in Phoma sorghina var. saccharum Causing Sugarcane Twisted Leaf Disease.Int J Mol Sci. 2024 May 14;25(10):5346. doi: 10.3390/ijms25105346. Int J Mol Sci. 2024. PMID: 38791383 Free PMC article.
-
Transcriptomic Profiles of Long Noncoding RNAs and Their Target Protein-Coding Genes Reveals Speciation Adaptation on the Qinghai-Xizang (Tibet) Plateau in Orinus.Biology (Basel). 2024 May 16;13(5):349. doi: 10.3390/biology13050349. Biology (Basel). 2024. PMID: 38785831 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases