TY - JOUR
T1 - The Pfam protein families database in 2019
AU - El-Gebali, Sara
AU - Mistry, Jaina
AU - Bateman, Alex
AU - Eddy, Sean R.
AU - Luciani, Aurélien
AU - Potter, Simon C.
AU - Qureshi, Matloob
AU - Richardson, Lorna J.
AU - Salazar, Gustavo A.
AU - Smart, Alfredo
AU - Sonnhammer, Erik L.L.
AU - Hirsh, Layla
AU - Paladin, Lisanna
AU - Piovesan, Damiano
AU - Tosatto, Silvio C.E.
AU - Finn, Robert D.
N1 - Publisher Copyright:
© 2018 The Author(s).
PY - 2019/1/8
Y1 - 2019/1/8
N2 - The last few years have witnessed significant changes in Pfam (https://pfam.xfam.org). The number of families has grown substantially to a total of 17,929 in release 32.0. New additions have been coupled with efforts to improve existing families, including refinement of domain boundaries, their classification into Pfam clans, as well as their functional annotation. We recently began to collaborate with the RepeatsDB resource to improve the definition of tandem repeat families within Pfam. We carried out a significant comparison to the structural classification database, namely the Evolutionary Classification of Protein Domains (ECOD) that led to the creation of 825 new families based on their set of uncharacterized families (EUFs). Furthermore, we also connected Pfam entries to the Sequence Ontology (SO) through mapping of the Pfam type definitions to SO terms. Since Pfam has many community contributors, we recently enabled the linking between authorship of all Pfam entries with the corresponding authors- ORCID identifiers. This effectively permits authors to claim credit for their Pfam curation and link them to their ORCID record.
AB - The last few years have witnessed significant changes in Pfam (https://pfam.xfam.org). The number of families has grown substantially to a total of 17,929 in release 32.0. New additions have been coupled with efforts to improve existing families, including refinement of domain boundaries, their classification into Pfam clans, as well as their functional annotation. We recently began to collaborate with the RepeatsDB resource to improve the definition of tandem repeat families within Pfam. We carried out a significant comparison to the structural classification database, namely the Evolutionary Classification of Protein Domains (ECOD) that led to the creation of 825 new families based on their set of uncharacterized families (EUFs). Furthermore, we also connected Pfam entries to the Sequence Ontology (SO) through mapping of the Pfam type definitions to SO terms. Since Pfam has many community contributors, we recently enabled the linking between authorship of all Pfam entries with the corresponding authors- ORCID identifiers. This effectively permits authors to claim credit for their Pfam curation and link them to their ORCID record.
UR - http://www.scopus.com/inward/record.url?scp=85059794859&partnerID=8YFLogxK
U2 - 10.1093/nar/gky995
DO - 10.1093/nar/gky995
M3 - Article
C2 - 30357350
AN - SCOPUS:85059794859
SN - 0305-1048
VL - 47
SP - D427-D432
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - D1
ER -