TY - JOUR
T1 - RepeatsDB 2.0
T2 - Improved annotation, classification, search and visualization of repeat protein structures
AU - Paladin, Lisanna
AU - Hirsh, Layla
AU - Piovesan, Damiano
AU - Andrade-Navarro, Miguel A.
AU - Kajava, Andrey V.
AU - Tosatto, Silvio C.E.
N1 - Publisher Copyright: © 2016 The Author(s).
PY - 2017/1/1
Y1 - 2017/1/1
N2 - RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high-quality annotations for ∼5400 protein structures. RepeatsDB 2.0 features information on start and end positions for the repeat regions and units for all entries. The extensive growth of repeat unit characterization was made possible by applying the novel ReUPred annotation method over the entire Protein Data Bank, with data quality guaranteed by extensive manual validation for >60% of the entries. The updated web interface includes a new search engine for complex queries and a fully re-designed entry page for a better overview of structural data. It is now possible to compare unit positions, together with secondary structure, fold information and Pfam domains. Moreover, a new classification level has been introduced on top of the existing scheme as an independent layer for sequence similarity relationships at 40%, 60% and 90% identity.
AB - RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high-quality annotations for ∼5400 protein structures. RepeatsDB 2.0 features information on start and end positions for the repeat regions and units for all entries. The extensive growth of repeat unit characterization was made possible by applying the novel ReUPred annotation method over the entire Protein Data Bank, with data quality guaranteed by extensive manual validation for >60% of the entries. The updated web interface includes a new search engine for complex queries and a fully re-designed entry page for a better overview of structural data. It is now possible to compare unit positions, together with secondary structure, fold information and Pfam domains. Moreover, a new classification level has been introduced on top of the existing scheme as an independent layer for sequence similarity relationships at 40%, 60% and 90% identity.
UR - http://www.scopus.com/inward/record.url?scp=85016144082&partnerID=8YFLogxK
U2 - 10.1093/nar/gkw1136
DO - 10.1093/nar/gkw1136
M3 - Article
C2 - 27899671
AN - SCOPUS:85016144082
SN - 0305-1048
VL - 45
SP - D308-D312
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - D1
ER -