UNIGE document Scientific Article
previous document  unige:33485  next document
add to browser collection

Assignment of protein function and discovery of novel nucleolar proteins based on automatic analysis of MEDLINE.

Schuemie, Martijn
Roes, Peter-Jan
Kors, Jan
Mons, Barend
Published in Proteomics. 2007, vol. 7, no. 6, p. 921-31
Abstract Attribution of the most probable functions to proteins identified by proteomics is a significant challenge that requires extensive literature analysis. We have developed a system for automated prediction of implicit and explicit biologically meaningful functions for a proteomics study of the nucleolus. This approach uses a set of vocabulary terms to map and integrate the information from the entire MEDLINE database. Based on a combination of cross-species sequence homology searches and the corresponding literature, our approach facilitated the direct association between sequence data and information from biological texts describing function. Comparison of our automated functional assignment to manual annotation demonstrated our method to be highly effective. To establish the sensitivity, we defined the functional subtleties within a family containing a highly conserved sequence. Clustering of the DEAD-box protein family of RNA helicases confirmed that these proteins shared similar morphology although functional subfamilies were accurately identified by our approach. We visualized the nucleolar proteome in terms of protein functions using multi-dimensional scaling, showing functional associations between nucleolar proteins that were not previously realized. Finally, by clustering the functional properties of the established nucleolar proteins, we predicted novel nucleolar proteins. Subsequently, nonproteomics studies confirmed the predictions of previously unidentified nucleolar proteins.
Keywords Amino Acid SequenceAnimalsDEAD-box RNA Helicases/chemistry/genetics/metabolismDatabases, ProteinHumansMEDLINEMolecular Sequence DataNuclear Proteins/chemistry/genetics/metabolismProteome
PMID: 17370270
Full text
Article (Published version) (361 Kb) - document accessible for UNIGE members only Limited access to UNIGE
Research group Groupe de Protéomique biomédicale (635)
(ISO format)
SCHUEMIE, Martijn et al. Assignment of protein function and discovery of novel nucleolar proteins based on automatic analysis of MEDLINE. In: Proteomics, 2007, vol. 7, n° 6, p. 921-31. doi: 10.1002/pmic.200600693 https://archive-ouverte.unige.ch/unige:33485

517 hits

0 download


Deposited on : 2014-01-21

Export document
Format :
Citation style :