Article (Published version) (748 Kb) - Free access
Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.
|Published in||International Conference on Intelligent Systems for Molecular Biology. Proceedings. 1997, vol. 5, p. 33-43|
|Abstract||SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.|
|Keywords||Amino Acid Sequence — Databases, Factual — Genome — Proteins/genetics — Software — Software Design|
|Research groups||Calipho (80)|
Swiss-Prot Research Group
|APWEILER, Rolf et al. Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL. In: International Conference on Intelligent Systems for Molecular Biology. Proceedings, 1997, vol. 5, p. 33-43. https://archive-ouverte.unige.ch/unige:39289|