Scientific Article
unige:30581

Protein coding potential of retroviruses and other transposable elements in vertebrate genomes

Zdobnov, Evgeny
Campillos, Mónica
Harrington, Eoghan D
Torrents, David
Bork, Peer
Published in Nucleic acids research. 2005, vol. 33, no. 3, p. 946-54
Abstract We suggest an annotation strategy for genes encoded by retroviruses and transposable elements (RETRA genes) based on a set of marker protein domains. Usually RETRA genes are masked in vertebrate genomes prior to the application of automated gene prediction pipelines under the assumption that they provide no selective advantage to the host. Yet, we show that about 1000 genes in four vertebrate gene sets analyzed contain at least one RETRA gene marker domain. Using the conservation of genomic neighborhood (synteny), we were able to discriminate between RETRA genes with putative functionality in the vertebrates and those that probably function only in the context of mobile elements. We identified 35 such genes in human, along with their corresponding mouse and rat orthologs, which included almost all known human genes with similarity to mobile elements. The results also imply that the vast majority of the remaining RETRA genes in current gene sets are unlikely to encode vertebrate functions. To automatically annotate RETRA genes in other vertebrate genomes, we provide as a tool a set of marker protein domains and a manually refined list of domesticated or ancestral RETRA genes for rescuing genes with vertebrate functions.
Keywords Animals; Computational Biology; DNA Transposable Elements; Endogenous Retroviruses/genetics; Genetic Code; Genetic Markers; Genomics; Humans; Mice; Protein Structure, Tertiary; Proteins/genetics; Rats; Retroelements; Synteny; Takifugu/genetics
PMID: 15716312
Full text
Article (Published version) (211 Kb) - document accessible for UNIGE members only
(ISO format)
ZDOBNOV, Evgeny et al. Protein coding potential of retroviruses and other transposable elements in vertebrate genomes. In: Nucleic acids research, 2005, vol. 33, n° 3, p. 946-54. doi: 10.1093/nar/gki236



Deposited on : 2013-10-22
