UNIGE document Doctoral Thesis
unige:86046

Analysis of large biological data: metabolic network modularization and prediction of N-terminal acetylation

Defense Doctoral thesis: Univ. Genève, 2015 - Sc. 4883 - 2015/11/17
Abstract Over the last decades, advances in biotechnology have made it possible to gather a huge amount of biological data. These data range from genome composition to the chemical interactions occurring in the cell. Such a huge amount of information requires complex algorithms to reveal how it is organized and thus to understand the underlying biology. Metabolism forms a class of very complex data, and the graphs that represent it are composed of thousands of nodes and edges. In this thesis we propose an approach to modularize such networks to reveal their internal organization. We analyzed red blood cell networks corresponding to pathological states, and the in-silico results obtained were corroborated by known in-vitro analyses. In the second part of the thesis we describe a learning method that analyzes thousands of sequences from the UniProt database to predict N-alpha-terminal acetylation. This is done by automatically discovering discriminant motifs that are combined in the manner of a binary decision tree. Prediction performance on N-alpha-terminal acetylation is higher than that of the other published classifiers.
Keywords Metabolic network; Extreme pathways; Network Modularization; Clustering; Sequence motif; N-terminal Acetylation; Machine learning
URN: urn:nbn:ch:unige-860463
Full text
Thesis (4.9 MB) - public document Free access
Research group Scientific and Parallel Computing
(ISO format)
CHARPILLOZ, Christophe. Analysis of large biological data: metabolic network modularization and prediction of N-terminal acetylation. Université de Genève. Thesis, 2015. https://archive-ouverte.unige.ch/unige:86046

Deposited on: 2016-08-15
