Prytuliak, Roman (2018): Recognition of short functional motifs in protein sequences. Dissertation, LMU München: Fakultät für Biologie |
Vorschau |
PDF
Prytuliak_Roman.pdf 2MB |
Abstract
The main goal of this study was to develop a method for computational de novo prediction of short linear motifs (SLiMs) in protein sequences that would provide advantages over existing solutions for the users. The users are typically biological laboratory researchers, who want to elucidate the function of a protein that is possibly mediated by a short motif. Such a process can be subcellular localization, secretion, post-translational modification or degradation of proteins. Conducting such studies only with experimental techniques is often associated with high costs and risks of uncertainty. Preliminary prediction of putative motifs with computational methods, them being fast and much less expensive, provides possibilities for generating hypotheses and therefore, more directed and efficient planning of experiments. To meet this goal, I have developed HH-MOTiF – a web-based tool for de novo discovery of SLiMs in a set of protein sequences. While working on the project, I have also detected patterns in sequence properties of certain SLiMs that make their de novo prediction easier. As some of these patterns are not yet described in the literature, I am sharing them in this thesis. While evaluating and comparing motif prediction results, I have identified conceptual gaps in theoretical studies, as well as existing practical solutions for comparing two sets of positional data annotating the same set of biological sequences. To close this gap and to be able to carry out in-depth performance analyses of HH-MOTiF in comparison to other predictors, I have developed a corresponding statistical method, SLALOM (for StatisticaL Analysis of Locus Overlap Method). It is currently available as a standalone command line tool.
Dokumententyp: | Dissertationen (Dissertation, LMU München) |
---|---|
Keywords: | Motif search, protein sequences, positional data, statistical analysis |
Themengebiete: | 500 Naturwissenschaften und Mathematik
500 Naturwissenschaften und Mathematik > 570 Biowissenschaften, Biologie |
Fakultäten: | Fakultät für Biologie |
Sprache der Hochschulschrift: | Englisch |
Datum der mündlichen Prüfung: | 22. Juni 2018 |
1. Berichterstatter:in: | Leibold, Christian |
MD5 Prüfsumme der PDF-Datei: | a74430daae1c430f8c5c4bf2c0076a6d |
Signatur der gedruckten Ausgabe: | 0001/UMC 25737 |
ID Code: | 22474 |
Eingestellt am: | 06. Sep. 2018 09:29 |
Letzte Änderungen: | 23. Oct. 2020 17:09 |