Document Type
Article
Publication Date
4-19-2010
Publication Title
Plos One
Volume
5
Issue
8
First page number:
1
Last page number:
7
Abstract
Background
Minimotifs are short contiguous peptide sequences in proteins that are known to have a function in at least one other protein. One of the principal limitations in minimotif prediction is that false positives limit the usefulness of this approach. As a step toward resolving this problem we have built, implemented, and tested a new data-driven algorithm that reduces false-positive predictions.
Methodology/Principal Findings
Certain domains and minimotifs are known to be strongly associated with a known cellular process or molecular function. Therefore, we hypothesized that by restricting minimotif predictions to those where the minimotif containing protein and target protein have a related cellular or molecular function, the prediction is more likely to be accurate. This filter was implemented in Minimotif Miner using function annotations from the Gene Ontology. We have also combined two filters that are based on entirely different principles and this combined filter has a better predictability than the individual components.
Conclusions/Significance
Testing these functional filters on known and random minimotifs has revealed that they are capable of separating true motifs from false positives. In particular, for the cellular function filter, the percentage of known minimotifs that are not removed by the filter is ~4.6 times that of random minimotifs. For the molecular function filter this ratio is ~2.9. These results, together with the comparison with the published frequency score filter, strongly suggest that the new filters differentiate true motifs from random background with good confidence. A combination of the function filters and the frequency score filter performs better than these two individual filters.
Keywords
Algorithms; Amino Acid Motifs; Animals; Artificial intelligence; Data mining; Data Mining/Methods; Databases; Protein; Humans; MimoSA; Minimotifs; Protein binding; Proteins/Chemistry; Proteins/Metabolism; Proteins—Metabolism; Sequence Analysis; Protein
Disciplines
Biology | Computer Sciences | Life Sciences | Molecular Biology | Structural Biology
Language
English
Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.
Repository Citation
Rajasekaran, S.,
Mi, T.,
Merlin, J. C.,
Oommen, A.,
Gradie, P. R.,
Schiller, M. R.
(2010).
Partitioning of Minimotifs Based on Function with Improved Prediction Accuracy.
Plos One, 5(8),
1-7.
http://dx.doi.org/10.1371/journal.pone.0012276
Included in
Biology Commons, Computer Sciences Commons, Molecular Biology Commons, Structural Biology Commons