Document Type
Article
Publication Date
2009
Publication Title
Cancer Informatics
Volume
7
First page number:
75
Last page number:
89
Abstract
Efficient and effective analysis of the growing genomic databases requires the development of adequate computational tools. We introduce a fast method based on the suffix tree data structure for predicting novel targets of hypoxia-inducible factor 1 (HIF-1) from huge genome databases. The suffix tree data structure has two powerful applications here: one is to extract unknown patterns from multiple strings/sequences in linear time; the other is to search multiple strings/sequences using multiple patterns in linear time. Using 15 known HIF-1 target gene sequences as a training set, we extracted 105 common patterns that all occur in the 15 training genes using suffix trees. Using these 105 common patterns along with known subsequences surrounding HIF-1 binding sites from the literature, the algorithm searches a genome database that contains 2,078,786 DNA sequences. It reported 258 potentially novel HIF-1 targets including 25 known HIF-1 targets. Based on microarray studies from the literature, 17 putative genes were confirmed to be upregulated by HIF-1 or hypoxia inside these 258 genes. We further studied one of the potential targets, COX-2, in the biological lab; and showed that it was a biologically relevant HIF-1 target. These results demonstrate that our methodology is an effective computational approach for identifying novel HIF-1 targets.
Keywords
Computer algorithms; Gene mapping—Technique; Human genome—Research; Pattern recognition systems
Disciplines
Biology | Cell Biology | Electrical and Computer Engineering | Engineering | Genetics and Genomics | Immunology and Infectious Disease | Life Sciences | Microbiology
Language
English
Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.
Repository Citation
Jiang, Y.,
Cukic, B.,
Adjeroh, D. A.,
Skinner, H. D.,
Lin, J.,
Shen, Q. J.,
Jiang, B.
(2009).
An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets.
Cancer Informatics, 7
75-89.
https://digitalscholarship.unlv.edu/ece_fac_articles/407
Included in
Biology Commons, Cell Biology Commons, Electrical and Computer Engineering Commons, Genetics and Genomics Commons, Immunology and Infectious Disease Commons, Microbiology Commons