Abstract
Abstract
Identifying disease genes is very important not only for better understanding of gene function and biological process but also for human medical improvement. Many computational methods have been proposed based on the similarity between all known disease genes (seed genes) and candidate genes in the entire gene interaction network. Under the hypothesis that potential disease-related genes should be near the seed genes in the network and only the seed genes that are located in the same module with the candidate genes will contribute to disease genes prediction, three modularized candidate disease gene prioritization algorithms (MCDGPAs) are proposed to identify disease-related genes. MCDGPA is divided into three steps: module partition, genes prioritization in each disease-associated module, and rank fusion for the global ranking. When applied to the prostate cancer and breast cancer network, MCDGPA significantly improves previous algorithms in terms of cross-validation and disease-related genes prediction. In addition, the improvement is robust to the selection of gene prioritization methods when implementing prioritization in each disease-associated module and module partition algorithms when implementing network partition. In this sense MCDGPA is a general framework that allows integrating many previous gene prioritization methods and improving predictive accuracy.
Get full access to this article
View all access options for this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
