Abstract
Motivation:
Uncovering the relationship between micro-RNAs (miRNAs) and their target messenger RNAs (mRNAs) can provide critical information regarding the mechanisms underlying certain types of cancers. In this context, we have proposed a computational method, referred to as prediction analysis by optimization method (PAOM), to predict miRNA-mRNA relations using data from normal and cancer tissues, and then applying the relevant algorithms to colon and breast cancers. Specifically, we used 26 miRNAs and 26 mRNAs with 676 (= 26 × 26) relationships to be recovered as unknown parameters.
Results:
Optimization methods were used to detect 61 relationships in breast cancer and 32 relationships in colon cancer. Using sequence filtering, we detected 18 relationships in breast cancer and 15 relationships in colon cancer. Among the 18 relationships,
Availability:
The numerical code is available from the authors on request.
Introduction
Micro-RNAs (miRNAs) are a large class of small, non-coding RNAs consisting of 22 nucleotides that are expressed from longer and endogenous hairpin-shaped transcripts generally referred to as pre-miRNAs. 1 The miRNAs regulate protein-coding gene expression post-transcriptionally via the translational repression or transcript degradation of their target messenger RNAs (mRNAs),1,2 thereby indicating that miRNAs perform crucial roles in a variety of biological functions. The results of recent studies have demonstrated that miRNAs are deregulated in cancers.2,3 Several computational methods have been proposed to determine how miRNAs pair with their target mRNAs,3,4 in an effort to unravel the roles of miRNAs in the deregulated expression of their target mRNAs during cancer development5,6 and some relationships were identified between miRNAs and their target mRNAs.5,7–11 However, the development of a computational method for the identification of such relationships in cancer remains a difficult issue. Thus far, 2 computational methods have been developed: either the identification of miRNAs that are conserved in different species or stem loop prosecutors 11 or the identification of the relationship between miRNAs and their target mRNAs via the use of sequence homologues. Previously, we proposed a numerical optimization method for multi (miRNAs) -to -multi (mRNAs), which was used for the identification of 16 miRNAs-mRNA relations in colon cancer microarray profiles. 7 The proposed method was used to identify 207 relationships successfully, out of 484. Some relationships detected in that study were verified through previous experimental evidences. For example, the relationship between miR-17 and its target E2F1 was identified in a previous colon cancer study. 12 Like previous studies, however, the difference between the expression profiles of normal and cancer tissues, which we believe to be a critical factor in determining the relationship between miRNAs and their target mRNAs, was ignored.
Here, we propose a novel method, referred to as prediction analysis by optimization method (PAOM), which is composed of a mathematical model and computational method designed to predict miRNA-mRNA relations in the context of cancer development. In this model, the predicted relations are filtered using sequence analysis resources. For mathematical modeling, we employed linear system equations to obtain the inhibiting parameters. The role of the computational method is to optimize the relations and to allow for comparisons of the differences in the parameters between normal and cancer genes. We employed 2 optimization methods—the Broyden-Fletcher-Goldfarb-Shannon (BFGS) and the Powell method, 13 both of which are well known for the optimization of a multi-dimensional matrix problem. For filtering sequence analysis, we used PicTar, based on the scanning multiple alignment of 3′ UTR (untranslated region) sequences and a search set of miRNA sequences, and scored the overlapping position. 11 MiRanda is a miRNA target prediction algorithm that optimizes sequence complementarity using position-specific rules and relies on strict interspecies conservation requirements. 14 In this study, we considered breast and colon cancer data with 676 relationships between 26 miRNAs and 26 mRNAs.
Methods
The mathematical formulation and computational method of the PAOM are described in this section.
Mathematical formulation
It is generally accepted that miRNAs regulate gene expression via either the transcript cleavage or translational repression of their specific target mRNAs, whereas 1 mRNA expression is regulated by multiple miRNAs. For this mechanism, we have a linear equation model, in which 1 mRNA is affected by several miRNAs
where
in which the measurements of the expression of
We solve the equation as an inverse problem, and then obtain
Computational method
In this section, we present a computational scheme for the identification of the relationships between miRNAs and their target mRNAs. The computational method is composed of 3 components—the optimization routine, the objective function, and the direct code. Among them, the optimization routine is the principal component of the computational method, the role of which is to obtain a new set of parameter estimations by solving the inverse problem shown in Equation 3. In particular, we employ the BFGS method for optimization, the so-called quasi-Newton method, which requires second derivatives of the objective function and thereby makes a quadratic convergent to the minimum of error norm, coupled with a drastic reduction of the computational burden. The second method is the Powell method known as direction set method, in which no such second derivatives of the objective function are required. Both methods are useful for multi-dimensional optimization, but they do not work successfully in all cases. After testing both methods with small nodes, we selected an appropriate one for the current cases. Next, the objective function is to provide the criterion for further processing to the next iteration on the basis of iterative error norms. Regarding the error norm, we employed the
where

Comparison of real data set with reconstructed data set using obtained parameters from Powell and Broyden-Fletcher-Goldfarb-Shannon methods with
As CV becomes bigger, the relation is proportional to the strength of the relation between mRNA and miRNA. With the normal and cancer expression data sets, we calculated the unknown parameters as inhibitory relations in the subroutine. The overall numerical scheme of the proposed algorithm follows.
Data setting
We extracted experimentally known 26 miRNAs and 26 mRNAs from the RNA expression profiles of human cancers reported by Lu et al. 15 In colon cancers, each gene consists of 4 normal and 7 cancer data points. In breast cancer, each gene consists of 3 normal and 6 cancer data points. As the data sets are much smaller than unknown parameters, malpositioning problem frequently occurs in the algorithm. For filtering analysis, we used PicTar (https://pictar.mdc-berlin.de/) and miRanda (http://www.microrna.org). PicTar is a computational method used to identify common targets of miRNAs, based on scanning multiple alignments of 3′ UTR sequences and a search set of miRNA sequences, followed by the scoring of the overlapping positions combining the PicTar scores of orthologous transcripts. 11 miRanda is a miRNA target prediction algorithm that optimizes sequence complementarity using position-specific rules and relies on strict interspecies conservation requirements. 14 Neither sequence filtering has any relation with cancers.
Main routine
Main input: data sets
Call subroutine
Input : miRNA cancer, miRNA normal mRNA cancer, and mRNA normal data sets.
Output:
2. Compare the values of parameters
Output represents the relationship between miRNAs and their target mRNAs
Subroutine
Set
Read experimental data of the expression profiles of mRNAs and miRNAs;
Set initial guesses to zero
Construct linear model of Equation 2, and generate numerical data using miRNAs:D
Run Optimization method (BFGS)
Read expression microarray data profiles (normal:N or cancer:C)
Implement Objective function:
Output
Results
Figure 1 represents the comparisons of the real and reconstructed data based on the parameters derived from the Powell and BFGS methods. We tested 2 optimization methods: Powell and BFGS, with 4 nodes of mRNAs (MAPK14, E2F1, HMGA2, and NOTCH1) and 4 nodes of miRNAs (miR-34a, miR-17a, miR-24, and miR-30) from data points of the normal colon tissues. We also assessed each method with the

Distribution of the number of mRNAs of each miRNA with 5 PAOM in breast cancer. The most significant genes are
The frequency of miR-20, miR-23a, and miR-223 are relatively high, which suggests that those miRNAs do exert some effect on breast cancer.16,17 Zhang et al 6 presented miR-20 with the copy number lost in breast cancer. Figure 3 represents the frequency of negative relation with each of the colon cancer miRNAs by cutting PAOM score 5. The frequencies of miR-17, miR-30a, and miR-124 were highest, which implies that those miRNAs exert an effect on colon cancer. Recently, Monzo et al 12 observed that miR-17 was detected in human colon cancer development, and Silber et al 18 observed upregulated miR-124 in colon (HCT-116) cancer cell lines. Tables 1 and 2 show the miRNAs-mRNAs relationships using the proposed method with filtering sequence analysis.

Distribution of the number of mRNAs of each miRNA with 5.0 PAOM in colon cancer.
The relation of miRNAs and target mRNAs with sequence analysis of breast cancer, experimental analysis, and optimization analysis.
Abbreviations: mRNA, messenger RNA; miRNA, micro-RNA; PAOM, prediction analysis by optimization method.
Predicted targets based on sequence analysis.
Predicted targets based on the proposed method.
The relation of miRNAs and target mRNAs with sequence analysis, experimental analysis, and the proposed optimization analysis based on colon cancer.
Abbreviations: mRNA, messenger RNA; miRNA, micro-RNA; PAOM, prediction analysis by optimization method.
Predicted targets based on sequence analysis.
Predicted targets based on the proposed methods.
Prediction of breast cancer
Table 1 shows the predicted relationships between miRNAs and mRNAs with sequence analysis in breast cancer. Based on our proposed method, we acquired 61 strong candidates out of a total of 676 miRNA-mRNA relations. With the integration of filtering sequence analysis, we predicted 18 miRNA-mRNA relationships in the breast cancer cells. Filtering analysis include sequence conservation-based miRanda and PicTar applications.
Those filtering resources are not generally associated with breast cancer. However, the current methods predicted some relationships using normal breast and cancer breast expression data. The results of our proposed method indicate that let-7a, miR-223, miR-98a, and miR-34a downregulate CD24 expression in cases of breast cancer. With sequence analysis, we suggest that let-7a (PAOM score 48.2) and miR-98a (PAOM score 12.0) are predicted to be strongly associated with CD24 in breast cancer. Those relationships were experimentally verified.19,20 Kaipparettu et al
19
found that CD24 expression was downregulated by estrogen in breast cancer stem cells. Verghese et al
20
found that let-7 family was downregulated significantly in breast tumor-initiating cells (BT-1Cs) that were enriched with CD24. Dai et al
21
showed that
Prediction of colon cancer
Table 2 shows the prediction of 15 relationships in colon cancer using the proposed method with filtering sequence analysis. HMGA2 (high mobility group [HMGI]) was observed to be abundantly expressed in human colorectal carcinomas.32–34 MAPK14 that is regulated by miR-12411,35 maintains a high level of ERbeta for E2 anti-proliferative effects in colon cancer cells
36
and in giloblastoma.
37
In addition, MAPK14 is involved in apoptosis in colorectal cancer induced by growth factors.
38
The activation of the Wnt signaling pathway appears to suppress the expression of the
In addition, Kiriakidou et al
42
experimentally reported that
Discussion
Most previous computational studies have been conducted to predict miRNA-mRNA relations on the basis of DNA sequence data. The resultant large number of the sequence predictions makes biological validation quite difficult. On the contrary, a variety of previous studies have demonstrated that an miRNA deregulates its target mRNA in a cancer type-specific manner. For example, miR-34a deregulates E2F in cancer cell lines
44
whereas miR-17 deregulates E2F in breast cancer cells.
20
In this article, we suggested a PAOM consisting of a mathematical model and computational method using microarray data sets and filtering sequence analysis, such that the cancer-specific relationships between miRNAs and mRNAs can be predicted. The proposed PAOM was assessed and compared with normal and cancer microarray profiles in both breast and colon cancers. Among 676 relationships, we predicted 61 and 28 miRNA-mRNA relationships that might exert some effects on breast and colon cancer development, respectively. According to the results of sequence analysis filtering, we uncovered 18 breast putative relations with 12 mRNAs and 14 miRNAs and 15 colon putative relations with 12 mRNAs and 10 miRNAs. We confirmed that 8 genes—
Footnotes
Declaration of conflicting interests:
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF), the Ministry of Science, ICT & Future Planning (NRF-2017R1A2B4010684).
Author Contributions
SK performed all analyses, drafted the manuscript and designed the project.
