Abstract
One of the core content of the prediction is that, on the basis of text label attributes, we can use the algorithm and a heuristic approach to acquire the association of texts, and extract the available text for the user. Therefore, this paper proposes a new content. First, the multi-label attributes are chosen to be the feature structure of text, and it is given the classification and assignment according to the distinguish method of the statistical data. Second, considering the relation between texts, we improve the traditional maximum entropy method. We are able to control the number of multiple leading text and subsequent text at the same time. Our method makes stronger association of text, and it leads to a more unified direction and higher correlation of obtained text through the label attributes. Then we can predict the similar texts. Experiments show that with the consideration of multi-label attributes of text and the control of the number of leading text as well as the subsequent text, the recall rate and precision are definitely improved when compared to similar existing methods.
Get full access to this article
View all access options for this article.
