Abstract
Online reviews are a new source for the valuable voice of customers. By identifying the customer’s opinion, designers can comprehend the important features of a product to satisfy customer demand, thus enhancing the market competitiveness of the product. Customers have opinions on multiple aspects of products hidden in reviews, and sentiment divergence may exist. Moreover, there is a gap between customer requirements and the product’s system requirements. How to effectively analyze a large number of reviews to extract the aspect-level customer opinion and thus determine the most important product engineering characteristics in design are the critical challenges for market-driven design. A systematic requirement analysis framework is proposed in this work. First, a convolutional neural network and sentiment analysis are used for opinion mining of online reviews. Then, based on fuzzy logic, the customer sentiment divergence (which is quantified by controversy indexes) and the average sentiment of a requirement are used to determine the degree of satisfaction. Finally, based on the product’s quality function development matrix, the satisfaction and frequency of the customer requirements are used to estimate the importance of the product’s engineering characteristics, which identifies the focus of product design. A case study of a hair dryer is given to demonstrate the effectiveness of the proposed methods.
Keywords
Introduction
Customer satisfaction of features and expected use scenarios of a product determine whether a customer will purchase a product. 1 Therefore, developers should have methods for detecting and responding to market feedback. By capturing and analyzing customer requirements, the important features of a product, including the functionality, structure, and behavioral characteristics, can be determined.
With the development of e-commerce, online reviews have become an easy-to-access source for customer requirements.1,2 However, compared to traditional requirement analysis, there are many challenges in analyzing online reviews: (1) customer’s attitudes on various aspects of a product are implied in comments and expressed by natural language, rather than rating each individual aspect explicitly; (2) e-commerce platforms usually use the average rating of a product to measure customer satisfaction, which exacerbates the controversy of the opinions of each aspect; and (3) customer opinions on product features cannot be directly used to improve product design and should be translated to the importance of product engineering characteristics.
Based on the above analysis, an online review-based framework for customer requirement analysis is proposed to help designers determine the focus of product design. First, the customer-concerned product features and the corresponding customer opinions are extracted from the comment texts in online reviews based on a convolutional neural network (CNN) and sentiment analysis. Second, the customers’ sentiment divergence for each aspect of the product is quantified. Then, the average sentiment score is considered to determine the satisfaction of this aspect by fuzzy logic reasoning. Finally, the importance of each engineering characteristic is calculated.
The article is organized as follows: relevant research studies are reviewed in section “Related work.” Section “Method overview” presents a framework for online review–based requirement analysis. Sections “Customer requirement extraction of online reviews,”“Fuzzy logic-based customer satisfaction analysis,” and “Evaluation of the importance of engineering characteristics” are the foundation of this work, that is, automated customer opinion mining from online reviews, fuzzy logic-based customer satisfaction analysis, and determination of the importance of engineering characteristics. The experiments on the proposed method and the requirement analysis of the hair dryer product are shown in “Experiments and Analysis” to demonstrate the proposed framework. In section “Conclusion,” conclusions are drawn and future work is discussed.
Related work
Customer-driven product design
The voice of the customer (VOC) is widely used to capture customer expectations, preferences, and aversions.3–5 Traditionally, market research methods, such as customer interviews, market questionnaires, and surveys, have been used to collect data and conduct customer requirements.2,6 Under the guidance of an expert, a group of typical users is gathered together to freely discuss product-related issues in accordance with a predetermined process. Moreover, several methods, such as the analytic hierarchy process, 7 fuzzy linguistic model, 8 and fuzzy expert system, 9 are used to find customers’ satisfaction with different kinds of requirements.
However, it is hard to obtain deep demands of customers and generate novel product concepts using this method.10,11 With the development of e-commerce, the online comments of products provide a new source for capturing customer requirements. Therefore, a new requirement analysis based on massive online comments should be developed.
Opinion mining
Current research on opinion mining of online reviews focuses on feature extraction and sentiment analysis.12–15 Jin et al. 16 gave a complete review of recent advances in information mining from big consumer opinion data for product design. Aspect-level sentiment analysis 17 is used to summarize customers’ attitudes of different attributes or features (which are referred to as “aspects”). Hu and Liu18,19 used the apriori algorithm to mine the words of each product and predict whether a customer’s opinion on a specific feature is positive or negative. Based on dependency grammar, Qiu et al. 20 proposed an unsupervised method for extracting feature and opinion words through the double propagation of information between opinion words. 21 Jain et al. 22 employed CNN and long short-term memory architectures for extracting sentiment present in social media data and customer reviews with effective training solutions. Wu et al. 23 used CNN to directly map each review sentence into predefined aspects.
Based on sentiment analysis and analogy reasoning, Zhou et al. 24 proposed a model to obtain potential requirements of customers implicated in use cases. Yang et al. 25 combined local context information and global context information to rank features based on the rating and frequency of each feature. Jiang et al. 26 used both frequencies and sentiment scores to determine the importance of product features and predict their future importance based on fuzzy time series. However, the sentiment divergences between different customers are ignored in all the above methods. Xu et al. 27 explored the controversy of aspects in reviews by the distribution of sentiment scores, and quantitative measures were used to evaluate customer sentiment divergence.
Customer needs translation
Quality function deployment (QFD) 28 is widely used for translating customer requirements into system requirements. Here, each element indicates the weighted relationship between a customer requirement and an engineering characteristic.29,30 Fuzzy set theory is usually introduced to address the vagueness of the description of customer requirements; 31 thus, the importance of customer requirements can be determined. 32 Then, computer-aided approaches are used to determine the weight between customer requirements and engineering characteristics33,34 for customer-driven product design processes.
To convert customer preferences into the importance of engineering characteristics, Kang and Tucker 35 used the latent Dirichlet allocation (LDA)-based method to extract the user-desired product features implied in reviews and the engineering characteristics hidden in the patent documents. Moreover, the PageRank algorithm 36 was used to match a customer requirement term to corresponding engineering characteristics. However, the topic vectors that are extracted by LDA are difficult to understand and difficult to use to directly represent a product feature.
By employing machine learning techniques, Wang and Zhang 37 trained support vector machine (SVM)-based and decision tree–based classification models. A product can be recognized if it meets a specific customer requirement according to the technical parameters. However, it cannot address the direct conversion between individual customer requirements and engineering features. Based on the identified helpful online reviews, 38 Jin et al.39,40 used a statistical language model to label the feature words in online reviews to indicate the engineering features in QFD. However, the product features and keywords in the comment sentences were all marked manually, which increases the workload and may introduce unnecessary subjectiveness.
Method overview
Based on the above analysis, a comprehensive analysis framework is proposed to effectively extract the customer opinions contained in online reviews, as shown in Figure 1. Here, the customers’ attitudes and the sentiment divergence of their opinions are fully considered to evaluate their satisfaction of each aspect to help measure the importance of product engineering characteristics to determine the focus in design. The proposed comprehensive requirement analysis process based on online reviews is as follows:
Customer requirements extraction. First, online reviews are collected, and the natural language processing technique is used to clean the obtained data. Then, the customer requirements hidden in each review are obtained with the opinion mining method. Finally, each customer’s aspect-level sentiment of the product is measured.
Customer satisfaction analysis. First, the average customer’s sentiment score for each requirement is calculated. Then, the distribution and the variance in the scores are analyzed, and the controversy indexes are used to quantify customer sentiment divergences. Finally, the sentiment score and controversy indexes of each customer requirement are used to evaluate the customer’s satisfaction for each requirement by applying the fuzzy logic method.
Importance determination of engineering characteristics. Based on the relationship between customer requirements and product engineering characteristics established in QFD, the weighted summation of the satisfaction of associated customer requirements is calculated for each engineering characteristic. Then, the importance of engineering characteristics for product design is estimated and ranked.

Customer requirement analysis framework.
Customer requirement extraction of online reviews
Often, customers can express their attitudes freely based on comment texts, and thus, the number of reviews is very large. It is cumbersome to manually analyze the customer requirements from online reviews. Here, an approach for automated extraction of hidden customer opinions is proposed based on natural language and text analysis technique.
Problem analysis
Generally, the comments in an online review consist of short texts. Text classification methods can be used to label a comment sentences with customer requirements based on the implicit semantic features of these sentences. Here, TextCNN 41 is used. By fine-tuning a small number of hyperparameters, the model works well for benchmark problems of different applications. One advantage of using CNN for text classification is that it can automatically obtain the feature representation of the text, and thus, complicated artificial feature engineering is avoided.
Note that the customer rating can only reflect a customer’s overall attitude for the product. As shown in Figure 2, the customer’s rating for the product is two points, which is a negative opinion. However, the customer is only dissatisfied with the heating effect of the hair dryer and has no clear bias to other aspects. In this section, automated aspect-level sentiment analysis is used to mind customer sentiment for each aspect to improve the accuracy of customer opinion analysis.

An instance of a customer’s comments.
TextCNN-based text classification
Since each comment may express customer requirements of multiple aspects of a product, the automated annotation of reviews can be recognized as a problem of multilabel classification. To simplify the classification problem, an independent classifier is constructed for each of the customer requirement labels here. Therefore, the multilabel classification problem is simplified to multiple binary classifications.
Training strategy
For each binary classification model for a certain requirement label, the corresponding training set and verification set are prepared. Then, these models are trained independently to adjust the model parameters and determine whether the model has converged.
Note that the obtained samples annotated with a certain requirement label are extremely unbalanced during the manual annotation process. The number of negative samples is usually much larger than that of positive samples. The resampling strategy is introduced in each epoch of model training to equalize the negative and positive samples. Moreover, the 10-fold cross-validation method is applied for training a TextCNN model. In each training epoch, the whole data set for this epoch is divided into 10 subdata sets, nine of which are selected for model training, and the last is used for verification. The minibatch training protocol is used here. The pseudocode for model training is shown in Algorithm 1.
Model parameter setting
At the input layer of a textCNN model, the input text should be encoded to word vectors. A dictionary for the corpus (all the collected reviews data) should be established to develop the model and randomly initialize the encoding of each word and then dynamically update the word vectors through backpropagation of the neural network during the model training process. The parameters used for model training in this study are as follows. The input includes 128D word vectors. The convolution layers are created with the region size h =[1,2,3,4,5] and 128 feature maps for each region size. The probability of dropout during training is 0.5 (randomly choose 50% of neurons and set the weight to 0) to prevent overfitting.
Sentiment analysis
Sentiment analysis refers to the use of natural language processing, text analysis, and computational linguistics to systematically determine the attitude with respect to some topics. Here, the opinions expressed by the comment sentences refer to the positive or negative sentiment of some aspects of the product. The sentiment dictionary SentiWordNet 42 is adopted to calculate the sentiment score of each comment in which each word is associated with its synsets. Each synset can be viewed as an interpretation of the semantics of the word in different contexts. Moreover, according to the statistics of sentiments in massive documents, it gives the estimated degrees of positivity, negativity, and neutrality for each synset. Therefore, for each meaningful word in a comment sentence, its sentiment can be determined by searching the synsets from SentiWordNet. After the comment, sentences are annotated, and the sentiment score of each word is calculated, the sentiment score of a comment sentence is calculated as follows
Here,
After that, all the sentiment scores of the sentences in a review are integrated to obtain the aspect-level sentiment score of that review. If all sentences in the review are annotated with only one requirement r, its rating is used as the sentiment score of r. Otherwise, for each requirement
By analyzing the sentiment scores of all the requirements mentioned in each review, the average sentiment score of each requirement can be obtained. Moreover, by analyzing the distribution of sentiment scores from individual customers, it can be evaluated that the customer group has aspect-level sentiment divergences.
Fuzzy logic-based customer satisfaction analysis
Based on the opinion mining of online reviews, the average sentiment score of each aspect can be obtained to measure the satisfaction of each customer requirement. However, the customers may have sentiment divergences of an aspect. Here, the term “controversy” is introduced to quantify the customer’s sentiment divergences. A fuzzy logic approach is used to derive the satisfaction with each customer requirement, and a fuzzy logic reasoning framework is given, as shown in Figure 3. Here, the hard controversy and soft controversy are considered first, and a series of fuzzy rules are used to determine the controversy of requirements. Then, 20 fuzzy rules are used to determine the satisfaction of each requirement.

The fuzzy reasoning framework of requirement satisfactions.
The quantitative measure of controversy
Amendola et al. 43 proposed two kinds of controversy to evolve the perception of controversial movies: hard controversy and soft controversy. Two indexes, H and S, were defined as their normalized measures. Here, the two indexes are adapted. H measures the degree of sentiment difference among customer reviews, whereas S measures the sentiment scores of reviews with evenly distributed reviews, which are defined as follows
Here, Hi measures the hard controversy of the ith customer requirement;
Si is a square root of a
Satisfaction determination based on fuzzy logic
As mentioned above, H and S are used to reflect the degree of controversy of the sentiment scores. However, there is a contradiction between human fuzzy language habits and precise mathematical quantities. In this section, a fuzzy logic approach is proposed to determine the satisfaction degree of a customer requirement based on its controversy indexes and sentiment score.
The inference of controversy based on fuzzy logic
Obviously, the lower the value of H, the more scores that are concentrated to the mean. Similarly, S = 0 if all the sentiment scores are concentrated to a single value. The higher the value S is, the more uniform the distribution of scores. Here, three fuzzy sets are defined for the value of H according to the conclusion of Amendola: 39 low (L), medium (M), and high (H). Their membership functions are shown in Figure 4, which are trapezoidal fuzzy functions, triangular fuzzy functions, and trapezoidal fuzzy functions, respectively. The values of the three segmentation points are 0.4, 0.6, and 0.8. Similarly, the fuzzy sets of S are defined with low (L), medium (M), and high (H). The membership functions are shown in Figure 5, where the values of the three segmentation points are 0.6, 0.7, and 0.8, respectively.

Membership functions of H.

Membership functions of S.
The value of the output linguistic variable, the controversiality, is also divided into four fuzzy sets: low (L), medium (M), high (H), and very high (VH), and the corresponding membership function is shown in Figure 6. After the value of H and the value of S are converted into the membership degrees of corresponding linguistic variables, these membership degrees, together with the linguistic variables, are regarded as the antecedent parts for fuzzy inference.

Membership functions of controversiality.
To comprehensively consider the indexes of hard controversy and soft controversy to determine the degree of the controversiality of a customer requirement, nine fuzzy rules are defined in the form of “IF H = X AND S = Y,
THEN controversiality = Z” as shown in Table 1.
The fuzzy rules for the determination of controversiality.
VH: very high.
The inference of satisfaction
Using the average sentiment score and controversiality of a customer requirement as inputs, the customers’ satisfaction of this requirement can be determined by fuzzy logic. Since the sentiment scores are normalized to [1,5], the corresponding fuzzy sets are defined with very negative (VN), negative (N), neutral (M), positive (P), and very positive (VP). Their membership functions are shown in Figure 7. For the output variables, based on the experience of marketing experts, the fuzzy sets of satisfaction are set to extremely not satisfied (ENS), not satisfied (NS), satisfied (S), very satisfied (VS), and extremely satisfied (ES). Their membership functions are shown in Figure 8.

Membership functions of sentiment.

Membership functions of satisfaction.
In this work, “controversy” is introduced as a factor for the determination of customers’ satisfaction degree of a certain requirement. If a customer requirement has a high average sentiment score with a single polarity distribution of sentiment scores, the customers have a census of positive comments. Otherwise, when there is no census of customers’ attitudes for a requirement, even if it has a high average sentiment score, it indicates that customers are not fully satisfied with this requirement. Based on the above analysis, 20 fuzzy rules are used to determine the satisfaction of a customer requirement, as shown in Table 2.
The fuzzy rules for the determination of satisfaction.
VN: very negative; VP: very positive; ENS: extremely not satisfied; VS: very satisfied; ES: extremely satisfied; VH: very high.
Evaluation of the importance of engineering characteristics
During the product design process, designers often develop a QFD matrix to interpret the correspondence between the requirements and the product’s engineering characteristics. According to statistics, 80% of product design is an adaptive design in which there are existing product design documents, including the QFD matrix. Therefore, based on a product’s QFD matrix, the importance of various customer requirements can be translated into that of all engineering characteristics.
The relationship between 21 customer requirements and 26 engineering characteristics of hair dryer products is given in Ulrich and Eppinger. 3 Here, 12 types of customer requirements are selected and integrated for analysis. Moreover, 21 corresponding engineering characteristics and the relations between requirements and these engineering characteristics are listed, as shown in Figure 9. For example, when designers want to evaluate the importance of the engineering characteristic “supply front airflow” in product design, the frequency of mention and customer satisfaction of the four requirements of “reliability,”“price,”“noise,” and “weight” need to be considered. In addition, the weights of these four requirements to “supply front airflow” are 9, 4, 9, and 9.

The QFD matrix of hair dryer product.
The frequency and sentiment scores of each customer requirement mentioned in the online reviews imply the importance of this requirement for product development, while the product QFD matrix reflects the product’s engineering characteristics that are closely related to the customer requirements. If an engineering characteristic of the product is exactly associated with the requirements that are of high concern by customers with low satisfaction, the improvement of this characteristic can effectively meet the market’s demands. Based on the above analysis, the evaluation of the importance of a product’s engineering characteristics is proposed to improve product design efficiency. The calculation formula is as follows
Here,
Experiments and analysis
In this section, hair dryers are chosen as research objects, and online reviews of multiple popular hair dryers from e-commerce platforms are collected. All of the reviews are manually annotated with customer requirement labels for training the classifiers for automated review annotations. Finally, the online reviews of a specific hair dryer are used to verify the proposed method for assessing the importance of each engineering characteristic in the product design based on customer requirements analysis.
Online review collection and annotation
All the customer reviews are crawled from Amazon (www.amazon.us). Using “hair dryer” as the search keyword, Amazon automatically sorts and displays the products of different brands and models according to their sales and ratings. Based on the ranking results and the number of customer reviews, four best-selling products from 4 well-known brands are selected: Revlon infrared hair dryer, Remington hair dryer with ionic +ceramic + tourmaline technology, Conair mid-size styler hair dryer, and Conair ionic ceramic hair dryer. The scale of the customer review data is listed in Table 3 to show the number of reviews obtained for each product, the number of valid sentences after text cleaning and segmentation (the sentences that contain only one word or only illegal characters are removed), the maximum sentence length of the valid sentences (the number of words in the longest sentence), and the average sentence length.
The review data of four hair dryer products.
After that, all the reviews of the four products listed in Table 3 are labeled by two graduate students from industrial design. During the annotation process, all the comment sentences are first labeled by the two annotators separately. If a comment sentence has different annotation results, the annotators negotiate with each other to determine the final labels of that sentence. The reviews of the four products are labeled by the 12 selected types of customer requirements. The annotation results are shown in Figure 10. The y-axis shows the number of sentences labeled to the corresponding customer requirements for each product.

The manual annotation results for the four hair dryer products.
Performance analysis of the automated annotation
Based on the selected CNN architecture and parameter settings, the TensorFlow framework is used to construct the binary classification models for the automated annotation of reviews of hair dryers to 12 customer requirement labels. Two experiments are designed to validate the performance of the models and verify our hypothesis: “the classifiers that are trained with the review data set of some products belong to a specific category and can be used to label the customer reviews of the same type of product.”
Experiment 1
Customer reviews of a single hair dryer product are used to develop the classification model, and customer reviews of other hair dryer products are used to test the automated annotation performance. For the review data set shown in Table 3, product B00132ZG3U has the largest number of samples (6050 valid sentences). Therefore, it is selected as the training set, while the reviews of the other three products are used as the test set. The experimental results are shown in Table 4. Although the binary discriminant classifiers of the 12 customer requirements exhibit more than 95% accuracy for the test set, there are still some cases where the precise and recall are 0. The low precision and recall are due to the imbalance of positive and negative samples in the training set. Thus, even if the classifier labels all positive samples in the test set as negative samples, it can still achieve high accuracy. Moreover, in the review data of a single product, the customers may describe a particular requirement with a limited vocabulary. For example, the word “purple” is extensively used to comment on the appearance of the B00132ZG3U. However, the feature does not appear in the reviews of the other three products that were used as the test set. The singularity of the training set may lead to overfitting the classifiers.
Performance of the classifier trained by the reviews of a single product.
Experiment 2
Combining the customer review corpus of different products can improve the diversity of training data. As a result, the trained discriminant classifiers can have a better performance for labeling reviews of the same type of products. In this experiment, two sets of classifiers are trained. First, the reviews of B00132ZG3U, B00MMRFUG8, and B001LQY1X4 are used as training sets to develop the classifiers, while B00GZG4KSG’s customer reviews are used as test sets. Then, the reviews of B00132ZG3U, B00MMRFUG8, and B001LQY1X4 are used as the training set, while B001LQY1X4’s customer reviews are used as the test set. The experimental results are shown in Table 5. Compared with Table 4, the precision and recall of the classifiers of each customer requirement improve, especially in the three aspects of “reliability,”“appearance,” and “hand grip.” It can be concluded that the low precision and recall caused by overfitting can be avoided using larger and more diverse training data.
Performance of the classifier trained by the reviews of multiple products.
By comparing the results of the two experiments, it can be concluded that the classifiers that are trained with the review data set of some products belonging to a specific category can be used to label the customer reviews of the same type of products. Therefore, if a sufficient number of customer reviews of the typical and popular products for a certain category are collected and manually labeled as the training data, classifiers can be trained for the automated requirement type annotation of customer reviews of other products of the same category. Note that the classification model has universality for reviews of the products belonging to the same category, which can reduce the cumbersome manual annotation.
Case study of customer requirement analysis
In this section, the Revlon hair dryer (B000A3I2X4) is analyzed based on its online reviews using the proposed method step by step. First, for each comment sentence of the reviews, the corresponding requirement types should be labeled for subsequent customer opinion mining and customer satisfaction estimation. The scale of the customer reviews of the hair dryer is shown in Table 6. There are 1341 reviews with 2007 valid sentences in which the average and the maximum length are 14 and 87 words, respectively.
Customer review of Revlon quick dry lightweight hair dryer.
To obtain the highest possible accuracy for the CNN short text classifier, it is necessary to provide as much training data as possible. Therefore, the manually labeled customer reviews of the B00GZG4KSG, B00MMRFUG8, B001LQY1X4, and B00132ZG3U are used as training sets to train 12 binary discriminant classifiers for the selected 12 customer requirements.
The customer reviews of the Revlon hair dryer are automatically annotated by the trained classifiers. The frequencies of the 12 customer requirements that have been mentioned in the collected 2006 comment sentences are shown in Table 7. The four items mentioned with highest frequencies are “price” (305 times), “weight” (295 times), “temperature setting” (215 times), and “efficiency” (163 times). Therefore, the customer’s concern reflected in the classification results of its reviews meets the characteristics of this product as its name “Revlon quick dry lightweight hair dryer.”
The frequencies of the 12 customer requirements mentioned in the reviews.
Then, sentiment analysis and controversy calculation are performed on the review data corresponding to each of the customer requirements. The calculation results and comparisons of the average sentiment score, hard controversy, and soft controversy for the 12 customer requirements of the hair dryer are shown in Figure 11. Among all the requirement items, the “appearance” has a very high sentiment score with relatively low soft/hard controversy. It can be determined that customers have high satisfaction with the appearance of the hair dryer. In addition, there are low sentiment scores and a high degree of controversy from the reviews for the “reliability” of this product, which means that customers are not satisfied with the reliability of this product.

Customers’ opinions of the 12 requirements.
Using the soft/hard controversy and average sentiment score of each requirement item as input, the degree of customer satisfaction of each requirement can be evaluated based on the fuzzy logical framework proposed in section “Satisfaction determination based on fuzzy logic.” The average sentiment score, soft/hard controversy, and the overall controversy inferred by fuzzy logic with the evaluated satisfaction for each customer requirement are shown in Table 8. The “appearance” has the highest satisfaction with a value of 0.823, and the “reliability” has the lowest satisfaction with a value of 0.086.
The obtained satisfaction of 12 requirements based on fuzzy logic.
Then, based on the relationships between customer requirements and engineering characteristics in the QFD matrix, as shown in Figure 9, the importance of all engineering characteristics can be calculated with the formula given in section “Evaluation of the importance of engineering characteristics.” The importance of engineering characteristics reflects how important a certain aspect of the product needs to be considered in the design process to maximize customer satisfaction. The importance of the 12 engineering characteristics of the Revlon hair dryer is ranked from high to low, as shown in Figure 12. Compared to other features, “heats front air flow” has the highest priority in the design process. In addition, “offers foldability,”“allowing control of burst,” and “supplies front air flow” also have very high importance. Therefore, if Revlon intends to develop an upgrade to its quick dry lightweight hair dryer, it is necessary to attach great importance to these four features to enhance the customer experience and meet the user’s needs.

The rank of the importance of engineering characteristics based on customer requirement analysis.
To verify the effectiveness of this approach, we designed a questionnaire about the importance of the 21 engineering characteristics and identified 12 persons using this kind of hair dryer. Similarly, the importance is ranged and quantified by a score from 5 to 0. The returned results are tabulated as follows (here, only the most important seven engineering characteristics are given). It can be seen from Table 9 that the proposed method is quite effective even though there are some differences between the values of the first seven important engineering characteristics.
Review comments directly collected from real users.
Conclusion
An automated customer opinion mining and analysis framework is proposed in this work. The customers’ attitudes and sentiment divergences of each aspect of the product are considered to determine the satisfaction of different customer requirements by fuzzy logic inference. The main contributions are as follows:
An aspect-level opinion mining method for the automated analysis of online reviews is proposed that can process a large number of online reviews efficiently and thus avoid time-consuming and labor-intensive manual analysis.
A fuzzy logic system is developed to infer the degree of satisfaction of each aspect of a product in which the hard and soft controversy indexes are also considered for measuring the sentiment divergences among customers. It helps to find the product attributes with negative opinions or a polarized distribution of opinions.
A systematic customer requirement analysis framework for market-driven product design is proposed. The importance of a product’s engineering characteristics is determined using the product’s QFD matrix through which the customer’s attitudes and preferences are used to guide the design.
A series of experiments with the practical products of hair dryers and their online reviews are conducted. The experimental results show that the importance of different engineering characteristics can be analyzed based on customer requirements, which are quite useful for improving product design.
In the future, we will look for benchmark data sets to compare the opinion mining methods proposed in this work with other text classification or text annotation methods. Moreover, we also intend to use machine learning-based sentiment polarity classification on the data set used in this work. Therefore, finding the most suitable opinion mining method to improve the effectiveness of online review-based requirements analysis is a future focus. Moreover, the mapping of the comment text to the corresponding product’s engineering characteristics is another interesting topic that helps to directly analyze the customer’s satisfaction with the product’s engineering characteristics.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors appreciate the support from the National Key Technology Support Program (2018YFB1700901/3), the National Science Foundation of China (61672247, 61772247, and 61873236), and the public service projects of Zhejiang province (LGG18E050006).
