An effective similarity determination model for case-based reasoning in support of low-carbon product design

Abstract

In the initial stage of low-carbon product design, design information is usually uncertain and incomplete, and design attributes are coupled with one another; introducing low-carbon requirements therefore creates design conflicts that require retrospective coordination. Reusing prior design knowledge can improve design efficiency; however, acquiring similar case knowledge requires considering not only the similarity of design problems but also the adaptability of candidate cases. This study presents an effective similarity determination model to support low-carbon product design. The targets of the proposed model are (1) to reasonably determine the design ranges of attribute values for product case retrieval by representing uncertain design attributes with fuzzy set theory; (2) to construct an efficient indexing structure that generates the index set of similar cases, based on a discretized highest similarity method improved with two proposed strategies; and (3) to establish similarity estimation models for different types of attributes and to calculate the information content of each attribute, based on the Information Axiom, to evaluate the adaptability of cases. The applicability of the proposed model is demonstrated through a case study of similar case retrieval for the low-carbon design of a vacuum pump.

Keywords

Low-carbon design, case-based reasoning, fuzzy requirement information, indexing structure, adaptability estimation

Introduction

In the 21st century, human society faces significant challenges from climate change. The IPCC’s Fourth Assessment Report pointed out that climate change and global warming are mainly caused by the large amount of greenhouse gases generated by human activities, of which industrial production and manufacturing account for a large proportion.¹ The International Academy for Production Engineering (CIRP) established the low-carbon manufacturing working group (CWG) in Paris, France, in January 2008. It aims to guide enterprises to take measures to reduce carbon emissions throughout the product life cycle, including energy saving and emission reduction at the manufacturing stage, green supply chains, services at the use stage, and component recycling at the end-of-life stage.² The preliminary design stage of a product determines about 80% of the economic cost and the environmental and social impacts over its whole life cycle.³ Low-carbon product design takes environmental indicators into account during the product planning and conceptual design stages, and puts forward strategies and ideas for generating new design schemes with the purpose of reducing carbon emissions. It has attracted intense research interest in both academia and industry.^4–6

Research on low-carbon product design mainly focuses on carbon footprint estimation,^7–9 low-carbon design theory and methodology,^10,11 and low-carbon optimization.^12–14 Carbon footprint, the sum of greenhouse gas emissions at each life cycle stage, is a widely accepted quantitative metric for evaluating the environmental impact of a product. Song and Lee¹⁵ estimated the life cycle carbon footprint of parts based on the design information obtained from the bill of material (BOM), and established the greenhouse gases-BOM (g-BOM) to identify parts with high carbon emissions, which indicates the direction for subsequent improvement and optimization of the parts. Devanathan et al.¹⁶ combined the quality function deployment (QFD) and life cycle assessment (LCA) methods to construct a function-environment impact matrix, which enables designers to evaluate the environmental impact of product functions at the design stage. However, because design quality characteristics are coupled, incorporating low-carbon requirements inevitably breaks the original design equilibrium and leads to complex design conflicts. For instance, modifying a component to reduce its carbon emissions may increase the economic cost and degrade the mechanical properties. Therefore, a conflict-solving method is needed to coordinate the design contradictions and to generate design strategies that trade off carbon footprint, cost, and functional performance. Commonly used methods for solving contradictory problems include the theory of inventive problem solving (TRIZ, a Russian acronym),^17,18 Extenics,^19–21 constraint satisfaction problems (CSP),²² and the negotiation method.^23,24

The abovementioned studies indicate that carbon footprint estimation, the generation of low-carbon design schemes by solving contradictory problems, and low-carbon optimization all depend on preceding design knowledge. That is, low-carbon design is a process of reasoning and innovation on the basis of product case knowledge. Thus, integrating the case-based reasoning (CBR) technique with design conflict-solving methods can improve design efficiency,²⁵ and accurately acquiring the similar case knowledge that corresponds to the low-carbon requirements is a prerequisite for low-carbon design. This work presents an effective similarity determination model that obtains similar product cases in consideration of both similarity and adaptability, for knowledge reuse and adaptation in low-carbon product design.

The CBR method reuses prior case knowledge to solve similar design problems. It stores product cases, together with the solutions to previously solved problems, in the case base, rather than extracting complex rule knowledge as the rule-based method does.^26,27 Although the CBR method requires less knowledge extraction in the knowledge acquisition task, it needs to collect a large number of product cases that widely cover the design problem and solution spaces in order to handle newly emerging design problems, which contributes to the retrieval utility problem of the case base. The retrieval utility problem refers to the trade-off between accuracy and efficiency:²⁸ once the number of product cases reaches the saturation point, the retrieval accuracy is essentially stable, and adding further cases only decreases the retrieval efficiency. On the other hand, it is unreasonable to select candidate cases by the single metric of similarity, because the most similar candidate case may not be the most appropriate one for adaptation. Consequently, additional knowledge is required to construct a similarity determination model for the candidate cases that considers both similarity and adaptability.^29,30

The most commonly used case retrieval method is the nearest neighbor algorithm and its various improved variants.^31,32 The nearest neighbor method is widely adopted for flat case bases because of its simplicity and practicability. However, it needs to calculate the distance between the query case and every case in the case base (memory) before making the final decision, so it consumes extensive search time on a large case base. Compared with the flat case base, the structure tree indexing method is more suitable for case retrieval in a hierarchically structured case base. It hierarchically divides the case base into multiple subsets based on attribute nodes, so only the subset whose cases have attribute values falling within the value range of the node attributes needs to be searched. Hjaltason and Samet³³ summarized various structure tree indexing methods, and Stephane et al.³⁰ pointed out the complexity in case base maintenance caused by structural expansion under dynamic case addition, and revealed the limitation in processing overlapping cases. In addition, some machine learning methods have been applied to case retrieval, such as the neural network classification method,³⁴ the genetic algorithm,³⁵ and the clustering method.^36,37

To address the retrieval utility problem of the case base, three measures are generally adopted:²⁸ (1) constructing an indexing structure for the case base; in this approach the number of cases is not limited, and the indexing structure can improve retrieval efficiency without reducing accuracy, although it is not easy to construct an appropriate indexing structure according to the attribute properties of the cases; (2) formulating a case addition principle, that is, controlling the number of cases in the case base by assessing whether a new case satisfies the addition principle; and (3) formulating a case deletion principle, that is, controlling the number of cases by evaluating whether existing cases should be removed from the case base. The difficulty of the latter two measures lies in formulating the evaluation criteria for these principles. Regarding the adaptability of similar cases, Smyth and Keane²⁹ pointed out that selecting a candidate case requires additional knowledge beyond similarity, and they proposed the concept of adaptation-guided retrieval. Leake et al.³⁸ further studied the relationship between similarity and adaptability. Rousu and Aarts³⁹ incorporated the adaptation cost into the evaluation indicators of similarity and adaptability. Roldan Reyes et al.⁴⁰ reported that CSP and CBR depend on general knowledge and contextual knowledge, respectively, and integrated the two methods to address adaptation problems in engineering design. Qi et al.⁴¹ combined the support vector machine (SVM) with adaptability-related knowledge to obtain adaptable cases; this knowledge comprises the adaptability characteristics of old cases and guidelines for constructing the adaptability-involving SVM adaptation model with respect to the retrieved cases.

In addition, at the early stage of low-carbon product design, designers face fuzzy low-carbon requirements, incomplete design information, and potentially complex design conflicts. Even experienced domain experts cannot determine precise values for the design attributes, but they roughly know the possible design ranges, such as the lower and upper boundaries of the attribute values within which decisions on the design parameters still satisfy the design targets. Giachetti et al.⁴² described how the expression of design information evolves across design stages, from linguistic variables, to fuzzy numbers, to final real numbers. This also demonstrates that the essence of product design is a complex backtracking process of clarifying “ill-defined” target problems.⁴³ The rest of this article is organized as follows: in the next section, low-carbon requirements are analyzed in detail, and fuzzy set theory is used to represent the uncertain design attributes; the discretized highest similarity indexing method is then improved with two proposed strategies to construct an effective indexing structure for the low-carbon product case base; a comprehensive estimation model integrating similarity and adaptability is presented to evaluate the candidate cases; a case study on similar case retrieval for the low-carbon design of a vacuum pump is provided to demonstrate the effectiveness of the proposed method; finally, a discussion is given and conclusions are drawn, along with recommendations for further research.

Requirement analysis and representation for low-carbon product design

For low-carbon product design, it is essential to obtain similar cases from the case base according to the low-carbon requirements, to reuse the prior knowledge, and to adapt it to generate a new design scheme. The retrieval attributes therefore need to be extracted from the requirement information. In addition, because the design information is incomplete and unascertained at the early design stage, this paper converts the different kinds of descriptions of the requirement information into standardized fuzzy numbers.

Low-carbon requirement analysis

Low-carbon requirements comprise the traditional customer preferences, such as the appearance attributes of the product and its general quality performance, which are named the voice of customers (VOC), and the low-carbon demands, such as less material usage, low energy consumption, and lower carbon emissions. Mausi et al.⁴⁴ summarized the customer requirements for the environment, defined as the voice of environment (VOE), and the quality characteristics (QCs) for environmentally conscious product design (Eco-design), as listed in Table 1. Low-carbon design is a branch of Eco-design. Although the VOC differs from product to product, the VOE and the corresponding QCs involved in the design requirements are basically similar across products. Thus, this study incorporates the low-carbon related VOE and QCs into the design requirement analysis.

Table 1.

The customer requirements and quality characteristics for the environment.⁴⁴

Requirements for the environment	Quality characteristics
Less material usage	Mass
Less energy consumption	Volume
Use of environmental friendly materials	Number of parts
Use of environmental friendly technology	Number of material types
Easy to transport and retain	Mechanical properties
Easy to process and assemble	Physical lifetime
High durability	Amount of energy consumption
Easy to reuse	Rate of recycled materials
Easy to disassemble	Toxicity of materials
Easy to sort	Degree of air pollutant
Harmless to the living environment	Degree of water pollutant
Safe to incinerate	Degree of soil pollutant
Safe to landfill	Biodegradability
Safe to emission	Noise, electromagnetic wave

In addition, the QFD method is a useful design tool for collecting vaguely described needs and deploying them to the actual design work. The relationship between low-carbon requirements and quality characteristics can be established based on the house of quality (HoQ), as shown in Figure 1. From the deployment of the low-carbon requirements, designers can identify the important quality characteristics with high relative weight, and extract them as the retrieval attributes for searching similar case knowledge for low-carbon design.

Figure 1.

Mapping relationship of low-carbon requirements and QCs based on the HoQ.

Representation of the uncertain requirement information with fuzzy set theory

In the life cycle-oriented low-carbon design process, the requirement information comes not only from the product users but also from the feedback of the workers in charge of product manufacturing, assembly, sales, maintenance, and recycling. Some of this requirement information can be expressed quantitatively, for instance, by specifying a numerical attribute value as a crisp number or by designating a required range. Other requirement information cannot be measured quantitatively and has to be expressed with a linguistic term. For example, linguistic terms can describe the low-carbon and environmental impact (LCEI) of the product as “LCEI is poor”, “LCEI is fair”, or “LCEI is good”.

The different types of descriptions of the requirement information, including precise numerical values, ranges of numerical values, and linguistic terms, make it difficult to construct evaluation criteria for the similarity and adaptability of cases. Fortunately, all of the above requirement information can be measured quantitatively based on fuzzy set theory.^45,46 A fuzzy set in a domain U is defined by a membership function μ, which takes values in the range [0, 1]; μ(x) indicates the degree to which x is a possible value in the fuzzy subset. In this paper, the uncertain attribute values of the requirement information are transformed into the standardized triangular fuzzy number, named STFN (Triangle), or the standardized trapezoidal fuzzy number, named STFN (Trapezoid), as illustrated in Figure 2.

Figure 2.

Membership functions for the standardized fuzzy number conversion: (a) membership function of STFN (Triangle) and (b) membership function of STFN (Trapezoid).

Here, the STFN (Trapezoid) is taken as an example to describe the transformation. For an attribute value A, the transformation result based on fuzzy set theory is represented as STFN (A) = (a^l, a^m, a^n, a^u), with a^l ≤ a^m ≤ a^n ≤ a^u ≤ u, as shown in Figure 2(b), where the domain of the fuzzy set is U = [0, u] and the membership function is given in equation (1).

\mu(x) = \begin{cases} \dfrac{x - a^{l}}{a^{m} - a^{l}}, & a^{l} \leq x \leq a^{m} \\ 1, & a^{m} \leq x \leq a^{n} \\ \dfrac{a^{u} - x}{a^{u} - a^{n}}, & a^{n} \leq x \leq a^{u} \\ 0, & \text{otherwise} \end{cases}

(1)

When a^l = a^m = a^n = a^u, A is a crisp numerical value expressed as a degenerate STFN (Trapezoid); when a^l = a^m and a^n = a^u, A is a range value; and when a^m = a^n, the STFN (Trapezoid) is equal to the STFN (Triangle).
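The membership function in equation (1) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function name `trapezoid_membership` is hypothetical, and the plateau is checked first so that the degenerate cases listed above (crisp number, range value) never divide by zero, which is an assumption about how those cases should be handled.

```python
def trapezoid_membership(x, al, am, an, au):
    """Membership degree of x in STFN (Trapezoid) (al, am, an, au), following equation (1).

    Assumption: the plateau [am, an] is tested first, so degenerate edges
    (al == am or an == au) return 1 on the plateau instead of dividing by zero.
    """
    if not (al <= am <= an <= au):
        raise ValueError("expected al <= am <= an <= au")
    if am <= x <= an:            # plateau: full membership
        return 1.0
    if al <= x < am:             # rising edge
        return (x - al) / (am - al)
    if an < x <= au:             # falling edge
        return (au - x) / (au - an)
    return 0.0                   # outside the support [al, au]
```

For instance, a value halfway up the rising edge of (0.6, 0.7, 0.7, 0.8) has a membership degree of about 0.5, while every point of a range value such as (0.4, 0.4, 0.8, 0.8) has membership 1.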

With U = [0, 1], a linguistic term can be quantitatively expressed by an STFN (Triangle) or STFN (Trapezoid) as described in Figure 2(a) and Table 2, and a fuzzy numerical value can also be converted into an STFN (Triangle) or STFN (Trapezoid) as expressed in Table 3.

Table 2.

Descriptions of the linguistic term (take the LCEI as the example).

The linguistic term	Interpretation	STFN (Triangle)	STFN (Trapezoid)
Very poor (VP)	LCEI of the candidate case is very poor	0.0, 0.0, 0.25	0.0, 0.0, 0.0, 0.25
Poor (P)	LCEI of the candidate case is poor	0.0, 0.25, 0.50	0.0, 0.25, 0.25, 0.50
Fair (F)	LCEI of the candidate case is fair	0.25, 0.50, 0.75	0.25, 0.50, 0.50, 0.75
Good (G)	LCEI of the candidate case is good	0.50, 0.75, 1.0	0.50, 0.75, 0.75, 1.0
Very good (VG)	LCEI of the candidate case is very good	0.75, 1.0, 1.0	0.75, 1.0, 1.0, 1.0

Table 3.

Descriptions of the fuzzy numerical values.

Fuzzy numerical values	Interpretation	STFN (Triangle)	STFN (Trapezoid)
About 0.7	the attribute value is about 0.7	0.6, 0.7, 0.8	0.6, 0.7, 0.7, 0.8
Range (0.4, 0.8)	the attribute value is in range (0.4, 0.8)	0.4, 0.6, 0.8	0.4, 0.6, 0.6, 0.8
At least 0.8	the attribute value is at least 0.8	0.8, 1.0, 1.0	0.8, 1.0, 1.0, 1.0
At most 0.4	the attribute value is at most 0.4	0.0, 0.0, 0.4	0.0, 0.0, 0.0, 0.4
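As a small illustration of how Tables 2 and 3 could be stored for retrieval, the sketch below encodes the STFN (Triangle) column as lookup tables and derives the STFN (Trapezoid) column by repeating the middle value (a^m = a^n). The dictionary and function names are hypothetical; the numbers are copied from the tables.

```python
# STFN (Triangle) values copied from Table 2 (linguistic terms for LCEI).
LCEI_TRIANGLE = {
    "VP": (0.0, 0.0, 0.25),
    "P": (0.0, 0.25, 0.50),
    "F": (0.25, 0.50, 0.75),
    "G": (0.50, 0.75, 1.0),
    "VG": (0.75, 1.0, 1.0),
}

# STFN (Triangle) values copied from Table 3 (fuzzy numerical values).
FUZZY_VALUE_TRIANGLE = {
    "about 0.7": (0.6, 0.7, 0.8),
    "range (0.4, 0.8)": (0.4, 0.6, 0.8),
    "at least 0.8": (0.8, 1.0, 1.0),
    "at most 0.4": (0.0, 0.0, 0.4),
}


def triangle_to_trapezoid(triangle):
    """Express a triangular fuzzy number as a trapezoid with a^m = a^n."""
    al, am, au = triangle
    return (al, am, am, au)


LCEI_TRAPEZOID = {term: triangle_to_trapezoid(t) for term, t in LCEI_TRIANGLE.items()}
```

Every STFN (Trapezoid) entry in the two tables is reproduced by this conversion, for example `LCEI_TRAPEZOID["G"]` gives `(0.5, 0.75, 0.75, 1.0)`.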

The improved D-HS indexing method for the index set of similar cases

An indexing method gathers a group of similar cases for similarity calculation, and thus avoids an exhaustive search of the whole memory. In this paper, an efficient indexing method named Discretized Highest Similarity (D-HS)⁴⁷ is introduced. Because this general approach shows weaknesses in application, two effective strategies are proposed to construct an improved indexing structure of the case base for low-carbon design.

Principle of the D-HS indexing method

The D-HS method divides the value range of each attribute of the product cases into intervals, and assigns spatial coordinates to the cases in the memory accordingly. Cases with the same coordinates as the query case constitute the similar group. The simple indexing structure of the D-HS method facilitates the maintenance of the case base, and it can effectively address the utility problem.

Let C represent the low-carbon case base, which is composed of N product cases (c_i); each case is a vector of M attributes (c_j), as expressed in equations (2) and (3).

C = \{ c_{i},\ i = 1, 2, \dots, N \}

(2)

c = [c_{j}], j = 1, 2, \dots, M

(3)

where c_j(c_i) denotes the value of attribute c_j of case c_i.

Let q denote the low-carbon query case, that is, the target case, which consists of M attributes q_i; the query case can then be described as below.

q = [q_{i}], i = 1, 2, \dots, M

(4)

where q_i(q) denotes the value of attribute q_i of the query case q.

Let S represent the index set of similar cases, generated by the indexing rule R based on the D-HS principle as expressed in equation (5).

\begin{aligned} S &= R(q) = \bigcap_{i = 1}^{M} C^{i}, \\ C^{i} &= \left\{ c_{j} \in C : c_{i}(c_{j}) \in \left[ q_{i}(q) - \frac{\Delta\delta_{i}}{2},\ q_{i}(q) + \frac{\Delta\delta_{i}}{2} \right] \right\} \end{aligned}

(5)

where Δδ_i denotes the interval width of the i-th attribute, and Cⁱ is the subset of cases whose i-th attribute value falls within the same interval as that of the query case.

Based on the principle, the indexing rule R can be described in detail as equation (6).

\begin{aligned} S &= R(q) = \bigcap_{i = 1}^{M} C_{l_{i}}^{i}, \\ C_{l_{i}}^{i} &= \left\{ c_{j} \in C : c_{i}(c_{j}),\ q_{i}(q) \in \left[ (l_{i} - 1)\Delta\delta_{i},\ l_{i}\Delta\delta_{i} \right] \right\}, \\ \Delta\delta_{i} &= \frac{1}{L_{i}}, \quad l_{i} = 1, \dots, L_{i}, \quad j = 1, 2, \dots, N \end{aligned}

(6)

where L_i denotes the number of intervals of the i-th attribute, and l_i is the index of an interval among the L_i intervals.

In equation (6), each attribute is discretized by designating intervals. The number of intervals of a nominal attribute equals the number of its discrete attribute values, while the number of intervals of a numerical attribute is predefined by the designer. Take a two-dimensional indexing space as an example: M = 2, c = [c₁, c₂] = [c_x, c_y], and L_x = L_y = 4. The discrete intervals of c_x are [0, 0.25], (0.25, 0.5], (0.5, 0.75], (0.75, 1], and the corresponding subsets of cases are $\{C_{1}^{x}, C_{2}^{x}, C_{3}^{x}, C_{4}^{x}\}$. The discrete intervals of c_y are likewise [0, 0.25], (0.25, 0.5], (0.5, 0.75], (0.75, 1], and the corresponding subsets of cases are $\{C_{1}^{y}, C_{2}^{y}, C_{3}^{y}, C_{4}^{y}\}$. The two-dimensional indexing structure is shown in Figure 3. Suppose the query case falls in the interval with index 3 of attribute c_x and in the interval with index 3 of attribute c_y; then the cases falling within the intersection constitute the index set S, that is, $S = C_{3}^{x} \cap C_{3}^{y}$ = {c₄, c₇, c₁₂, c₁₅}.

Figure 3.

The indexing structure of the D-HS method, adapted from reference 47: (a) the distribution of cases in two-dimensional space and (b) the scaled-up view of the index set S.
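The equal-width indexing rule of equation (6) can be sketched as a dictionary keyed by interval coordinates. The sketch below is only an illustration under stated assumptions: attribute values are normalized to [0, 1], intervals are right-closed as in the two-dimensional example, all function names are hypothetical, and the four toy cases are invented for the demonstration (they are not the cases of Figure 3).

```python
import math
from collections import defaultdict


def interval_index(value, n_intervals):
    """1-based index of the equal-width interval containing value.

    Intervals are [0, w], (w, 2w], ..., ((L-1)w, 1] with w = 1/L.
    The product is rounded before taking the ceiling to absorb floating-point noise.
    """
    if not 0.0 <= value <= 1.0:
        raise ValueError("attribute values are assumed normalized to [0, 1]")
    return max(1, math.ceil(round(value * n_intervals, 9)))


def coordinates(values, n_intervals):
    """Interval coordinates (l_1, ..., l_M) of one case or of the query."""
    return tuple(interval_index(v, L) for v, L in zip(values, n_intervals))


def build_index(cases, n_intervals):
    """Map each coordinate tuple to the ids of the cases located there."""
    index = defaultdict(list)
    for case_id, values in cases.items():
        index[coordinates(values, n_intervals)].append(case_id)
    return index


def query_index(index, q_values, n_intervals):
    """Index set S = R(q): cases sharing all interval coordinates with q."""
    return index.get(coordinates(q_values, n_intervals), [])


# Hypothetical two-attribute cases with L_x = L_y = 4.
toy_cases = {"c1": (0.10, 0.20), "c2": (0.60, 0.70), "c3": (0.55, 0.62), "c4": (0.90, 0.10)}
toy_index = build_index(toy_cases, (4, 4))
S = query_index(toy_index, (0.70, 0.55), (4, 4))
```

Here the query falls in interval 3 of both attributes, so `S` contains `c2` and `c3`. Looking up one dictionary key replaces a distance computation against every case, which is the efficiency gain the method relies on.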

The D-HS method limits similarity evaluation to the cases in the index set S, which greatly improves retrieval efficiency. It is also easy to maintain the case base when cases are added or removed. However, the indexing structure of D-HS in Figure 3 has limitations in retrieval accuracy, which arise from two aspects: (1) the interval width of each attribute is uniform, whereas the attribute values of cases are generally not evenly distributed; and (2) when the query case lies close to an interval boundary, similar cases in adjacent intervals are easily neglected.

Strategies to improve the D-HS method

Strategy of keeping an approximately equal number of cases in each attribute interval

Equation (6) shows that the intervals of a given attribute have equal widths, that is, each interval of the i-th attribute has width 1/L_i. However, this does not ensure that the values of the i-th attribute are evenly distributed over the intervals, which significantly affects the retrieval result. For instance, most cases may fall within one interval while only a few cases are sparsely distributed over the others. In this situation, the index set S may gather most of the cases, so that the indexing approach no longer reduces the similarity calculation compared with searching the whole memory. Conversely, S may contain no cases at all.

Fayyad and Irani⁴⁸ proposed a multi-interval discretization method for continuous numerical attributes based on information entropy, and constructed the MDLP interval division criteria. Galushka and Patterson⁴⁷ applied this method to construct a new indexing structure, named D-HS^E, and proposed a mechanism for selecting between D-HS^M and D-HS^E according to the distribution of the cases; the former has better retrieval efficiency, and the latter has higher accuracy. Based on the principle of the D-HS method, this study proposes a strategy that keeps the number of cases approximately equal in each attribute interval, so that cases are evenly distributed in the new indexing structure. The procedure is as follows.

For the i-th attribute, the cases are sorted by attribute value from smallest to largest, and the number of cases in each interval is defined as Int(N/L_i), where Int() is the rounding function. The lower boundary of each interval is then the minimum attribute value of the cases falling within the interval, and the upper boundary is the maximum attribute value. The indexing rule is improved as in equation (7).

\begin{aligned} S &= R(q) = \bigcap_{i = 1}^{M} C_{l_{i}}^{i}, \\ C_{l_{i}}^{i} &= \left\{ c_{j} \in C : c_{i}(c_{j}),\ q_{i}(q) \in \left[ \left[ c_{i}(c_{((l_{i} - 1)\Delta\delta_{i})}),\ c_{i}(c_{(l_{i}\Delta\delta_{i})}) \right] \cup \left[ c_{i}(c_{((L_{i} - 1)\Delta\delta_{i})}),\ 1 \right] \right] \right\}, \\ \Delta\delta_{i} &= \mathrm{Int}(N / L_{i}), \\ c_{i}(c_{0}) &= 0, \quad l_{i} = 1, \dots, (L_{i} - 1), \quad j = 1, 2, \dots, N \end{aligned}

(7)

where N is the number of cases; L_i is the predefined number of intervals of the i-th attribute; and $c_{i}(c_{(l_{i}\Delta\delta_{i})})$ denotes the attribute value of the (l_iΔδ_i)-th case in the sorted sequence.
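The boundary construction in equation (7) can be illustrated with a short sketch. It rests on several assumptions that the text does not fix: Int() is taken as the floor, intervals are treated as right-closed when a value is assigned to an interval, and the function names are hypothetical. The sample values are ten attribute values in [0, 1] chosen for illustration.

```python
from bisect import bisect_left


def equal_frequency_edges(values, n_intervals):
    """Interval boundaries 0, c(step), c(2*step), ..., c((L-1)*step), 1 as in equation (7).

    Assumption: step = Int(N / L) is computed with floor division.
    """
    step = len(values) // n_intervals
    if step < 1:
        raise ValueError("need at least one case per interval")
    ordered = sorted(values)
    # The (l * step)-th case in the sorted sequence is ordered[l * step - 1].
    inner = [ordered[l * step - 1] for l in range(1, n_intervals)]
    return [0.0] + inner + [1.0]


def interval_counts(values, edges):
    """Number of values per interval, treating intervals as right-closed."""
    inner = edges[1:-1]
    counts = [0] * (len(inner) + 1)
    for v in values:
        counts[bisect_left(inner, v)] += 1
    return counts


sample = [0.30, 0.30, 0.15, 0.20, 0.80, 0.40, 0.80, 0.20, 0.80, 0.80]
edges = equal_frequency_edges(sample, 4)
counts = interval_counts(sample, edges)
```

With these ten values and four intervals, the boundaries become 0, 0.20, 0.30, 0.40, and 1, and the interval counts are 3, 2, 1, and 4. The counts are not exactly equal because several cases share the same attribute value, so a tied value cannot be split across two intervals.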

However, the number of cases in each attribute interval is usually not exactly equal, because several cases may share the same attribute value. The number of cases in each attribute interval therefore has to be adjusted, and the number of attribute intervals may even have to be redefined. To this end, a criterion is constructed to identify whether the attribute values of the cases are evenly distributed over the intervals of a given attribute, as expressed in equation (8).

A_{i}^{Well} = \begin{cases} 1, & \dfrac{\sigma_{i}}{\sigma_{\max(i)}} < \sigma^{*} \\ 0, & \dfrac{\sigma_{i}}{\sigma_{\max(i)}} \geq \sigma^{*} \end{cases}

(8)

In equation (8), $A_{i}^{Well}$ = 1 indicates that the values of the i-th attribute of the cases are approximately evenly distributed, whereas $A_{i}^{Well}$ = 0 indicates that the interval division for the i-th attribute is inappropriate, so the number of intervals L_i and the number of cases in each interval need to be adjusted. σ_i denotes the standard deviation of the counts of values of the i-th attribute falling within the predefined L_i intervals, calculated as in equation (9); σ_max(i) denotes the standard deviation in the scenario where all values of the i-th attribute fall into a single interval, which by equation (9) gives $σ_{max (i)} = N / \sqrt{L_{i}}$; σ* is a threshold that separates acceptably even distributions from poor ones.

σ_{i} = \sqrt{\frac{L_{i} \sum_{l_{i} = 1}^{L_{i}} n_{l_{i}}^{2} - {(\sum_{l_{i} = 1}^{L_{i}} n_{l_{i}})}^{2}}{L_{i} (L_{i} - 1)}}

(9)

where n_{l_i} denotes the number of cases falling within the interval with index l_i of the i-th attribute.
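Equations (8) and (9) translate directly into a few lines of code. The sketch below is illustrative only: the function names are hypothetical, and the threshold value 0.2 used in the example is an assumed figure, since no value of σ* is given here.

```python
import math


def count_std(counts):
    """Standard deviation of the interval counts, written as in equation (9)."""
    L = len(counts)
    if L < 2:
        raise ValueError("at least two intervals are required")
    total = sum(counts)
    return math.sqrt((L * sum(n * n for n in counts) - total ** 2) / (L * (L - 1)))


def distribution_ratio(counts):
    """sigma_i / sigma_max(i), where sigma_max(i) = N / sqrt(L_i)."""
    total = sum(counts)
    if total == 0:
        raise ValueError("the case base is empty")
    return count_std(counts) / (total / math.sqrt(len(counts)))


def well_distributed(counts, threshold):
    """A_i^Well from equation (8): 1 if the ratio is below the threshold, else 0."""
    return 1 if distribution_ratio(counts) < threshold else 0


SIGMA_STAR = 0.2  # hypothetical threshold chosen only for this example
```

With ten cases in four intervals, the worst case `[10, 0, 0, 0]` gives a ratio of 1, the counts `[3, 3, 2, 2]` give about 0.115, and `[3, 2, 1, 4]` gives about 0.258. Under the assumed threshold of 0.2, only the second division would be accepted.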

Strategy of similarity matching for attributes

When the query case q does not fall near the center of its intervals, as in Figure 3, it lies close to the boundaries of the intersection region. In this scenario, cases c₁, c₃, c₅, and c₁₉ are not included in the index set S, although they are closer to the query case q than some cases in S, which results in retrieval inaccuracy.

Brodu⁴⁹ proposed a spherical indexing method for neighboring cases. For the two-dimensional attributes in Figure 3(b), for example, it draws a circle centered on the location of the query case q, and the cases inside the circle constitute the index set S. However, this query sphere algorithm is not applicable to a hyper-polyhedron in which the lower and upper boundary values of one or more attribute intervals are asymmetric with respect to the recommended points. In this paper, a simple and effective strategy of similarity matching for attributes is proposed to obtain the similar candidate cases near the interval edges.

Suppose there are 10 cases in the case base with four attributes, and each attribute is divided into four intervals. Let the attribute values of the candidate cases and the query case be c₁(0.30, 0.55, 0.40, 0.15), c₂(0.30, 0.85, 0.20, 0.65), c₃(0.15, 0.35, 0.30, 0.95), c₄(0.20, 0.60, 0.35, 0.90), c₅(0.80, 0.60, 0.65, 0.70), c₆(0.40, 0.45, 0.65, 0.80), c₇(0.80, 0.30, 0.35, 0.85), c₈(0.20, 0.90, 0.70, 0.65), c₉(0.80, 0.70, 0.20, 0.20), c₁₀(0.80, 0.30, 0.40, 0.80), and q = (0.20, 0.30, 0.30, 0.80), which can be arranged as in Table 4.

Table 4.

Attribute values matching between the candidate cases and query case.

Attributes \ Intervals	0.0–0.25	0.25–0.50	0.50–0.75	0.75–1.0
A ₁	q, c₃, c₄, c₈	c₁, c₂, c₆		c₅, c₇, c₉, c₁₀
A ₂		q, c₃, c₆, c₇, c₁₀	c₁, c₄, c₅, c₉	c₂, c₈
A ₃	c₂, c₉	q, c₁, c₃, c₄, c₇, c₁₀	c₅, c₆, c₈
A ₄	c₁, c₉		c₂, c₅, c₈	q, c₃, c₄, c₆, c₇, c₁₀

In Table 4, every attribute value of case c₃ matches that of the query case q; cases c₄, c₇, and c₁₀ each have three attribute values falling within the same intervals as those of the query case q; case c₆ has two matched attribute values, namely those of attributes A₂ and A₄; and cases c₁ and c₈ each have only a single attribute value matched with that of the query case q. It can be concluded that cases with more matched attribute values, such as c₃, c₄, c₇, and c₁₀, are likely to be closer to the query case q. Based on this principle, the strategy of similarity matching for attributes can be quantitatively modeled as in equation (10).

SMA (c_{i}, q) = \sum_{j = 1}^{M} Match (c_{j} (c_{i}), q_{j} (q))

(10)

where SMA(c_i, q) is the number of matched attributes between case c_i and the query case q, and Match() is a matching function. Based on the strategy of keeping an approximately equal number of cases in each attribute interval, as expressed in equation (7), the matching function is constructed as below.

If $c_{j}(c_{i}),\ q_{j}(q) \in \left[ \left[ c_{j}(c_{((l_{j} - 1)\Delta\delta_{j})}),\ c_{j}(c_{(l_{j}\Delta\delta_{j})}) \right] \cup \left[ c_{j}(c_{((L_{j} - 1)\Delta\delta_{j})}),\ 1 \right] \right]$,

then Match(c_j(c_i), q_j(q)) = 1;

If $\left\{ \begin{matrix} \left[ c_{j}(c_{i}) \in \left[ c_{j}(c_{((l_{j1} - 1)\Delta\delta_{j})}),\ c_{j}(c_{(l_{j1}\Delta\delta_{j})}) \right] \right] \cap \left[ c_{j}(c_{i}) \notin \left[ c_{j}(c_{((L_{j} - 1)\Delta\delta_{j})}),\ 1 \right] \right] \\ \left[ q_{j}(q) \in \left[ c_{j}(c_{((l_{j2} - 1)\Delta\delta_{j})}),\ c_{j}(c_{(l_{j2}\Delta\delta_{j})}) \right] \right] \cap \left[ q_{j}(q) \notin \left[ c_{j}(c_{((L_{j} - 1)\Delta\delta_{j})}),\ 1 \right] \right] \end{matrix} \right.$ and $l_{j1} \neq l_{j2}$,

then Match(c_j(c_i), q_j(q)) = 0.

Based on the strategy of similarity matching for attributes, a new index set of similar cases S_b is constructed as in equation (11), which supplements the index set S obtained from equation (7). By setting the threshold Const on the number of matched attributes, it effectively avoids neglecting similar cases when the query case q is located near the boundaries of the intervals. Finally, the index set of similar cases S₀ is obtained as in equation (12).

S_{b} = \{ c_{i} : SMA(c_{i}, q) \geq Const,\ Const \leq M \}

(11)

S_{0} = \{ c_{i} : c_{i} \in S \cup S_{b} \}

(12)
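Equations (10) to (12) can be checked against the ten cases of Table 4 with a short sketch. For simplicity it uses the equal-width intervals shown in Table 4 rather than the equal-frequency boundaries of equation (7), treats the intervals as right-closed, and takes Const = 3 as an assumed threshold; the function and variable names are hypothetical.

```python
from bisect import bisect_left

INNER_EDGES = [0.25, 0.50, 0.75]  # interval boundaries used in Table 4

CASES = {
    "c1": (0.30, 0.55, 0.40, 0.15), "c2": (0.30, 0.85, 0.20, 0.65),
    "c3": (0.15, 0.35, 0.30, 0.95), "c4": (0.20, 0.60, 0.35, 0.90),
    "c5": (0.80, 0.60, 0.65, 0.70), "c6": (0.40, 0.45, 0.65, 0.80),
    "c7": (0.80, 0.30, 0.35, 0.85), "c8": (0.20, 0.90, 0.70, 0.65),
    "c9": (0.80, 0.70, 0.20, 0.20), "c10": (0.80, 0.30, 0.40, 0.80),
}
QUERY = (0.20, 0.30, 0.30, 0.80)


def interval_of(value):
    """1-based interval index; intervals are treated as right-closed."""
    return bisect_left(INNER_EDGES, value) + 1


def sma(case, query):
    """Equation (10): number of attributes whose values share an interval."""
    return sum(interval_of(c) == interval_of(q) for c, q in zip(case, query))


def index_sets(cases, query, const):
    """Return (S, S_b, S_0) following equations (11) and (12)."""
    scores = {name: sma(values, query) for name, values in cases.items()}
    s = {name for name, k in scores.items() if k == len(query)}  # same coordinates as q
    s_b = {name for name, k in scores.items() if k >= const}
    return s, s_b, s | s_b


S, S_b, S_0 = index_sets(CASES, QUERY, const=3)
```

Only c₃ shares all four intervals with q, so S = {c₃}, while the supplementary set recovers c₄, c₇, and c₁₀, giving S₀ = {c₃, c₄, c₇, c₁₀}. A lower Const widens the candidate set at the cost of more similarity evaluations.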

Construction of the indexing structure for the low-carbon product case base

Based on the improved D-HS method, the indexing structure of the product case base for low-carbon design is established as illustrated in Figure 4.

Step 1: Sort the cases along each of the M attribute dimensions; designate the number of intervals of each attribute; and set up the attribute coordinate system for the product case base, labeled as ((0, L_i), (0, L_j), (0, L_k),…).

Step 2: Designate the coordinates of each product case according to the values of its M attributes, labeled as (l_i(c_i), l_j(c_i), l_k(c_i),…).

Step 3: Input the requirement information, namely the values of the M attributes of the query case q, and designate the attribute coordinates of the query case, labeled as (l_i(q), l_j(q), l_k(q),…).

Step 4: Generate the index set S, which contains the cases whose coordinates are the same as those of the query case, based on the indexing rule S = R(q).

Step 5: Define the threshold Const and gather the cases that satisfy the condition SMA(c_i, q)≥Const; these cases constitute the index set S_b, and the final index set is S₀ = S∪S_b.

Step 6: Evaluate the similarity and adaptability of each product case in S₀, and output the acceptable cases for knowledge reuse and adaptation. The evaluation models for the similarity and adaptability are described in the following sections.

Step 7: If the output case is successfully adapted to satisfy the design requirements, add it to the case base as a new product case with updated attribute coordinates.

Step 8: The total number of cases increases as newly adapted cases are continuously added to the case base; thus, if the current number of cases N₁ satisfies the condition N₁≥(1+10%) N, update the coordinate system of the case base.

Figure 4.

The procedure of the construction of the indexing structure for the case base.

Similarity and adaptability evaluation models for cases retrieval

Similarity evaluation

This section establishes the similarity evaluation models for the different kinds of attributes of the cases in the index set S₀, taking the fuzzy requirement information into account.

Similarity evaluation model for the precise numerical attribute

When the requirement information for a numerical attribute is ascertained, the attribute value of the query case q is a crisp number. For such a precise numerical attribute, the classical distance measure is used to calculate the local similarity, as expressed in equation (13).

sim_{1}(c_{j}, q_{j}) = 1 - \frac{| c_{j}(c_{i}) - q_{j}(q) |}{D_{j}}

(13)

Where, D_j is the difference between the maximum value and the minimum value of the j-th attribute.

Similarity evaluation model for the fuzzy numerical attribute

When the requirement information for a numerical attribute is unascertained, the attribute value is a fuzzy number, usually given as a range. In this paper, a feasible interval and a desired interval are set up for each fuzzy numerical attribute to give the designer more flexibility in deciding the parameters, and the dependent function presented in Extenics^19,50 is adopted to establish the similarity evaluation model for fuzzy numerical attributes, as expressed in equation (14).

sim_{2}(c_{k}, q_{k}) = \begin{cases} 1, & K(c_{k}(c_{i})) \geq 1 \\ e^{K(c_{k}(c_{i})) - 1}, & K(c_{k}(c_{i})) < 1 \end{cases}

(14)

Where, K(c_k(c_i)) is the dependent function for the value of the k-th attribute of case c_i.

Suppose the feasible interval X = [a, b], and the desired interval X₀ = [c, d], satisfying X₀⊆X; given the ideal point x₀ of interval X₀, then the dependent function of an arbitrary point x is calculated as in equation (15), here, x can be taken as the attribute value of case c_i, namely x = c_k(c_i).

K (x) = \frac{ρ (x, x_{0}, X)}{ρ (x, x_{0}, X) - ρ (x, x_{0}, X_{0})}

(15)

Where, ρ(x, x₀, X) is called the extension distance and describes the distance from the point x to the interval X; ρ(x, x₀, X₀) is, accordingly, the distance from the point x to the interval X₀. The calculation of the extension distance depends on the location of the ideal point x₀. Taking the feasible interval X as the example, when x₀ lies to the left of the midpoint of X, the left side extension distance ρ_l(x, x₀, X) is calculated; otherwise, the right side extension distance ρ_r(x, x₀, X) is calculated, as expressed in equation (16).

\begin{matrix} ρ_{l}(x, x_{0}, X) = \begin{cases} a - x, & x \leq a \\ \frac{b - x_{0}}{a - x_{0}}(x - a), & x \in (a, x_{0}) \\ x - b, & x \geq x_{0} \end{cases} \\ ρ_{r}(x, x_{0}, X) = \begin{cases} a - x, & x \leq x_{0} \\ \frac{a - x_{0}}{b - x_{0}}(b - x), & x \in (x_{0}, b) \\ x - b, & x \geq b \end{cases} \end{matrix}

(16)

The dependent function of each attribute value of cases has the properties as below:

when x∈X₀, then K(x)≥1, it reveals that the attribute value completely meets the requirement;

when x∈X-X₀, then 0≤K(x)<1, it indicates that the attribute value is acceptable;

and when x∉X, then K(x)<0, it reveals that the attribute value cannot satisfy the requirement.
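Equations (14)–(16) can be illustrated with a minimal Python sketch. It assumes that the side of the extension distance is chosen separately for X and X₀ according to the midpoint of each interval, which is one reading of the text above; the function names are hypothetical. The intervals and the ideal point are those of attribute A₁ in Table 5.

```python
import math


def side_distance(x, x0, lo, hi):
    """Extension distance rho(x, x0, [lo, hi]) following equation (16)."""
    if x0 < (lo + hi) / 2:
        # ideal point left of the midpoint: left side distance rho_l
        if x <= lo:
            return lo - x
        if x < x0:
            return (hi - x0) / (lo - x0) * (x - lo)
        return x - hi
    # otherwise: right side distance rho_r
    if x <= x0:
        return lo - x
    if x < hi:
        return (lo - x0) / (hi - x0) * (hi - x)
    return x - hi


def dependent_function(x, x0, feasible, desired):
    """Dependent function K(x) of equation (15)."""
    rho_x = side_distance(x, x0, *feasible)
    rho_x0 = side_distance(x, x0, *desired)
    denom = rho_x - rho_x0
    if denom == 0:
        raise ZeroDivisionError("K(x) is undefined when both distances coincide")
    return rho_x / denom


def sim_fuzzy(x, x0, feasible, desired):
    """Local similarity sim_2 of equation (14)."""
    k = dependent_function(x, x0, feasible, desired)
    return 1.0 if k >= 1 else math.exp(k - 1)


# A1 (rate of suction and exhaust): X, X0 and recommended value from Table 5
X, X0, x0 = (50, 250), (120, 180), 150
for value in (150, 100, 95, 50):
    print(value, round(sim_fuzzy(value, x0, X, X0), 3))
```

Under these assumptions the values for 100 L/s, 95 L/s and 50 L/s agree, to three decimals, with the A₁ entries of cases c₆, c₂₃ and c₅ in Table 11.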

Similarity evaluation model for the nominal attribute

The l-th nominal attribute value of case c_i and query case q are firstly transformed into the standardized triangular fuzzy number STFN(c_l(c_i)) and STFN(q_l(q)), respectively; and then the local similarity of the l-th nominal attribute is estimated based on the ratio of area of the common range A_CR to the area of design range A_STFN(DR), as expressed below.

sim_{3}(c_{l}, q_{l}) = \frac{A_{CR}}{A_{STFN(DR)}}

(17)

Where, for the l-th nominal attribute, the area of STFN(DR) is equal to the area of STFN(q_l(q)); A_CR is the overlapping section between the area of STFN(c_l(c_i)) and the area of STFN(DR).

Therefore, the global similarity of case c_i and query case q is calculated as in equation (18).

\begin{matrix} SIM(c_{i}, q) = \sum_{j = 1}^{m_{1}} ω_{j} sim_{1}(c_{j}, q_{j}) \\ + \sum_{k = 1}^{m_{2}} ω_{k} sim_{2}(c_{k}, q_{k}) + \sum_{l = 1}^{m_{3}} ω_{l} sim_{3}(c_{l}, q_{l}) \end{matrix}

(18)

Where, m₁ is the number of precise numerical attributes, m₂ is the number of fuzzy numerical attributes, and m₃ is the number of nominal attributes; ω_j, ω_k, ω_l are the weight of the corresponding attributes, ∑ω_j+∑ω_k+∑ω_l = 1. In addition, each local similarity should be normalized in order to avoid the distorted result because of the different amplitudes caused by the different similarity evaluation models.
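As a small numerical illustration of equation (18), the weighted sum can be sketched as follows. The local similarities are those of case c₂₅ in Table 11; the weights are the attribute weights that appear in the information content calculation for c₂₅ in the case study, and using the same weights in equation (18) is an assumption of this sketch. The function name is hypothetical.

```python
def global_similarity(weights, local_sims):
    """Weighted sum of local similarities, equation (18)."""
    if len(weights) != len(local_sims):
        raise ValueError("one weight is required per attribute")
    if abs(sum(weights) - 1.0) > 1e-6:
        raise ValueError("attribute weights must sum to one")
    return sum(w * s for w, s in zip(weights, local_sims))


# Weights of A1..A9 (assumed to be those used later for the information content)
weights = [0.149, 0.106, 0.136, 0.080, 0.105, 0.157, 0.087, 0.082, 0.098]
# Local similarities of case c25 for A1..A9, Table 11
c25_sims = [1.000, 1.000, 1.000, 0.607, 1.000, 0.229, 0.592, 0.057, 0.229]

print(round(global_similarity(weights, c25_sims), 3))
```

With these inputs the sum is close to the value SIM(c₂₅, q) = 0.659 reported in Table 11, which supports the assumption about the weights without confirming it.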

Adaptability evaluation

In this paper, a novel adaptability evaluation model is constructed based on fuzzy axiomatic design (FAD).^51,52 Axiomatic design (AD)⁵³ is widely used for product design and development because of its two foundational axioms, the Independence Axiom and the Information Axiom. The Independence Axiom requires that each functional requirement (FR) be independent, so that a stable product system with an uncoupled design environment can be constructed. The information in the Information Axiom refers to the probability of satisfying the FRs, which provides theoretical support for robust design, where the design system can accommodate large variations in design attribute values and yet satisfy the FRs. Based on the Information Axiom, this section constructs the evaluation model for the adaptability of the candidate cases in S₀ by estimating the information content of each attribute: the smaller the information content of a case, the better its adaptability.

The concept of information content is defined to quantify the information of the design system. For a given FR_k, the information content I_k is expressed as below.

I_{k} = \log_{2} (\frac{1}{P_{k}})

(19)

Where, P_k is the probability of achieving FR_k. When there is more than one functional requirement, the information content I of the design system is the sum of the information contents of the individual FRs. If one or more probabilities are zero, then I approaches infinity; in this scenario, the design system will not work even though the other design attributes are completely satisfied. Conversely, if all the probabilities are equal to one, then I is zero, which is an ideal design system.

In a design project, the probability of success for the design task is determined by the tolerance specified by the designer, that is, the design range (DR), and the tolerance that the product system is capable of satisfying, that is, the system range (SR). Suppose the system probability density function (pdf) of the functional requirement FR_k is known and follows the uniform distribution, as shown in Figure 5; the intersection between the design range and the system range is then the region where the acceptable solution exists. Therefore, P_k can be expressed as in equation (20).

P_{k} = \frac{A_{CR}}{A_{SR}} = \frac{L_{CR}}{L_{SR}}

(20)

Where, A_CR, A_SR are the area of common range and system range respectively; and L_CR, L_SR are the interval width of common range and system range; accordingly, the information content for the FR_k is written as below.

I_{k} = \log_{2} (\frac{L_{SR}}{L_{CR}})

(21)
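For the uniform case, equations (20) and (21) reduce to interval arithmetic, as the following sketch shows. The function name is hypothetical, and returning infinity when the design range and system range do not overlap is the sketch's way of expressing P_k = 0. The two examples use the normalized design and system ranges of attributes A₄ and A₅ of case c₂₅ given in Table 12.

```python
import math


def information_content_uniform(design_range, system_range):
    """I_k = log2(L_SR / L_CR) for a uniformly distributed FR, equation (21)."""
    d_lo, d_hi = design_range
    s_lo, s_hi = system_range
    if s_hi <= s_lo:
        raise ValueError("system range must have positive width")
    # common range = intersection of design range and system range
    common = min(d_hi, s_hi) - max(d_lo, s_lo)
    if common <= 0:
        return math.inf            # P_k = 0: the requirement cannot be met
    return math.log2((s_hi - s_lo) / common)


i_a4 = information_content_uniform((0.30, 0.45), (0.375, 0.75))    # noise
i_a5 = information_content_uniform((0.035, 0.116), (0.023, 0.067))  # cost
print(round(i_a4, 3), round(i_a5, 3))
```

For A₄ the common range is one fifth of the system range, so the information content is log₂5; both results agree with the case study values 2.322 and 0.460 to within rounding.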

When FR_k follows the random distribution, as shown in Figure 6, the probability of achieving the FR_k in the common range (CR) is expressed as equation (22).

P_{k} = \int_{CR^{l}}^{CR^{u}} Ps(FR_{k}) \, dFR_{k}

(22)

Figure 5.

Identification for DR, SR, CR and A_CR with the uniform distribution.

Figure 6.

Identification for DR, SR, CR, and A_CR with the random distribution.

Where, CR^l, CR^u are the lower boundary and upper boundary of CR, respectively; Ps denotes the system probability density function (pdf). In this context, the A_SR is equal to one, thus, the information content I_k for FR_k is calculated as in equation (23).

I_{k} = \log_{2} (\frac{1}{A_{CR}})

(23)

However, in the early design stage, the information of the design system is always incomplete and unascertained, so it is hard to obtain the precise probability density function of the design system. In this paper, we adopt the membership function to transform the DR and SR into STFN(DR) and STFN(SR) with crisp numbers, as illustrated in Figure 7, and thus the information content I_k is expressed as in equation (24).

I_{k} = \log_{2} (\frac{A_{STFN (SR)}}{A_{CR}})

(24)

Where, A_STFN(SR) is the area of STFN(SR).

Figure 7.

The standardized triangular fuzzy number transformation for the DR, SR.
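Equation (24) can be checked numerically with the sketch below. It assumes that the common area A_CR is the area under the pointwise minimum of the two triangular membership functions, and it approximates that area with a midpoint rule rather than a closed-form expression; the function names and the number of integration steps are choices of the sketch, not of the paper. The example ranges are those of attributes A₆ and A₇ of case c₂₅ in Table 12.

```python
import math


def tri_mu(x, tfn):
    """Membership of x in the triangular fuzzy number (l, m, u)."""
    l, m, u = tfn
    if x < l or x > u:
        return 0.0
    if x == m:
        return 1.0
    if x < m:
        return (x - l) / (m - l)
    return (u - x) / (u - m)


def overlap_area(a, b, steps=20000):
    """Approximate area under min(mu_a, mu_b) with a midpoint rule."""
    lo, hi = min(a[0], b[0]), max(a[2], b[2])
    h = (hi - lo) / steps
    return h * sum(
        min(tri_mu(lo + (i + 0.5) * h, a), tri_mu(lo + (i + 0.5) * h, b))
        for i in range(steps)
    )


def information_content_fuzzy(design_tfn, system_tfn):
    """I_k = log2(A_STFN(SR) / A_CR), equation (24)."""
    area_sr = 0.5 * (system_tfn[2] - system_tfn[0])   # triangle of height 1
    if area_sr <= 0:
        raise ValueError("a crisp system range needs equation (25) instead")
    a_cr = overlap_area(design_tfn, system_tfn)
    return math.inf if a_cr <= 0 else math.log2(area_sr / a_cr)


i_a6 = information_content_fuzzy((0.75, 1.0, 1.0), (0.65, 0.75, 0.85))
i_a7 = information_content_fuzzy((0.0, 0.0, 0.30), (0.10, 0.25, 0.40))
print(round(i_a6, 2), round(i_a7, 2))
```

Under this reading of A_CR, the two results are close to the values 2.806 and 1.756 reported for c₂₅ in the case study; small differences come from rounding of the area ratio in the paper and from the numerical integration.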

According to the evaluation model, it is crucial to obtain the DR, SR, and A_CR of each attribute for the information content calculation. Here, we also specify the rules for Benefit type and Cost type attributes to construct the design range.

For Benefit type attributes, that is, the attribute value is like “at least a”, or “more than a”, the design range can be constructed as in Figure 8(a), where a^l = DR^l, a^m = a^u = max(U(x)), μ(a^l) = 0, μ(a^u) = 1; DR^l denotes the lower boundary of the design range, and U(x) denotes the whole design domain.

Figure 8.

The STFN (triangle) transformation for DR of Benefit and Cost type attributes: (a) STFN(DR) of the Benefit type attribute and (b) STFN(DR) of the Cost type attribute.

For Cost type attributes, that is, the attribute value is like “at most a”, or “less than a”, the design range can be constructed as in Figure 8(b), where a^l = a^m = min(U(x)), a^u = DR^u, μ(a^l) = 1, μ(a^u) = 0; DR^u denotes the upper boundary of the design range.

In addition, when the system range of the design attribute is a crisp value, that is, x_j = c_j(c_i), the system range is constructed as in Figure 9, where the Benefit type attribute is taken as the example, and the information content is calculated as in equation (25).⁵²

I_{j} = \log_{2}(\frac{1}{P(c_{j}(c_{i}))}), \quad P(c_{j}(c_{i})) = u(x_{j}) = \begin{cases} \frac{x_{j} - a^{l}}{a^{u} - a^{l}}, & \text{Benefit type attributes} \\ \frac{a^{u} - x_{j}}{a^{u} - a^{l}}, & \text{Cost type attributes} \end{cases}

(25)

Figure 9.

Construction of A_CR for the crisp SR (the Benefit type attribute).
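Equation (25) can be sketched as below. The numbers in the example are hypothetical and serve only to show the arithmetic; clamping the probability to [0, 1] for values outside [a^l, a^u] is an assumption of the sketch that equation (25) does not state explicitly.

```python
import math


def information_content_crisp(x, a_l, a_u, kind):
    """Information content for a crisp system range x, equation (25)."""
    if a_u <= a_l:
        raise ValueError("a_u must be greater than a_l")
    if kind == "benefit":
        p = (x - a_l) / (a_u - a_l)
    elif kind == "cost":
        p = (a_u - x) / (a_u - a_l)
    else:
        raise ValueError("kind must be 'benefit' or 'cost'")
    p = min(max(p, 0.0), 1.0)      # assumption: clamp outside [a_l, a_u]
    return math.inf if p == 0 else math.log2(1 / p)


# Hypothetical values, chosen only to illustrate the formula
print(information_content_crisp(0.625, 0.25, 1.0, "benefit"))   # P = 0.5
print(information_content_crisp(0.25, 0.0, 1.0, "cost"))        # P = 0.75
print(information_content_crisp(0.10, 0.25, 1.0, "benefit"))    # P = 0
```

A value halfway up the design range of a Benefit type attribute therefore carries one bit of information content, while a value below the lower boundary gives an infinite information content.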

A case study

Low-carbon requirement analysis for the vacuum pump

The CNC cutting machine is widely used to automatically cut leather and fabric materials. Unlike in rigid material processing, however, the materials must be fixed to the working table by an adsorption force generated by a pressure difference. The design task is to configure a suitable vacuum pump for the working requirements of the cutting machine while considering the low-carbon impact on the environment throughout the product life cycle, as demanded by government policy.

For the task, twelve design requirements for the vacuum pump are identified after detailed discussion with staff from the cutting machine industry; they include traditional performance needs and low-carbon demands, together with the related quality characteristics, as illustrated in Figure 10. Based on the HoQ for the requirements analysis, designers can clearly reveal the important QCs and pay more attention to these attributes during the redesign process. This study takes the QCs with high relative weight as the retrieval attributes to obtain the similar case knowledge.

Figure 10.

The mapping analysis for low-carbon requirements and QCs based on the HoQ.

There are 151 product cases in the case base. To obtain the similar cases for knowledge reuse and adaptation, nine QCs are extracted as the retrieval attributes; the sum of their relative weights is more than 80%. The nine QCs are labeled as A₁-A₉: rate of suction and exhaust (A₁), ultimate pressure (A₂), rated power (A₃), noise (A₄), cost (A₅), low-carbon and environmental impact (A₆), failure rate (A₇), maintenance convenience (A₈), and material recycling and reuse (A₉). A₁-A₅ are numerical attributes and A₆-A₉ are nominal attributes, and each attribute value is provided with the feasible interval, the desired interval, and the recommended value according to the fuzzy low-carbon requirements, as shown in Table 5.

Table 5.

Attribute values of the vacuum pump with fuzzy low-carbon requirements.

Retrieval attributes	Feasible interval X	Desired interval X₀	Recommended value x
A ₁/(L/s)	[50, 250]	[120, 180]	150
A ₂/Pa	[0, 9000]	[3000, 7000]	5000
A ₃/kW	[7.5, 37]	[11, 18.5]	15
A ₄/dB	[68, 82]	[72, 78]	75
A ₅/Yuan	[2000, 20000]	[4000, 12000]	8000
A ₆	(0.75, 1.0, 1.0)	(0.75, 1.0, 1.0)	(0.75, 0.75, 0.75)
A ₇	(0.0, 0.0, 0.30)	(0.0, 0.0, 0.30)	(0.30, 0.30, 0.30)
A ₈	(0.75, 1.0, 1.0)	(0.75, 1.0, 1.0)	(0.75, 0.75, 0.75)
A ₉	(0.75, 1.0, 1.0)	(0.75, 1.0, 1.0)	(0.75, 0.75, 0.75)

Take the determination of the value of the rate of suction and exhaust A₁ as an example. During the cutting process, the air in the working chamber needs to be exhausted within 5–10 s, and thus the value of A₁ can be derived from equation (26).⁵⁴

t = 2.3 K \frac{V}{S} \lg (\frac{P_{0}}{P})

(26)

Where, t is the time to exhaust the air from the working chamber, P₀ is the initial pressure of the working chamber, and P is the final pressure, V is the working chamber volume, S is the value of A₁, K is the coefficient. Based on the calculation result, the rate of the suction and exhaust varies in range [50, 250]L/s. However, for the practical situation, factors such as pumping efficiency, air leakage, cost, and the demand of energy saving need to be considered, thus, in addition to the feasible interval X = [50, 250]L/s, the desired interval is also set as X₀ = [120, 180]L/s, and the recommended value is provided x = 150L/s.

The ultimate pressure (A₂) is determined by the pressure difference required to produce the adsorption force that keeps the leather and fabric materials stably fixed during the cutting process. According to the cutting machine customers, when the pressure difference is more than 10,000 Pa, there is no relative motion between the cutting materials. Thus the working pressure of the vacuum pump should be less than 90,000 Pa (taking 1 atm = 100 kPa); and according to the design principle that the ultimate pressure is one order of magnitude smaller than the working pressure, the required ultimate pressure should be less than 9000 Pa. The air leakage factor and the harmful friction in the cutting process caused by an excessive pressure difference should also be taken into account; thus, in addition to the feasible interval X = [0, 9000]Pa, the desired interval is set as X₀ = [3000, 7000]Pa, and the recommended value is x = 5000 Pa.

For the nominal attributes, the vacuum pump is required with a good low-carbon and environmental performance, low failure rate, good property in maintenance, and high rate of material recycling and reuse. These linguistic terms for the requirements are transformed into the standardized triangular fuzzy numbers, and according to the actual needs from the working situation of the cutting machine, the feasible and desired intervals for attributes can also be designated along with the recommended values.

Generation of the index set of similar cases

The indexing structure of the product case base is constructed based on the strategy of keeping an approximately equal number of cases in each attribute interval. The values of each attribute are normalized and sequenced from smallest to largest. For attributes A₁-A₆, the number of intervals is predefined as 5, and for attributes A₇-A₉, the number of intervals is predefined as 3, as illustrated in Figures 11 and 12.

Figure 11.

The distribution of cases of attributes A₁-A₅ (η_i = σ_i/σ_max(i), σ^* = 0.20).

Figure 12.

The distribution of cases of attributes A₆-A₉ (η_i = σ_i/σ_max(i), σ^* = 0.20).

Take A₁ as an example. Attribute A₁ is split into five intervals: the interval with index 1 contains 33 cases, with attribute values in the range (0, 0.019); the interval with index 2 contains 30 cases, with values in (0.019, 0.063); the interval with index 3 contains 32 cases, with values in (0.063, 0.125); the interval with index 4 contains 30 cases, with values in (0.125, 0.25); and the interval with index 5 contains 26 cases, with values in (0.25, 1). The distribution of cases for each attribute is estimated based on the criterion in equation (8). The results indicate that the ratio of σ_i to σ_max(i) is smaller than the threshold σ^* for each attribute. For A₇-A₉ in particular, although many attribute values overlap, the splitting result still ensures that these attributes belong to the “good” attributes with a basically uniform distribution.

For a clear comparison, the cases are also divided with the original interval division method of the D-HS, in which each interval width is equal to 1/L_i, where L_i is the number of intervals of the i-th attribute. The resulting distribution of cases is illustrated in Figure 13, where the number of intervals of each numerical attribute A₁-A₅ is predefined as 5, and that of each nominal attribute A₆-A₉ is predefined as 10. Most of the cases fall within the interval with index 1 for A₁-A₅, and the uniformity of the distribution of cases for attributes A₆-A₉ is also not ideal, which badly impacts the accuracy of similar case retrieval.

Figure 13.

The distribution of cases by the original attribute intervals division method: (a) distribution of cases of A₁-A₅ and (b) distribution of cases of A₆-A₉.

The distribution of cases in Figures 11 and 12 demonstrates that the predefined interval number of each attribute is acceptable, and accordingly, the coordinates system of the case base can be established as shown in Table 6; and then the coordinates of each case should be designated according to the attribute values. Take the product case c₇ as the example, the coordinates are given as described in Table 7.

Table 6.

The coordinates system of the case base.

A_i	Interval 1		Interval 2		Interval 3		Interval 4		Interval 5
	l ₁	Range	l ₂	Range	l ₃	Range	l ₄	Range	l ₅	Range
A ₁	1	0, 0.019	2	0.019, 0.063	3	0.063, 0.125	4	0.125, 0.25	5	0.25, 1
A ₂	1	0, 3E-6	2	3E-6, 0.001	3	0.001, 0.130	4	0.130, 0.20	5	0.20, 1
A ₃	1	0, 0.019	2	0.019, 0.05	3	0.05, 0.094	4	0.094, 0.188	5	0.188, 1
A ₄	1	0, 0.175	2	0.175, 0.25	3	0.25, 0.40	4	0.40, 0.50	5	0.50, 1
A ₅	1	0, 0.022	2	0.022, 0.049	3	0.049, 0.089	4	0.089, 0.179	5	0.179, 1
A ₆	1	0, 0.50	2	0.50, 0.65	3	0.65, 0.70	4	0.70, 0.75	5	0.75, 1
A ₇	1	0, 0.25	2	0.25, 0.30	3	0.30, 1	–	–	–	–
A ₈	1	0, 0.65	2	0.65, 0.70	3	0.70, 1	–	–	–	–
A ₉	1	0, 0.65	2	0.65, 0.70	3	0.70, 1	–	–	–	–

Table 7.

The coordinates l_i(c₇) of the product case c₇.

Attributes	A ₁	A ₂	A ₃	A ₄	A ₅	A ₆	A ₇	A ₈	A ₉
Values	150	6666	15	74	3200	0.75	0.30	0.75	0.75
Normalization	0.188	0.333	0.188	0.350	0.027	0.75	0.30	0.75	0.75
l_i(c₇)	4	5	4	3	2	4	2	3	3

The query case is the target case in the retrieval algorithm. Its coordinates are designated based on the recommended value of each attribute provided in Table 5, as shown in Table 8.

Table 8.

The coordinates of l_i(q) of the query case q.

Attributes	A ₁	A ₂	A ₃	A ₄	A ₅	A ₆	A ₇	A ₈	A ₉
Values	150	5000	15	75	8000	0.75	0.30	0.75	0.75
Normalization	0.188	0.25	0.188	0.370	0.075	0.75	0.30	0.75	0.75
l_i(q)	4	5	4	3	3	4	2	3	3
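The assignment of coordinates in Tables 7 and 8 can be reproduced with a short Python sketch built on the interval boundaries of Table 6. The sketch assumes that each interval is closed on the right, which is inferred from Table 7 (for example, the normalized value 0.75 of A₆ is assigned to interval 4 rather than 5); the dictionary and function names are hypothetical.

```python
from bisect import bisect_left

# Upper bounds of the intervals of each attribute, copied from Table 6
UPPER_BOUNDS = {
    "A1": [0.019, 0.063, 0.125, 0.25, 1.0],
    "A2": [3e-6, 0.001, 0.130, 0.20, 1.0],
    "A3": [0.019, 0.05, 0.094, 0.188, 1.0],
    "A4": [0.175, 0.25, 0.40, 0.50, 1.0],
    "A5": [0.022, 0.049, 0.089, 0.179, 1.0],
    "A6": [0.50, 0.65, 0.70, 0.75, 1.0],
    "A7": [0.25, 0.30, 1.0],
    "A8": [0.65, 0.70, 1.0],
    "A9": [0.65, 0.70, 1.0],
}


def coordinates(normalized):
    """Map normalized attribute values to 1-based interval indices."""
    coords = []
    for attr, value in normalized.items():
        bounds = UPPER_BOUNDS[attr]
        # first interval whose upper bound is >= value (right-closed intervals)
        idx = min(bisect_left(bounds, value), len(bounds) - 1)
        coords.append(idx + 1)
    return tuple(coords)


c7 = dict(zip(UPPER_BOUNDS, [0.188, 0.333, 0.188, 0.350, 0.027, 0.75, 0.30, 0.75, 0.75]))
q = dict(zip(UPPER_BOUNDS, [0.188, 0.25, 0.188, 0.370, 0.075, 0.75, 0.30, 0.75, 0.75]))
print(coordinates(c7))
print(coordinates(q))
```

With the right-closed convention, the two tuples coincide with the rows l_i(c₇) and l_i(q) of Tables 7 and 8, and they differ only in the coordinate of A₅.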

The cases labeled with the same coordinates as the query case are extracted to constitute the index set S, and the number of attributes matched with the query case q, SMA(c_i, q), is estimated for each case c_i, as shown in Table 9. Table 9 indicates that no case completely matches the query case in every attribute, that is, the index set S is an empty set. In this situation, the threshold is set to Const = 4 to obtain the index set S_b = {c_i: SMA(c_i, q)≥4}; thus the index set S₀ = {c_i: c_i∈S∪S_b} = S_b. There are 25 cases in S₀, the attributes of which are listed in Table 10.

Table 9.

The result of SMA(c_i, q) between the product case and query case.

SMA(c_i, q)	Number of cases	Cases corresponding to the SMA(c_i, q)
9	0	–
8	1	c₇;
7	5	c₆, c₈, c₃₉, c₄₃, c₄₄;
6	1	c₃₈;
5	4	c₉, c₁₇, c₂₃, c₁₃₄;
4	14	c₅, c₁₈, c₂₂, c₂₄, c₂₅, c₄₇, c₈₇, c₁₁₄, c₁₂₃, c₁₃₀, c₁₃₃, c₁₄₄, c₁₄₅, c₁₅₀;
3	29	c₁, c₂, c₃, c₄, c₁₀, c₁₁, c₂₆, c₂₇, c₂₈, c₃₆, c₃₇, c₄₅, c₄₆, c₈₈, c₉₄, c₉₇, c₉₈, c₁₀₂, c₁₀₃, c₁₀₆, c₁₀₈, c₁₀₉, c₁₁₀, c₁₁₅, c₁₂₆, c₁₂₇, c₁₃₈, c₁₃₉, c₁₄₉;
2	37	c₁₂, c₁₃, c₁₄, c₁₅, c₁₆, c₂₉, c₃₀, c₃₁, c₃₂, c₃₄, c₄₁, c₄₂, c₄₈, c₅₁, c₅₂, c₅₅, c₅₉, c₆₁, c₆₃, c₆₆, c₆₇, c₇₅, c₈₅, c₉₁, c₁₀₄, c₁₀₅, c₁₀₇, c₁₁₁, c₁₁₂, c₁₁₃, c₁₂₄, c₁₂₉, c₁₃₅, c₁₃₆, c₁₄₁, c₁₄₃, c₁₅₁;
1	39	c₁₉, c₂₁, c₃₃, c₃₅, c₄₀, c₅₃, c₅₄, c₅₆, c₅₇, c₆₀, c₆₂, c₆₄, c₆₅, c₆₈, c₆₉, c₇₀, c₇₁, c₇₄, c₇₆, c₇₇, c₇₈, c₈₄, c₈₆, c₉₃, c₉₅, c₉₆, c₁₀₀, c₁₀₁, c₁₁₇, c₁₁₈, c₁₂₁, c₁₂₂, c₁₂₅, c₁₂₈, c₁₃₁, c₁₃₂, c₁₄₀, c₁₄₁, c₁₄₂;
0	21	c₂₀, c₄₉, c₅₀, c₅₈, c₇₂, c₇₃, c₇₉, c₈₀, c₈₁, c₈₂, c₈₃, c₈₉, c₉₀, c₉₂, c₉₉, c₁₁₆, c₁₁₉, c₁₂₀, c₁₃₇, c₁₄₆, c₁₄₇, c₁₄₈.

Table 10.

Cases in the index set S₀.

Cases/c_i	A ₁/(L/s)	A ₂/Pa	A ₃/kW	A ₄/dB	A ₅/Yuan	A ₆	A ₇	A ₈	A ₉
c₇	150	6666	15	74	3200	0.75*	0.3*	0.75*	0.75*
c₆	100	6666	11	72	2500	0.75*	0.3*	0.75*	0.75*
c₈	200	6666	18.5	75	4900	0.75*	0.3*	0.75*	0.75*
c₃₉	138	3300	15	74	6800	0.75*	0.3*	0.65*	0.75*
c₄₃	111	3300	11	73	5500	0.75*	0.3*	0.65*	0.75*
c₄₄	138	3300	15	74	7200	0.75*	0.3*	0.65*	0.75*
c₃₈	111	3300	11	73	4050	0.75*	0.3*	0.65*	0.75*
c₉	250	6666	30	78	6000	0.7*	0.3*	0.75*	0.75*
c₁₇	100	3333	15	74	2700	0.65*	0.3*	0.75*	0.75*
c₂₃	95	3300	11	76	3800	0.75*	0.25*	0.75*	0.75*
c₁₃₄	70	2600	7.5	75	7500	0.5*	0.3*	0.75*	0.75*
c₅	50	6666	5.5	67	1950	0.75*	0.25*	0.75*	0.75*
c₁₈	200	3333	22	78	5400	0.65*	0.3*	0.75*	0.75*
c₂₂	65	3300	7.5	72	3600	0.75*	0.25*	0.75*	0.75*
c₂₄	113	3300	15	80	4500	0.75*	0.25*	0.7*	0.75*
c₂₅	130	3300	15	80	4850	0.75*	0.25*	0.7*	0.75*
c₄₇	191	8106	22	82	9400	0.65*	0.3*	0.7*	0.65*
c₈₇	150	1	15	84	18000	0.5*	0.3*	0.65*	0.75*
c₁₁₄	120	10	15	78	85000	0.85*	0.3*	0.7*	0.75*
c₁₂₃	150	0.05	2.2	80	6200	0.7*	0.3*	0.75*	0.7*
c₁₃₀	166	5489.9	15	82	7150	0.7*	0.35*	0.7*	0.7*
c₁₃₃	50	2600	5.5	72	5200	0.5*	0.3*	0.75*	0.75*
c₁₄₄	150	2600	11	80	14000	0.7*	0.3*	0.7*	0.75*
c₁₄₅	200	2600	15	80	18300	0.7*	0.3*	0.7*	0.75*
c₁₅₀	50	300	7.5	76	8200	0.7*	0.3*	0.7*	0.75*

Note: 0.25* = (0.1, 0.25, 0.4); 0.3* = (0.2, 0.3, 0.4); 0.35* = (0.2, 0.35, 0.5); 0.5* = (0.4, 0.5, 0.6); 0.65* = (0.5, 0.65, 0.8); 0.7* = (0.6, 0.7, 0.8); 0.75* = (0.65, 0.75, 0.85); 0.85* = (0.75, 0.85, 0.95).

Estimation of the similarity and adaptability

When the index set S₀ is generated, cases in the S₀ still need to be estimated from two perspectives, the similarity and adaptability according to the low-carbon requirements. For the estimation of similarity, A₁-A₅ belong to fuzzy numerical attributes, thus it adopts the dependent function to calculate the local similarity of each attribute; and for the nominal attributes A₆-A₉, the estimation model based on the membership function as constructed in equation (17) is used to calculate the local similarity of each attribute. The result of the similarity estimation for each case is listed in Table 11.

Table 11.

The result of similarity estimation for cases in S₀.

c_i	sim₂(A_i, q_i), i = 1, 2, 3, 4, 5;					sim₃(A_i, q_i), i = 6, 7, 8, 9				SIM(c_i, q)
	A ₁	A ₂	A ₃	A ₄	A ₅	A ₆	A ₇	A ₈	A ₉	SIM(c_i, q)
c₂₅	1.000	1.000	1.000	0.607	1.000	0.229	0.592	0.057	0.229	0.659
c₂₃	0.700	1.000	1.000	1.000	0.949	0.229	0.592	0.229	0.229	0.654
c₃₉	1.000	1.000	1.000	1.000	1.000	0.229	0.167	0.050	0.229	0.653
c₄₄	1.000	1.000	1.000	1.000	1.000	0.229	0.167	0.050	0.229	0.653
c₂₄	0.912	1.000	1.000	0.607	1.000	0.229	0.592	0.057	0.229	0.646
c₇	1.000	1.000	1.000	1.000	0.780	0.229	0.167	0.229	0.229	0.644
c₄₃	0.882	1.000	1.000	1.000	1.000	0.229	0.167	0.050	0.229	0.635
c₃₈	0.882	1.000	1.000	1.000	1.000	0.229	0.167	0.050	0.229	0.635
c₈	0.753	1.000	1.000	1.000	1.000	0.229	0.167	0.229	0.229	0.631
c₁₁₄	1.000	0.369	1.000	1.000	0.0001	0.914	0.167	0.057	0.229	0.589
c₆	0.751	1.000	1.000	1.000	0.550	0.229	0.167	0.229	0.229	0.583
c₁₈	0.753	1.000	0.827	1.000	1.000	0.050	0.167	0.229	0.229	0.579
c₁₇	0.751	1.000	1.000	1.000	0.619	0.050	0.167	0.229	0.229	0.562
c₁₄₄	1.000	0.875	1.000	0.607	0.779	0.057	0.167	0.057	0.229	0.559
c₁₃₀	1.000	1.000	1.000	0.368	1.000	0.057	0.148	0.057	0.057	0.558
c₂₂	0.452	1.000	0.368	1.000	0.896	0.229	0.592	0.229	0.229	0.526
c₁₄₅	0.753	0.875	1.000	0.607	0.455	0.057	0.167	0.057	0.229	0.488
c₉	0.368	1.000	0.537	1.000	1.000	0.057	0.167	0.229	0.229	0.483
c₄₇	0.844	0.641	0.827	0.368	1.000	0.050	0.167	0.057	0.050	0.472
c₁₃₄	0.490	0.875	0.368	1.000	1.000	0.000	0.167	0.229	0.229	0.456
c₈₇	1.000	0.368	1.000	0.223	0.472	0.000	0.167	0.050	0.229	0.432
c₁₃₃	0.368	0.875	0.208	1.000	1.000	0.000	0.167	0.229	0.229	0.416
c₁₂₃	1.000	0.368	0.082	0.607	1.000	0.057	0.167	0.229	0.057	0.400
c₁₅₀	0.368	0.407	0.368	1.000	1.000	0.057	0.167	0.057	0.229	0.384
c₅	0.368	1.000	0.208	0.287	0.36	0.229	0.592	0.229	0.229	0.378

A comparison between Tables 9 and 11 indicates that a high number of matched attributes does not guarantee high similarity. For instance, case c₇ has the highest number of matched attributes, but its similarity is not the best; the same holds for cases c₆ and c₈. Conversely, cases c₂₅, c₂₃, and c₂₄ have relatively high similarity although their numbers of matched attributes are not very high. Therefore, the attribute-matching strategy can effectively prevent cases with high similarity from being missed.

Estimation of the adaptability for cases is based on the information content of each attribute, which is determined by the design range and system range. The design range of each attribute is obtained from the requirement analysis, as explained in Table 5, and the range values of numerical attributes need to be transformed into the standardized triangular fuzzy number. The system range of each attribute reveals the maximum range for adaptation of the series product cases under the existing technical conditions. For example, the ultimate pressure of the SK series vacuum pump is generally at the range 6000–8000 Pa, but it can reach 3000 Pa or lower by installing a fore-stage nozzle. Here, it takes case c₂₅ as the example to describe the estimation of the adaptability for each attribute.

In Table 12, the normalized attribute values of design range and system range of the case c₂₅ are provided. For attributes A₁(rate of suction and exhaust), A₂(ultimate pressure), A₃(rated power), they follow the fuzzy triangular distribution; A₂ belongs to the Cost type attribute, and the system range is a crisp value, thus the system area of A₂ is constructed based on the principle in Figure 9, here with the Cost type attribute. In Figure 14, the calculation models of information content I_Ai with respect to A₁, A₂, A₃ of case c₂₅ are illustrated, respectively.

I_A ₁ = log₂(1/ (A_CR /A_STFN(SR) )) = log₂(1/0.28) = 1.836;

I_A ₂ = log₂(1/ (A_CR /A_STFN(SR) )) = log₂(1/0.747) = 0.421;

I_A ₃ = log₂(1/ (A_CR /A_STFN(SR) )) = log₂(1/0.679) = 0.558.

Table 12.

Design range of the requirements and system range of c₂₅.

A_i	Design range	System range
A ₁	0.15, 0.188, 0.225	0.081, 0.163, 0.20
A ₂	0.15, 0.25, 0.35	0.165, 0.165, 0.165
A ₃	0.138, 0.188, 0.231	0.094, 0.188, 0.231
A ₄	0.30, 0.45	0.375, 0.75
A ₅	0.035, 0.116	0.023, 0.067
A ₆	0.75, 1.0, 1.0	0.65, 0.75, 0.85
A ₇	0.0, 0.0, 0.30	0.10, 0.25, 0.40
A ₈	0.75, 1.0, 1.0	0.60, 0.70, 0.80
A ₉	0.75, 1.0, 1.0	0.65, 0.75, 0.85

Figure 14.

Information content estimation for attributes A₁-A₃: (a) information content I_A₁, (b) information content I_A₂, and (c) Information content I_A₃.

Attributes A₄ (noise) and A₅ (cost) follow the uniform distribution; the calculation models of the information content for A₄ and A₅ are illustrated in Figure 15.

I_A ₄ = log₂(1/ (L_CR/L_SR )) = log₂(1/0.20) = 2.322;

I_A ₅ = log₂(1/ (L_CR/L_SR )) = log₂(1/0.727) = 0.460.

Figure 15.

Information content estimation for attributes A₄ and A₅: (a) information content I_A₄, and (b) information content I_A₅.

For nominal attributes A₆-A₉, the linguistic term of each attribute value is transformed into the standardized triangular fuzzy number, and the calculation models of the information content for A₆-A₉ are illustrated in Figure 16.

I_A ₆ = log₂(1/ (A_CR/A_STFN(SR) )) = log₂(1/0.143) = 2.806;

I_A ₇ = log₂(1/ (A_CR/A_STFN(SR) )) = log₂(1/0.296) = 1.756;

I_A ₈ = log₂(1/ (A_CR /A_STFN(SR) )) = log₂(1/0.036) = 4.796;

I_A ₉ = log₂(1/ (A_CR/A_STFN(SR) )) = log₂(1/0.143) = 2.806.

Figure 16.

Information content estimation for attributes A₆-A₉: (a) information content I_A₆, (b) information content I_A₇, (c) information content I_A₈, and (d) information content I_A₉.

Then the information content of case c₂₅ is obtained:

$\begin{matrix} I_{c_{25}} = (ω_{A_{1}}, ω_{A_{2}}, ω_{A_{3}}, ω_{A_{4}}, ω_{A_{5}}, ω_{A_{6}}, ω_{A_{7}}, ω_{A_{8}}, ω_{A_{9}}) \\ \cdot (I_{A_{1}}, I_{A_{2}}, I_{A_{3}}, I_{A_{4}}, I_{A_{5}}, I_{A_{6}}, I_{A_{7}}, I_{A_{8}}, I_{A_{9}})^{T} \\ = (0.149, 0.106, 0.136, 0.080, 0.105, 0.157, 0.087, 0.082, 0.098) \\ \cdot (1.836, 0.421, 0.558, 2.322, 0.460, 2.806, 1.756, 4.796, 2.806)^{T} \\ = 1.890. \end{matrix}$
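The weighted sum above can be verified with a few lines of Python. The function name is hypothetical, and the handling of infinite attribute information content (the α entries of Table 13) is an assumption of the sketch: any attribute with positive weight and infinite information content makes the case information content infinite.

```python
import math


def case_information_content(weights, contents):
    """Weighted sum of attribute information contents for one case."""
    if len(weights) != len(contents):
        raise ValueError("one weight is required per attribute")
    total = 0.0
    for w, i in zip(weights, contents):
        if math.isinf(i):
            if w > 0:
                return math.inf    # one unsatisfiable attribute dominates
            continue
        total += w * i
    return total


weights = [0.149, 0.106, 0.136, 0.080, 0.105, 0.157, 0.087, 0.082, 0.098]
c25 = [1.836, 0.421, 0.558, 2.322, 0.460, 2.806, 1.756, 4.796, 2.806]
print(round(case_information_content(weights, c25), 3))
```

The result agrees with the value 1.890 stated for case c₂₅ to within rounding.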

The information content of the remaining cases in S₀ also can be calculated with the same method, the result is shown in Table 13. In addition, in order to consider the comprehensive influence of similarity and adaptability of the product cases, the comparison analysis histogram of the similarity and information content of each case in the index set S₀ is generated, as illustrated in Figure 17.

Table 13.

The result of information content estimation for cases in S₀.

C_i	I_A ₁	I_A ₂	I_A ₃	I_A ₄	I_A ₅	I_A ₆	I_A ₇	I_A ₈	I_A ₉	I _ci
c₂₅	1.836	0.421	0.558	2.322	0.460	2.806	1.756	4.796	2.806	1.890
c₂₃	2.988	0.421	1.910	1.114	0.460	2.806	1.756	2.806	2.806	1.986
c₃₉	1.966	0.421	0.558	0.737	0.615	2.806	3.000	5.573	2.806	1.970
c₄₄	1.966	0.421	0.558	0.737	0.474	2.806	3.000	5.573	2.806	1.956
c₂₄	2.988	0.421	0.558	1.806	0.460	2.806	1.756	4.796	2.806	2.020
c₇	1.221	2.556	0.558	1.586	0.480	2.806	3.000	2.806	2.806	1.913
c₄₃	3.059	0.421	1.910	1.224	0.571	2.806	3.000	5.573	2.806	2.352
c₃₈	3.059	0.421	1.910	1.000	0.648	2.806	3.000	5.573	2.806	2.342
c₈	3.023	2.556	0.548	1.224	0.460	2.806	3.000	2.806	2.806	2.149
c₁₁₄	2.857	1.977	0.910	0.737	α	0.806	3.000	4.796	2.806	α
c₆	3.265	2.556	1.910	1.224	0.927	2.806	3.000	2.806	2.806	2.419
c₁₈	3.023	0.421	1.514	1.737	0.460	4.158	3.000	2.806	2.806	2.307
c₁₇	3.265	0.421	0.910	1.586	0.808	4.158	3.000	2.806	2.806	2.286
c₁₄₄	1.224	0.816	1.910	1.486	1.552	4.796	3.000	4.796	2.806	2.493
c₁₃₀	1.214	0.396	0.910	3.699	1.351	4.796	3.756	4.796	4.796	2.728
c₂₂	3.427	0.421	2.434	1.415	0.619	2.806	1.756	2.806	2.806	2.163
c₁₄₅	3.023	0.816	0.558	1.486	2.523	4.796	3.000	4.796	2.806	2.679
c₉	3.857	2.556	3.878	1.224	0.514	4.796	3.000	2.806	2.806	3.044
c₄₇	2.699	α	3.718	2.322	0.876	5.573	3.000	4.796	5.573	α
c₁₃₄	3.251	0.816	2.434	1.221	0.533	α	3.000	2.806	2.806	α
c₈₇	0.941	1.977	0.558	3.171	3.699	α	3.000	5.573	2.806	α
c₁₃₃	3.836	0.816	3.012	1.224	0.514	α	3.000	2.806	2.806	α
c₁₂₃	0.941	1.977	α	2.322	0.456	4.796	3.000	2.806	4.796	α
c₁₅₀	3.836	1.862	1.565	1.000	0.626	4.796	3.000	4.796	2.806	2.810
c₅	3.836	2.556	3.012	1.114	1.806	2.806	1.756	2.806	2.806	2.629

Note: α denotes that the corresponding information content approaches infinity.

Figure 17.

Comparison analysis of the similarity and information content for cases in S₀.

Figure 17 indicates that cases c₂₅, c₂₃, c₃₉, and c₄₄ have high similarity and small information content. For case c₇, although the similarity is not high, the information content is small. These five cases are therefore taken as the similar-case knowledge for reuse. For cases c₄₃, c₃₈, and c₈, the similarity is not low, but the information content is relatively large. The information content of cases c₁₁₄, c₄₇, c₁₃₄, c₈₇, c₁₃₃, and c₁₂₃ approaches infinity, which indicates that one or more requirement attributes cannot be satisfied by adaptation under the current technical conditions.
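
The role of the α entries can be illustrated with a short Python sketch. It assumes that α is represented as `math.inf`, uses a subset of the totals from Table 13, and ranks cases by information content only; the similarity values shown in Figure 17 are not reproduced here, so this is not the full selection procedure.

```python
import math

ALPHA = math.inf  # the table's α: information content approaching infinity

# One infinite attribute makes the weighted total infinite (row of c114 from Table 13).
weights = [0.149, 0.106, 0.136, 0.080, 0.105, 0.157, 0.087, 0.082, 0.098]
row_c114 = [2.857, 1.977, 0.910, 0.737, ALPHA, 0.806, 3.000, 4.796, 2.806]
total_c114 = sum(w * i for w, i in zip(weights, row_c114))

# Case totals I_ci copied from Table 13 (a subset, for brevity).
total_info = {
    "c25": 1.890, "c23": 1.986, "c39": 1.970, "c44": 1.956, "c7": 1.913,
    "c43": 2.352, "c114": total_c114, "c47": ALPHA,
}

# Cases with infinite information content cannot be adapted; set them aside.
adaptable = {c: i for c, i in total_info.items() if math.isfinite(i)}
not_adaptable = sorted(c for c, i in total_info.items() if not math.isfinite(i))

# Smaller information content means better adaptability, so sort ascending.
ranking = sorted(adaptable, key=adaptable.get)
print(ranking)
print(not_adaptable)
```

In this subset, c₂₅ has the smallest information content and c₄₃ the largest finite one, while c₁₁₄ and c₄₇ are excluded.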

Meanwhile, the local attribute similarity and information content can guide the designer in making targeted improvements to the candidate cases, and can also indicate, to some extent, the direction of technology updates. For instance, the attributes A₄ (noise) and A₈ (maintenance convenience) of c₂₅ cannot be completely satisfied, and the information content of A₄ and A₈ is relatively high. The designer may adopt a new scheme to enhance the local similarity rather than adapt the original structure, for example by installing a noise reduction device, or by replacing the mechanical sealing component with a simple and effective packing seal to reduce the failure rate and facilitate maintenance. Case c₁₁₄ is a screw vacuum pump with good environmental performance and high work efficiency, but its cost is far beyond the cost constraint; reducing the cost with new technologies is therefore a key issue. The information content of attribute A₆ (low-carbon and environmental impact) of cases c₈₇, c₁₃₃, and c₁₃₄ approaches infinity, which reveals that the traditional technology can no longer meet the customer’s low-carbon and environmental needs; green manufacturing technologies are urgently needed to update these product cases to meet market demand and government environmental regulations.

Discussion

In this paper, an effective similarity determination model is proposed to support low-carbon product design. It integrates the representation of fuzzy requirement information, an efficient indexing approach, and an evaluation method for similarity and adaptability into a holistic similar-case retrieval methodology. A case study on similar-case retrieval for vacuum pump low-carbon design is conducted to verify the applicability of the proposed methodology. The theoretical and practical implications of our work are discussed below.

In the early design stage, designers can generally specify accurate values for the requirement attributes they are very familiar with. However, taking the low-carbon requirements into account requires reconfiguring the corresponding quality characteristics, which can lead to complex design conflicts because of the coupling between quality characteristics: when one design factor related to the low-carbon requirements is improved, a related factor deteriorates. Consequently, even experienced domain experts sometimes cannot address complex low-carbon design problems precisely with incomplete design information. They can, however, roughly estimate the possible range of the design attribute values, and the tolerable differences between the attribute values of the query case and those of the candidate cases, so that the designated parameter ranges still satisfy the design purpose. Therefore, for the fuzzy requirement attributes, this study adopts fuzzy set theory to represent the incomplete and uncertain design information, and defines a feasible interval and a desired interval to give designers flexibility in deciding on the design parameters.

The improved D-HS method assigns the indexing coordinates of the query case from the recommended attribute values, rather than using the design range of each attribute to construct the index set, for the following reason. If the design ranges of the attributes were used to search for similar cases, every case in the memory would have to be checked against the index set each time a new requirement arrives, and the division into intervals would lose its purpose. Conversely, with the recommended values, the coordinates of the query case can be designated, and the cases having the same coordinates as the query case are extracted to build the index set. In addition, when the coordinates of the current query case are the same as those of a previous one, the similar cases in the index set saved in the database can be reused immediately.
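
The coordinate-based lookup described above can be sketched as follows. This is a simplified illustration rather than the D-HS implementation itself: the two-attribute case base, the cut points, and the names `coordinates`, `build_index`, and `retrieve` are all hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict


def coordinates(values, boundaries):
    """Map attribute values to interval indices, one index per attribute.

    boundaries[i] holds the inner cut points of attribute i in ascending order.
    """
    return tuple(bisect_right(cuts, v) for v, cuts in zip(values, boundaries))


def build_index(cases, boundaries):
    """Group case ids by their coordinate tuple (done once, not per query)."""
    index = defaultdict(list)
    for case_id, values in cases.items():
        index[coordinates(values, boundaries)].append(case_id)
    return index


# Hypothetical case base and cut points (illustrative values only).
boundaries = [[10.0, 20.0], [50.0]]
cases = {"a": (5.0, 40.0), "b": (12.0, 60.0), "c": (15.0, 70.0), "d": (25.0, 45.0)}
index = build_index(cases, boundaries)

_cache = {}  # index sets already retrieved, keyed by query coordinates


def retrieve(recommended_values):
    """Return the cases sharing the query's coordinates, reusing earlier results."""
    key = coordinates(recommended_values, boundaries)
    if key not in _cache:
        _cache[key] = list(index.get(key, []))
    return _cache[key]


print(retrieve((14.0, 65.0)))  # cases in the same cell as the query
```

Because the query is reduced to a single coordinate tuple, the lookup does not scan the case memory, and a second query that lands in the same cell is served from the cache.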

In our study, we propose the strategy of keeping the number of cases approximately equal across the intervals of each attribute, so that the cases are evenly distributed. In the case study, the interval number L_i of the i-th attribute is predefined; however, determining L_i is actually an iterative process that must satisfy the criterion in equation (8). Besides, for two neighboring clusters of cases, if the average values of the i-th attribute of the two clusters are quite different, it is necessary to construct the attribute intervals for the two clusters separately and to re-designate L_i, even though the condition of uniform distribution is satisfied only by the combined number of cases in the two clusters. The strategy of similarity matching for attributes proposed in the paper provides a supplementary index set S_b by defining the required number of matched attributes, namely the threshold Const, which effectively avoids the omission of similar cases. For example, in the case study, cases c₂₅, c₂₄, and c₂₃ are included in the initial index set S₀ although their numbers of matched attributes are not high. They can be seen as edge cases, as illustrated in Figure 3(b): although they do not fall within the same intervals as the query case, they have high similarity, and they are not neglected.
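
Both strategies can be illustrated with a rough Python sketch. It rests on two assumptions that are not taken from the paper: cut points are placed midway between neighboring sorted values, and an attribute counts as "matched" when the case falls in the same interval as the query. The function names and data are hypothetical, and the iterative choice of L_i under equation (8) is not modeled.

```python
def equal_frequency_cuts(values, n_intervals):
    """Cut points that place roughly the same number of cases in each interval."""
    ordered = sorted(values)
    n = len(ordered)
    if not 1 <= n_intervals <= n:
        raise ValueError("need 1 <= n_intervals <= number of cases")
    cuts = []
    for k in range(1, n_intervals):
        pos = k * n // n_intervals
        # Assumption: put the cut midway between the two neighboring cases.
        cuts.append((ordered[pos - 1] + ordered[pos]) / 2)
    return cuts


def matched_count(query_coords, case_coords):
    """Number of attributes whose interval index equals that of the query."""
    return sum(q == c for q, c in zip(query_coords, case_coords))


def supplementary_index_set(query_coords, coords_by_case, const):
    """Cases matching at least `const` attributes but not all of them.

    Exact matches are left out because they already belong to the primary index set.
    """
    full = len(query_coords)
    return [case_id for case_id, coords in coords_by_case.items()
            if const <= matched_count(query_coords, coords) < full]


cuts = equal_frequency_cuts([1, 2, 3, 4, 5, 6, 7, 8, 9], 3)
coords_by_case = {"x": (1, 1, 0), "y": (1, 0, 0), "z": (1, 1, 1)}
s_b = supplementary_index_set((1, 1, 1), coords_by_case, const=2)
print(cuts, s_b)
```

Here the nine values are split three per interval, and case "x", which misses the query cell in only one attribute, is recovered as an edge case. Repeated attribute values can produce duplicate cut points, which a fuller implementation would need to handle.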

Effective estimation models for the similarity and the adaptability of the cases are established separately in our study. For precise numerical attributes, the local similarity is calculated with the traditional distance measurement. For fuzzy numerical attributes, the local similarity evaluation model is constructed with the dependent function, as it can better distinguish the degree to which the attribute values of the candidate case meet the design requirements. That is, when the value of the dependent function of the attribute is equal to or greater than one, the attribute value completely meets the demand; when the value is between zero and one, the attribute value is acceptable; and when the value is less than zero, the attribute value cannot satisfy the demand. For nominal attributes, the traditional method is a binary estimation model with values 0 and 1. For instance, suppose the query case requires the linguistic term “good” for the attribute low-carbon and environmental impact (A₆). If the linguistic term of A₆ of the candidate case is “good”, the similarity is 1; otherwise it is 0. The binary model therefore cannot distinguish between the candidate values “very good”, “fair”, “poor”, and “very poor”. In our study, the linguistic terms of the nominal attribute are transformed into standardized triangular fuzzy numbers, and the similarity estimation model is constructed based on the ratio of the area of the common range to the area of the design range.
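
The area-ratio idea for nominal attributes can be sketched numerically. The five triangular fuzzy numbers in `TERMS` are an assumed, evenly spaced scale on [0, 1] and are not the standardized values used in the paper. The design range is taken here to be the triangle of the required term, which is also an assumption.

```python
def triangular(x, a, b, c):
    """Membership of x in the triangular fuzzy number (a, b, c), with a <= b <= c."""
    if x < a or x > c:
        return 0.0
    if x == b:
        return 1.0
    if x < b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def area(f, lo=0.0, hi=1.0, steps=2000):
    """Midpoint-rule approximation of the area under f on [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (k + 0.5) * h) for k in range(steps))


def nominal_similarity(design, candidate):
    """Common area of the two membership functions divided by the design area."""
    common = area(lambda x: min(triangular(x, *design), triangular(x, *candidate)))
    return common / area(lambda x: triangular(x, *design))


# Hypothetical linguistic scale (illustrative values only).
TERMS = {
    "very poor": (0.0, 0.0, 0.25),
    "poor": (0.0, 0.25, 0.5),
    "fair": (0.25, 0.5, 0.75),
    "good": (0.5, 0.75, 1.0),
    "very good": (0.75, 1.0, 1.0),
}

for term in TERMS:
    print(term, round(nominal_similarity(TERMS["good"], TERMS[term]), 3))
```

With this assumed scale, a required “good” scores 1 against “good”, about 0.25 against the neighboring terms “fair” and “very good”, and 0 against “poor” and “very poor”, so the graded model separates values that the binary model lumps together.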

The estimation model for the adaptability of the candidate case is based on the information content defined in the Information Axiom. For a given attribute of the case, the smaller the information content, the better the adaptability, because the system range of this attribute overlaps the design range to a high degree, which indicates a high probability of meeting the design requirements through modification. The information content of the case is the weighted sum of the information content of its attributes. To estimate the information content of each attribute, it is important to identify its system range, which depends on the existing manufacturing technologies, design specifications, related laws and regulations, etc.; it is therefore hard to delineate a clear system range of the attribute with incomplete design information. In this study, for a numerical attribute, the system range is defined by the upper and lower boundary values that the attribute can reach under the current technical conditions of the series of product cases. For example, the rate of suction and exhaust of the SK series vacuum pump can be raised by increasing the rated power or the number of impellers, and the resulting maximum and minimum values of the rate of suction and exhaust are used to construct the system range. For a nominal attribute, the system range is defined by evaluating the corresponding sub-factors. For instance, to define the system range of the attribute A₆, its sub-factors need to be evaluated, such as the life cycle carbon footprint of the product, assembly and disassembly for component reuse, and the material recycling and disposal methods. The linguistic term of the attribute is finally obtained by scoring each sub-factor, and the system range of the attribute is then specified by comparing the series of product cases.
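
For a crisp numerical attribute, the link between range overlap and information content can be sketched as below. The sketch assumes the common textbook form of the Information Axiom with a uniform distribution over the system range, I = log2(system range / common range). The fuzzy formulation used for the case study may differ, and the intervals in the example are hypothetical.

```python
import math


def information_content(design_range, system_range):
    """Information content of one attribute under a uniform system distribution.

    Both ranges are (lower, upper) tuples; the common range is their overlap.
    """
    d_lo, d_hi = design_range
    s_lo, s_hi = system_range
    common = min(d_hi, s_hi) - max(d_lo, s_lo)
    if common <= 0:
        # No overlap: the requirement cannot be met, matching α in Table 13.
        return math.inf
    return math.log2((s_hi - s_lo) / common)


print(information_content((0.0, 1.0), (0.0, 8.0)))      # overlap is 1/8 of the system range
print(information_content((0.0, 10.0), (2.0, 6.0)))     # system range lies inside the design range
print(information_content((10.0, 20.0), (30.0, 40.0)))  # disjoint ranges
```

A system range that lies entirely inside the design range gives zero information content, and the value grows as the overlap shrinks, which is why a small information content is read as good adaptability.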

Conclusion and further research

Low-carbon product design takes the low-carbon requirements into account at the preliminary design stage in order to reduce carbon emissions throughout the product life cycle. However, because of the coupling between quality characteristics, incorporating the low-carbon indicators inevitably causes design conflicts, which require multiple backtracking design processes to coordinate. Exploiting prior design knowledge and experience to solve similar design problems is therefore a good strategy. In our study, we propose an effective similarity determination method to obtain acceptable case knowledge for reuse. Similarity here refers not only to the similarity between the attribute values of the query case and those of the candidate case, but also to the similarity between the system range of the candidate case and the design range of the query case.

The incomplete and uncertain requirement information is represented based on fuzzy set theory. The efficient D-HS indexing approach is improved with two strategies: keeping an approximately equal number of cases in each interval when dividing the intervals, and similarity matching for attributes. The improved D-HS method can effectively guarantee a basically even distribution of cases across the attribute intervals, and avoids the omission of similar cases when the query case lies close to the boundaries of the design ranges. For similarity estimation, the classical distance measurement and the dependent function are used to calculate the local similarity for the precise numerical attribute and the fuzzy numerical attribute, respectively; for the fuzzy nominal attribute, the similarity is based on the ratio of the area of the common range to the area of the design range. For adaptability estimation, the information content defined in the Information Axiom is used to construct the evaluation model for the adaptability of each attribute: the smaller the total information content, the better the adaptability of the candidate case. Thus, cases with high similarity and small information content can be considered acceptable cases for knowledge reuse and adaptation.

In the proposed methodology, the information content of each attribute depends on the corresponding design range and system range. In further research, sensitivity analysis and Monte Carlo analysis should be used to determine more reasonably the boundaries of the design range and system range of each attribute, as well as the distribution of its fuzzy membership function. In addition, methods for solving contradictory problems, such as TRIZ and Extenics, should be adopted to address the design conflicts that arise during the adaptation of similar cases, which would facilitate the generation of innovative low-carbon design schemes.

Footnotes

Handling Editor: James Baldwin

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: this research work was supported by the National Natural Science Foundation of China (Grant No. 51875524 and No. 61873240) and the Zhejiang Province Postdoctoral Research Foundation (Grant No. zj2019130).

ORCID iDs

Shedong Ren

Fangzhi Gui

Yanwei Zhao

Min Zhan

Wanliang Wang
