Full Analysis of Lung Cancer Mortality/Radon Relationship with Simple Nonlinear Concepts

Abstract

We analyze the relationship between the lung cancer mortality and the indoor radon intensity from the viewpoint of nonlinear mathematics. We conclude that their relationship is governed by the proportionality law where the cumulative lung cancer mortality Y is negatively proportional to the cumulative radon intensity X; or specifically, the nonlinear change of nonlinear face value (qYu – qY) is negatively proportional to the nonlinear change of nonlinear face value (X – Xb).

The author obtained from the late Professor Cohen a set of data on lung-cancer mortality rate versus indoor radon level, collected from 1,597 counties and territories of the USA. We first present the data as various primitive elementary graphs, then extend them to primary graphs, leading graphs, and proportionality graphs. The article emphasizes building a straight-line proportionality relationship for the dose-response data in log-linear and/or log-log graphs. It demonstrates a straightforward methodology for solving the key upper asymptote (Yu) of the proportionality equation in Microsoft Excel by maximizing the coefficient of determination. (Note: q = log, Yu = upper asymptote of Y, Xb = bottom asymptote of X)

Keywords

nonlinear concepts; upper asymptote Yu; Alpha Beta (αβ) Math; Excel spreadsheet; coefficient of determination; regression equation; proportionality (or rate) constant K; position (or integral) constant C

(Symbols: θ = 10; q = log; αβ (extension of XY); ϕ = (0) (nonlinear zero); y = elementary numbers y or equation y; Y = cumulative numbers Y (i.e., cumulative of y).)

Introduction

In the past, researchers in the life and biomedical sciences did not have reliable nonlinear mathematical concepts for comprehensive understanding and in-depth analysis of experimental data, resulting in inconsistent data presentations and misinterpretations.^1,2 They did not distinguish linear numbers from nonlinear numbers. Typically, dose-response and pharmacokinetics analysts overused and abused the first-order kinetic equation (as an exponential equation) and ignored the need to unlock the nonlinear nature of the experimental data. They also made fundamental mistakes in omitting or disregarding the necessary “origin” or “starting zeros”, including the nonlinear zero. Most analyses wrongly rely on primitive elementary line charts rather than on reliable xy scatter charts as solid primary graphs. When comparing 2 variables, they tend to use statistical manipulation and curve fitting with polynomial equations to relate the 2 variables.^1,3,4 This article shows that when comparing 2 variables mathematically, we need to compare both as continuous cumulative numbers. That is, we can compare 2 variables graphically with a primary graph using the cumulative numbers, and mathematically with proportionality equations. The cumulative primary graphs, not the primitive elementary graphs, should be the foundation for all nonlinear data analyses.

We introduce a new extended XY math concept, named Alpha Beta (αβ) math, for graphical expression of experimental data, along with simple mathematical equations having meaningful equation parameters.^5-10

The Alpha Beta (αβ) math is an extension of the XY math; it emphasizes the association between the continuous nonlinear XY numbers and their asymptotes, which the XY math does not address. Their difference is XY = {(X), (Y)} and αβ = {α(Y, Yu, Yb), β(X, Xu, Xb)}, where Yu, Yb, Xu, and Xb are the upper and bottom asymptotes of the Y and X variables.

Many researchers are new to the Alpha Beta (αβ) math and need to become familiar with several terms to read this article; thus, we give the following 6 definitions (A to F) and explain the nonlinear concepts in the following 5 subsections.

Definitions: (A). Primitive elementary graphs are plots of the vertical elementary y versus the horizontal X, either as a column graph or as a line chart. (B). Primary graphs are plots of cumulative Y versus cumulative X. (C). Leading graphs are graphs having a parabolic curve with a continuously changing slope. (D). Proportionality graphs are graphs with a straight line expressible as a proportionality equation. (E). Nonlinear face values are the measurements of variables relative to their asymptotes. (F). For “nonlinear change” or “change” in reading proportionality graphs, see Appendix B.

Subsections: (1). We need to understand what continuous numbers are. Continuous numbers are non-terminating numbers with continuity. (2). There are 2 types of continuous numbers: linear and nonlinear continuous numbers. We define the linear continuous numbers as numbers with equal spacing and a linear zero that we can touch and cross over, such as …, -4, -2, 0, 2, 4, 6, 8, 10…, where the spacing between the numbers is 2 and the 0 can be touched and crossed over between the negative numbers and the positive numbers. We define the nonlinear numbers as numbers with non-equal spacing that are associated with 1 or 2 asymptotes, such as …, 0.3, 0.33, 0.333, 0.3333…, and …, 0.9, 0.99, 0.999, 0.9999….

(3). The traditional XY math is wrong to write 1/3 = 0.33333… or 1 = 0.9999…, because 0.33333… is dynamic, moving forever, while 1/3 is static and is an asymptote of the nonlinear numbers. A static quantity cannot be equated to a dynamic one; we cannot violate Newton’s Law. The use of the equal sign “=” is one of the basic flaws of the traditional XY math. The asymptote 1/3 is never a part of the nonlinear numbers 0.33333…. If we like, we can write 1/3 ∼ 0.33333… or 1/3 → 0.33333… to relate the asymptote and the nonlinear numbers, but not with an equal sign “=”. Likewise, 0.9999… is dynamic, moving forever, and 1 is static; we cannot equate a dynamic quantity to a static one. For thousands of years, people never bothered to understand the relationship between continuous nonlinear numbers and their asymptotes. Readers can find more examples in the reference.⁹ Here, let us use the nonlinear numbers Y with one upper asymptote Yu to illustrate some of the terminology and graphing used in this article.

The nonlinear numbers 0.9, 0.99, 0.999, 0.9999… are 1-sided nonlinear numbers with an upper asymptote Yu, Yu = 1. We will compare the change of these nonlinear numbers with the change of the universal linear numbers U_l (U_l = 1, 2, 3, 4, 5, 6, 7…) and check whether the number 1 is the unique asymptote of the nonlinear numbers. Before the comparison, however, let us explain the universal linear numbers and the universal nonlinear numbers. The universal linear numbers U_l are trivial and familiar to everyone. For the universal nonlinear numbers, we use the symbol U_n. The most important nonlinear numbers are …10^-3, 10^-2, 10^-1, 10⁰, 10¹, 10², 10³, 10⁴…. This U_n has a nonlinear zero as its bottom asymptote. No matter how large the negative power of 10, e.g. 10^-100, 10^-10000, or 10^-1000000, these numbers have continuity (Axiom I) and approach a nonlinear zero, which can be approached but cannot be touched (Axiom II). Since the number 10 is extremely useful and will be used extensively, we introduce the symbol θ to represent 10, i.e. θ = 10. Accordingly, the universal nonlinear numbers U_n = …10^-3, 10^-2, 10^-1, 10⁰, 10¹, 10², 10³, 10⁴… are also written as U_n = …θ^-3, θ^-2, θ^-1, θ⁰, θ¹, θ², θ³, θ⁴…, or $U_{n} = θ^{U_{l}}$. That is, the universal nonlinear numbers are θ raised to the power of the universal linear numbers. The universal nonlinear numbers have the characteristic continuity of the universal linear numbers.

In Table 1, let us input the universal linear numbers in Column A as X and the nonlinear numbers in Column B as Y, as shown in the Microsoft Excel screen. In the screen, we reserve Cell E1 for inputting an active upper asymptote Yu. We need this asymptote for calculating the nonlinear face value (Yu – Y) in Column C. By plotting Column A versus Column B, we obtain Figure 1A for Y versus X in a linear by linear scale, where the data line approaches an upper asymptote Yu. Figure 1A is a primary graph, because we are plotting cumulative X versus cumulative Y. In this example, we can visualize that the distance from the asymptote Yu to Y, (Yu – Y), is negatively proportional to the linear distance of X, as shown by the solid arrow and the dashed arrow. The larger the solid arrow, the smaller the dashed arrow becomes, and vice versa. In other words, the nonlinear change of the nonlinear face value (Yu – Y) is negatively proportional to the linear change of the linear numbers X.

Table 1.

Nonlinear numbers 0.9, 0.99, 0.999, 0.9999….

E1	fx
	A	B	C	D	E
1	X	Y	(Yu – Y)	Yu =
2	1	0.9		Yu =
3	2	0.99
4	3	0.999
5	4	0.9999
6	5	0.99999
7	6	0.999999
8	7	0.9999999
9	8	0.99999999

Figure 1.

(A). Primary graph, also a leading graph. Y vs. X in linear scale. (B). Pre-proportionality graph. (Yu − Y) vs. X in linear scale. (C). Proportionality graph. q(Yu − Y) vs. X in log-linear scale. (C-1). Proportionality graph. q(Yu − Y) vs. X in log-linear scale. Yu = 1.0000001. (C-2). Proportionality graph. q(Yu − Y) vs. X in log-linear scale. Yu = 1.00001. (C-3). Proportionality graph. q(Yu − Y) vs. X in log-linear scale. Yu = 1.000000001.

Next, in Table 2, let us input “1” into Cell E1, followed by calculating (Yu – Y) in Column C. In Cell C2, we input “=$E$1 – B2”, then copy Cell C2 to Cell C3 through Cell C10, as shown in Table 2. By plotting Column A versus Column C for (Yu – Y) versus X, we obtain Figure 1B in linear by linear scale; this is a pre-proportionality graph; it is also a transitional graph, because we will convert the vertical axis into final nonlinear logarithmic scale in the next step.

Table 2.

Nonlinear numbers 0.9, 0.99, 0.999, 0.9999…(cont.)

C2	fx	=$E$1 – B2
	A	B	C	D	E
1	X	Y	(Yu – Y)	Yu =	1
2	1	0.9	0.1	Yu = 1
3	2	0.99	0.01
4	3	0.999	0.001
5	4	0.9999	1E-04
6	5	0.99999	1E-05
7	6	0.999999	1E-06
8	7	0.9999999	1E-07
9	8	0.99999999	1E-08

We copy Figure 1B into Figure 1C and convert the vertical axis into logarithmic scale. We then right-click on the data series and select “Add Trendline”, select “Exponential” from the Trendline Options, select “Display Equation on chart” and “Display R-squared value on chart”, and click Close. We obtain Figure 1C. The coefficient of determination is R² = 1, indicating that Yu = 1 is the perfect choice for the upper asymptote. How do we know this is the perfect asymptote? Let us try some other numbers.

Let us pick a number slightly larger than 1, say, 1.0000001, and input 1.0000001 into Cell E1, as shown in Table 3. Figure 1C turns into Figure 1C-1, where the data line strays from the straight line and R² is reduced to 0.9603. When we change the number to 1.00001 and input 1.00001 into Cell E1, we obtain Figure 1C-2 and R² is reduced to 0.8244. We can improve R² by increasing the number of zeros after the decimal point, such as 1.000000001. In doing this, we get Figure 1C-3 and R² improves to 0.9991; it is still less than 1. The overall trend is that R² approaches 1 only as the trial number approaches 1, confirming 1 as the upper asymptote.

Table 3.

Nonlinear numbers 0.9, 0.99, 0.999, 0.9999…(cont.)

C2	fx	=$E$1 – B2
	A	B	C	D	E
1	X	Y	(Yu – Y)	Yu =	1.0000001
2	1	0.9	0.1000001	Yu =1.0000001
3	2	0.99	0.0100001
4	3	0.999	0.0010001
5	4	0.9999	0.0001001
6	5	0.99999	1.01E-05
7	6	0.999999	1.1E-06
8	7	0.9999999	2E-07
9	8	0.99999999	1.1E-07
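
The trial of candidate asymptotes in Tables 1 to 3 can also be sketched outside Excel. The following Python fragment is only an illustration under stated assumptions: the helper names `r_squared` and `r2_for_asymptote` are hypothetical, and the squared Pearson correlation is used as the coefficient of determination, as Excel's RSQ does. The printed values depend on how many rows are included and on rounding, so they need not reproduce the R² figures quoted above; only the ranking of the candidates is the point.

```python
import math

def r_squared(xs, ys):
    """Squared Pearson correlation between xs and ys (hypothetical helper)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy * sxy / (sxx * syy)

# Columns A and B of Table 1
X = [1, 2, 3, 4, 5, 6, 7, 8]
Y = [0.9, 0.99, 0.999, 0.9999, 0.99999, 0.999999, 0.9999999, 0.99999999]

def r2_for_asymptote(yu):
    """R^2 of log10(Yu - Y) against X for one trial upper asymptote Yu."""
    q_face = [math.log10(yu - y) for y in Y]  # q(Yu - Y), Column C on a log scale
    return r_squared(X, q_face)

# Try Yu = 1 and the three alternatives discussed in the text
for yu in (1.0, 1.000000001, 1.0000001, 1.00001):
    print(f"Yu = {yu:<12} R^2 = {r2_for_asymptote(yu):.4f}")
```

In this sketch the candidate Yu = 1 gives the straightest line, and R² falls as the trial value moves away from 1.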

In this example, we need 3 graphs to express the relationship between the nonlinear numbers Y and the linear numbers X. Figure 1A is a primary graph plotting the nonlinear numbers Y vs. X in a linear by linear scale. It is also a leading graph, because its parabolic curve leads us to visualize that the nonlinear face value (Yu – Y) (the solid double-arrow distance) is negatively proportional to the linear distance of X. In this example, we measure the nonlinear face value in a graph of linear by linear scale. In the main section of this article, we measure the nonlinear face value of the parabolic curve in a log-linear graph, and the nonlinear face value of a higher order of nonlinearity is (qYu – qY), shown as double solid arrows in Figure 2B. The parabolic line in Figure 2A has the nonlinear value Y and approaches its upper asymptote Yu. The parabolic line in Figure 2B has the nonlinear value qY and approaches its upper asymptote qYu.

Figure 2.

(A). Primary graph, with Y versus X in linear by linear scale. (B). Leading graph, with qY versus X in log by linear scale.

In Figure 1C, the regression equation from Excel is the exponential equation y = e^-2.303X, or (Yu – Y) = y = e^-2.303X. It is awkward to have an “e” in a log-linear graph. It should be the simple equation (Yu – Y) = y = θ^-X, or (Yu – Y) = y = 10^-X. The Excel trendline does not provide a 10-based regression equation for a straight line in a log-linear graph. Currently, it can only (awkwardly) provide the regression equation as an exponential equation. A straight line in a log-linear graph should involve 10 or θ, not “e”. The “e” is an irregular nonlinear number. The current exponential equation in a log-linear graph is one source of confusion. We urge Microsoft to provide, in the future, a regression equation for the 10-based straight line in a log-linear graph, as illustrated in this article. This would require only a few additional lines of code.
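
As a brief aside, the conversion between the e-based trendline equation and the θ-based form can be written out explicitly. This is only a sketch of the standard change of base, using ln θ = ln 10 ≈ 2.303; the symbol k is introduced here for the e-based exponent and is not part of the original notation.

```latex
e^{-kX} = \theta^{-(k/\ln\theta)\,X}, \qquad \ln\theta = \ln 10 \approx 2.303
\quad\Longrightarrow\quad
e^{-2.303X} \approx \theta^{-X} = 10^{-X}
```

Dividing the e-based exponent by 2.303 therefore gives the corresponding 10-based exponent.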

(4). There are more flaws in the XY math. Let us use the simplest forms of XY equation, Equation (1a) and Equation (1b) below, to discuss some issues.

$$Y = X \qquad \text{(Eq. 1a)}$$

$$Y = \frac{1}{X} \qquad \text{(Eq. 1b)}$$

The first issue here is that we use the same symbols Y and X in the 2 equations, yet the meanings of the 2 equations are dramatically different. In Eq. (1a), Y and X are linear numbers; yet, in Eq. (1b), Y and X are nonlinear numbers. It is wrong to use the same symbols to represent both linear and nonlinear numbers. In Eq. (1a), Y is proportional to X; their relationship is simple and straightforward, as shown in Figure 3(A). However, in the second equation, Eq. (1b), we have some issues. When we plot Eq. (1b) in a rectilinear graph, we obtain a curved line, as shown in Figure 3(B). A curved line means that Y, X, or both are nonlinear numbers. This does not mean there is no proportionality relationship between Y and X; in fact, when we convert both axes in Figure 3(B) from linear into nonlinear logarithmic scale, we obtain a straight line in Figure 3(C), indicating that the nonlinear numbers Y are proportional to the nonlinear numbers X. Here, α means we measure the nonlinear numbers Y relative to their bottom asymptote Yb, and β means we measure the nonlinear numbers X relative to their bottom asymptote Xb.

Figure 3.

(A). Graph of Y = X in linear-linear scale. (B). Graph of Y = 1/X in linear-linear scale. (C). Graph of qα = 1/qβ in log-log scale. q(Y − Yb) = −q(X − Xb), Yb = 0, Xb = 0. (B-1). Graph of Y = 1/X in linear-linear scale. (C-1). Graph of qα = 1/qβ in log-log scale.

Here is the second issue. In Equation (1b), when X is 0, what will be the Y? Teachers in traditional math class will teach students that at X = 0, the Y is undefined. A sound math should have everything defined. It is irresponsible to say something is undefined. Then, what shall we do? We need a new nonlinear math concept, the Alpha-Beta (αβ) Math concept, to give the right answer and right expression, such as using Figure 3(C).

The third issue is that when Equation (1a) and Equation (1b) represent 2 different phenomena, we need additional symbols to address the true nature of the equations, such as extending the XY symbols to (αβ) symbols for representing nonlinear numbers, as shown in Figure 3(C). As shown in Figure 3(B), the nonlinear numbers Y have the bottom asymptote Yb, equivalent to the x-axis, and the nonlinear numbers X have the bottom asymptote Xb, equivalent to the y-axis. Their nonlinear face values are α = (Y – Yb) and β = (X – Xb), and their true values are qα = q(Y – Yb) and qβ = q(X – Xb). Consequently, we can plot the face values on the nonlinear logarithmic scale to give a log-log graph, as shown in Figure 3(C), where we have a plot of qα vs. qβ, i.e. we have q(Y – Yb) = -q(X – Xb), or qY = -qX with Yb = ϕ and Xb = ϕ. Figure 3(C) is good for positive values of Y and X. When Y, X, or both assume negative numbers in Eq. (1b), we get curved lines in the second, third, and fourth quadrants of Figure 3(B), as shown in Figure 3(B-1). In traditional math, we cannot plot these curves in a log-log graph like Figure 3(C), because negative numbers cannot be plotted on a logarithmic scale. Then, what shall we do? Fortunately, the αβ math can come to the rescue.

When X assumes both positive and negative values, one curve exists in the first quadrant and the other in the third quadrant of a Cartesian graph, Figure 3(B-1). In the first quadrant, the nonlinear numbers Y have the bottom asymptote Yb, equivalent to the x-axis; however, the x-axis becomes the upper asymptote of Y in the third quadrant. Meanwhile, in the first quadrant, the nonlinear numbers X have the bottom asymptote Xb, equivalent to the y-axis; however, the y-axis becomes the upper asymptote of X in the third quadrant. Because the 2 quadrants share common asymptotes, we call them the pivot asymptotes and represent them as Y_p = 0 and X_p = 0. In the first quadrant, the nonlinear face value of the nonlinear numbers Y is negatively proportional to the nonlinear face value of the nonlinear numbers X. Their differential equation is Eq. (1c), where K is the proportionality constant. In the third quadrant, the nonlinear face value of the nonlinear numbers Y is proportional to the nonlinear face value of the nonlinear numbers X; their differential equation is Eq. (1d). The notation “d” stands for differential or change. To obtain the nonlinear change of a nonlinear face value, we simply take the logarithmic transformation of the face value and then take the differential “d”.

$$d(q(Y - Y_{p})) = -K\,d(q(X - X_{p})) \qquad \text{(Eq. 1c)}$$

$$d(q(Y_{p} - Y)) = -K\,d(q(X_{p} - X)) \qquad \text{(Eq. 1d)}$$

In the first quadrant, Y_p = 0 and X_p = 0 are the bottom asymptotes of Y and X. In the third quadrant, Y_p = 0 and X_p = 0 are the upper asymptotes of Y and X. In either case, differences relative to the asymptotes are measured as the upper value minus the lower value, e.g. (Y – Y_p) for the first quadrant and (Y_p – Y) for the third quadrant. In the third quadrant, the negative X value makes (0 – X) positive (e.g., 0 – (-3) = 3) and the negative Y value makes (0 – Y) positive. When plotting the nonlinear numbers Y versus the nonlinear numbers X [i.e., (Y – Y_p) vs. (X – X_p)] for the first quadrant, and also plotting the nonlinear Y versus the nonlinear X [i.e., (Y_p – Y) vs. (X_p – X)] for the third quadrant on a log-log graph, we obtain a straight line with slope K = -1, as shown in Figure 3(C-1). Table 4 lists X, Y, (X_p – X), and 1/(X_p – X) for the 4 quadrants. The formula bar gives Cell C11 as “=0 – A11”, i.e. 0 – (-0.20) = 0.20.

Table 4.

List of X, Y, (Xp – X), and 1/ (Xp - X)

C11	fx	= 0 – A11
	A	B	C	D	E	F	G	H
1	X	Y	Xp – X	1/(Xp – X)	X	Y	Xp – X	1/(Xp – X)
2	First quadrant				Third quadrant
3	0.17	5.88			−0.15	−6.67	0.15	6.67
4	0.25	4.00			−0.60	−1.67	0.60	1.67
5	0.50	2.00			−0.80	−1.25	0.80	1.25
6	1.20	0.83			−1.50	−0.67	1.50	0.67
7	2.00	0.50			−2.50	−0.40	2.50	0.40
8	3.00	0.33			−3.80	−0.26	3.80	0.26
9	4.60	0.22			−5.30	−0.19	5.30	0.19
10	Second quadrant				Fourth quadrant
11	−0.20	5.00	0.20	5.00	0.19	−5.26	0.19	5.26
12	−0.33	3.00	0.33	3.00	0.30	−3.33	0.30	3.33
13	−0.50	2.00	0.50	2.00	0.60	−1.67	0.60	1.67
14	−1.00	1.00	1.00	1.00	1.00	−1.00	1.00	1.00
15	−2.00	0.50	2.00	0.50	1.80	−0.56	1.80	0.56
16	−3.00	0.33	3.00	0.33	3.40	−0.29	3.40	0.29
17	−4.17	0.24	4.17	0.24	4.80	−0.21	4.80	0.21
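
The log-log slope for the first and third quadrants can be checked numerically. The sketch below is an illustration under assumptions, not the author's worksheet: the helper `loglog_slope` is hypothetical, the X values are taken from Table 4, and Y is recomputed as 1/X rather than copied from the rounded table entries. Face values are measured relative to the pivot asymptotes so that both quadrants give positive numbers that can be placed on a logarithmic scale.

```python
import math

def loglog_slope(face_x, face_y):
    """Least-squares slope of log10(face_y) versus log10(face_x) (hypothetical helper)."""
    qx = [math.log10(v) for v in face_x]
    qy = [math.log10(v) for v in face_y]
    n = len(qx)
    mx, my = sum(qx) / n, sum(qy) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(qx, qy))
    sxx = sum((a - mx) ** 2 for a in qx)
    return sxy / sxx

X_P = 0.0  # pivot asymptote of X
Y_P = 0.0  # pivot asymptote of Y

# X values from Table 4 (first and third quadrants); Y = 1/X is recomputed
x_q1 = [0.17, 0.25, 0.50, 1.20, 2.00, 3.00, 4.60]
x_q3 = [-0.15, -0.60, -0.80, -1.50, -2.50, -3.80, -5.30]

# First quadrant: face values are (X - Xp) and (Y - Yp)
slope_q1 = loglog_slope([x - X_P for x in x_q1], [1 / x - Y_P for x in x_q1])

# Third quadrant: face values are (Xp - X) and (Yp - Y), both positive
slope_q3 = loglog_slope([X_P - x for x in x_q3], [Y_P - 1 / x for x in x_q3])

print(slope_q1, slope_q3)
```

Both slopes come out at -1 to within floating-point error, which is the single straight line of Figure 3(C-1).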

It is time that people learned the right science. The above examples of arithmetic and algebra should have been taught in high schools. It is essential that all high school students learn the nonlinear concept at a young age; yet, for thousands of years, people have been following the wrong teaching, generation after generation.

(5). In the next section, let us use simulated data to illustrate how to obtain the upper asymptote using the template and how to obtain the equation parameters in the αβ Math. Other than using the template, people can also use a trial-and-error method for solving the upper asymptote and the parameters.^5,8

In the αβ Math, the change of linear numbers is simply taking the differential of the linear numbers, such as dY for change of linear numbers Y and dX for change of linear numbers X. For the nonlinear numbers, we first determine the face value followed by taking the logarithmic transformation of the face value, and then take the differential of the transformed face value. The face values of the nonlinear numbers are the difference of nonlinear numbers measured relative to their asymptotes, such as (Yu – Y), (Y – Yb), (X – Xb), and (qYu – qY). Then, their logarithmic transformations are q(Yu – Y), q(Y – Yb), q(X – Xb), and q(qYu – qY). And their change or the differential are d(q(Yu – Y)), d(q(Y – Yb)), d(q(X – Xb)), and d(q(qYu – qY)). In addition, by introducing the proportionality constant K, we have the differential and integral equations as follows.

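$$d(q(Yu - Y)) = -K\,dX \qquad \text{(Eq. 2a)}$$

$$d(q(qYu - qY)) = -K\,dX \qquad \text{(Eq. 2b)}$$

$$d(q(qYu - qY)) = -K\,d(q(X - Xb)), \quad Xb = ϕ = (0) \qquad \text{(Eq. 2c)}$$

$$q(Yu - Y) = -KX + qC \quad \text{or} \quad (Yu - Y) = θ^{(-KX + qC)} = Cθ^{(-KX)} \qquad \text{(Eq. 2a-1)}$$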

Here C is an integral constant or position constant (it dictates the position of the straight line, moving it up or down in a graph). We read Equation (2a) as: the nonlinear change of the nonlinear numbers Y is negatively proportional to the linear change of the linear numbers X. We read Equation (2b) as: the nonlinear change of the nonlinear numbers Y in the second order of nonlinearity is negatively proportional to the linear change of the linear numbers X. We read Equation (2c) as: the nonlinear change of the nonlinear numbers Y in the second order of nonlinearity is negatively proportional to the nonlinear change of the nonlinear numbers X.
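
The step from Eq. (2a) to Eq. (2a-1) can be spelled out as follows. This is a sketch of the integration only, with the constant of integration written as qC so that it matches Eq. (2a-1); the closing remark about X = 0 is an observation added here, not a statement from the original derivation.

```latex
\int d\bigl(q(Yu - Y)\bigr) = -K \int dX
\;\Longrightarrow\;
q(Yu - Y) = -KX + qC
\;\Longrightarrow\;
(Yu - Y) = \theta^{(-KX + qC)} = C\,\theta^{(-KX)}
```

Setting X = 0 gives (Yu – Y) = C, so under this form the position constant C equals the face value at the origin.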

Table 5 (Worksheet 0) gives the basic data of the example. Column A gives the elementary (x); Column B is the cumulative X for the succession of (x); Column C gives the elementary (y); Column D gives the cumulative Y for the succession of (y), e.g. D4 = D3 + C4, D5 = D4 + C5, etc. When plotting Column B vs. Column C for X versus y, we obtain a primitive elementary graph showing a skewed-bell curve in Figure 4a. By plotting Column B versus Column D for X versus cumulative Y, we obtain a parabolic curve, as shown in Figure 4b. This is a primary graph plotting cumulative X versus cumulative Y. It is also a leading graph, because it leads us to generate physical equations based on the continuous change of the slope of the parabolic curve and its relationship with its upper asymptote. In the Alpha Beta (αβ) Math, the nonlinear numbers Y are associated with their asymptotes Yu and Yb. In most cases, the bottom asymptotes Yb are nonlinear zeros. Thus, we need only learn how to solve for the upper asymptote Yu during the search for the theoretical equation and parameters. In Figure 4b, Y is a set of nonlinear numbers that increases from the origin toward an asymptote. However, where is the upper asymptote? How do we determine it?

Table 5.

Resolving Optimal Yu (Worksheet 0)

B3	fx	= 0
	A	B	C	D	E	F
1	(x)	X	(y)	Y	y = (Yu – Y)	qy
2
3	0	0	0	0
4	4	4	7.37	7.37
5	4	8	4.66	12.03
6	4	12	2.94	14.97
7	4	16	1.86	16.82
8	4	20	1.17	17.99
9	4	24	0.74	18.73
10	4	28	0.47	19.20
11
12	ΔYu =				Yu =
13					R ^ 2
14	Yu0 =
15	Yu1 =
16	Yu2 =
17	Yu3 =
18	Yu4 =
19	Yu5 =
20	Yu6 =
21	Yu7 =
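
The cumulative columns B and D of Worksheet 0 are running sums of the elementary columns A and C. A minimal Python sketch of that step follows; the variable names are hypothetical. Because the elementary y values in Column C are already rounded to 2 decimals, the running sum may differ from the tabulated Column D in the last digit.

```python
from itertools import accumulate

# Elementary columns A and C of Table 5 (Worksheet 0)
x_elem = [0, 4, 4, 4, 4, 4, 4, 4]
y_elem = [0, 7.37, 4.66, 2.94, 1.86, 1.17, 0.74, 0.47]

# Cumulative columns B and D: each entry is the previous total plus the new
# elementary value, i.e. D4 = D3 + C4, D5 = D4 + C5, and so on
X_cum = list(accumulate(x_elem))
Y_cum = [round(total, 2) for total in accumulate(y_elem)]

for x, y in zip(X_cum, Y_cum):
    print(x, y)
```

Plotting `X_cum` against `y_elem` would correspond to the primitive elementary graph, and `X_cum` against `Y_cum` to the primary graph.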

Figure 4.

(A) Illustration A. (a). Primitive elementary graph. y vs. X. Linear by linear scale. (b). Primary graph. Cum. Y vs. cum. X. Linear by linear scale. (c). Optimal unique Yu. Yu vs. R^2. (d). Transitional graph. (Yu − Y) vs. X. Linear by linear scale. (e). Proportionality graph. q(Yu − Y) vs. X. Log by linear scale.

The following illustrates how to resolve the unique/optimal Yu using Microsoft Excel. First, we build a template and then systematically resolve Yu by finding the optimal coefficient of determination R ^ 2 (R²). The sequence for determining the optimal upper asymptote Yu is: first, select 7 to 10 estimated upper asymptotes, Yu1, Yu2, Yu3…Yu7 (see Figure 4(A) Illustration A); second, calculate R ^ 2 for each estimated upper asymptote, as shown in Worksheet 1 to Worksheet 5; and third, plot the estimated Yu vs. R ^ 2 and visually identify the optimal Yu from the graph. In Figure 4b, we show the upper asymptote Yu as a dashed horizontal line. The graph shows that the length of the vertical solid double arrow is negatively proportional to the length of the horizontal dashed double arrow; the larger the solid double arrow, the smaller the horizontal dashed double arrow becomes, and vice versa. In equation form, it is “the nonlinear change of nonlinear face-value (Yu – Y) is negatively proportional to the linear change of linear face-value X,” or “the change of nonlinear true value q(Yu – Y) is negatively proportional to the change of linear true value X,” as shown in Eq. (2a). Its integral form is Eq. (2a-1).

According to Eq. (2a-1), we can plot (Yu – Y) vs. X on a log-linear (semi-log) graph for the values of q(Yu – Y) vs. X to obtain a straight line when the true upper asymptote Yu is applied in the calculation. (Note: we plot the nonlinear face-value (Yu – Y) on a vertical logarithmic scale to give the true value q(Yu – Y).) If the given Yu value strays from the true Yu value, we get a curved line. To find the straight line, we first need to calculate y = (Yu – Y) and qy = q(Yu – Y), and then plot y versus X on a log-linear (semi-log) graph.

Next, let us generate a few (7 to 10) incremental estimated Yu values as Yu1, Yu2, Yu3…Yu7, with the initial Yu (Yu0) picked from the last (largest) Y number in Column D (i.e., Cell D10), Yu0 = 19.20. We assign an active incremental Yu value, ΔYu, in Cell B12 as about 1% of Yu0, i.e. 0.19 (ΔYu = 0.19), as shown in Worksheet 1 (Table 6). In this way, we can generate a series of estimated Yu, from Yu1 to Yu7, in Column B (B15: B21). The formula for Yu1 in Cell B15 is “=B14 + $B$12”, as shown in the formula bar. We copy Cell B15 to Cell B16 through Cell B21 to complete the column. By changing ΔYu, we can obtain a wide range of estimated Yu for Yu1 to Yu7.

Table 6.

Resolving Optimal Yu (Worksheet 1)

B15	fx	= B14 + $B$12
	A	B	C	D	E	F
1	(x)	X	(y)	Y	y = (Yu – Y)	qy
2
3	0	0	0	0
4	4	4	7.37	7.37
5	4	8	4.66	12.03
6	4	12	2.94	14.97
7	4	16	1.86	16.82
8	4	20	1.17	17.99
9	4	24	0.74	18.73
10	4	28	0.47	19.20
11
12	ΔYu =	0.19			Yu =
13					R ^ 2
14	Yu0 =	19.20
15	Yu1 =	19.39
16	Yu2 =	19.58
17	Yu3 =	19.77
18	Yu4 =	19.96
19	Yu5 =	20.15
20	Yu6 =	20.34
21	Yu7 =	20.53

Next, we need to calculate Column E, Column F, and the coefficient of determination R² for a given estimated Yu, starting from Yu1. We assign this first Yu1 value (in Cell B15) to Cell F12, as shown in Worksheet 2, and call it the key estimated active Yu (inside the dashed cell). We use this key estimated active Yu for calculating y = (Yu – Y) in Column E, qy = q(Yu – Y) in Column F, and the coefficient of determination R ^ 2 in Cell F13. After the calculation, we sequentially change the key estimated active Yu from Yu1 to Yu2, Yu3…Yu7. Table 7 (Worksheet 2) shows the calculation of Column E; the formula bar shows the calculation of Cell E3, “=$F$12 − D3”. We copy Cell E3 to Cell E4 through Cell E10 to complete the column.

Table 7.

(Worksheet 2)

E3	fx	= $F$12 – D3
	A	B	C	D	E	F
1	(x)	X	(y)	Y	y = (Yu – Y)	qy
2
3	0	0	0	0	19.39
4	4	4	7.37	7.37	12.02
5	4	8	4.66	12.03	7.36
6	4	12	2.94	14.97	4.42
7	4	16	1.86	16.82	2.57
8	4	20	1.17	17.99	1.40
9	4	24	0.74	18.73	0.66
10	4	28	0.47	19.20	0.19
11
12	ΔYu =	0.19			Yu =	19.39
13					R ^ 2
14	Yu0 =	19.20
15	Yu1 =	19.39
16	Yu2 =	19.58
17	Yu3 =	19.77
18	Yu4 =	19.96
19	Yu5 =	20.15
20	Yu6 =	20.34
21	Yu7 =	20.53

The next step is to calculate Column F for q(Yu – Y). This is done by taking the log of Column E; e.g. the formula bar shows the calculation of Cell F3 as “=LOG (E3)”, as shown in Table 8 (Worksheet 3). We copy Cell F3 to Cell F4 through Cell F10 to complete the column. Next, we use the same key estimated active Yu in Cell F12 to calculate the coefficient of determination R², as shown in Worksheet 4 (Table 9). Cell F13 gives the coefficient of determination. There are 2 ways to get it: we can use either “= (CORREL (array1, array2)) ^ 2”, as shown in the formula bar, or “=RSQ (known_y’s, known_x’s)”. The calculated R² can be displayed with any number of decimals; Cell F13 shows 0.9719 with 4 decimal places.

Table 8.

(Worksheet 3)

F3	fx	= LOG(E3)
	A	B	C	D	E	F
1	(x)	X	(y)	Y	y = (Yu – Y)	qy
2
3	0	0	0	0	19.39	1.288
4	4	4	7.37	7.37	12.02	1.080
5	4	8	4.66	12.03	7.36	0.867
6	4	12	2.94	14.97	4.42	0.646
7	4	16	1.86	16.82	2.57	0.409
8	4	20	1.17	17.99	1.40	0.145
9	4	24	0.74	18.73	0.66	−0.183
10	4	28	0.47	19.20	0.19	−0.723
11
12	ΔYu =	0.19			Yu =	19.39
13					R ^ 2
14	Yu0 =	19.20
15	Yu1 =	19.39
16	Yu2 =	19.58
17	Yu3 =	19.77
18	Yu4 =	19.96
19	Yu5 =	20.15
20	Yu6 =	20.34
21	Yu7 =	20.53

Table 9.

(Worksheet 4)

F13	fx	= CORREL (B3: B10, F3: F10) ^ 2
	A	B	C	D	E	F
1	(x)	X	(y)	Y	y = (Yu – Y)	qy
2
3	0	0	0	0	19.39	1.288
4	4	4	7.37	7.37	12.02	1.080
5	4	8	4.66	12.03	7.36	0.867
6	4	12	2.94	14.97	4.42	0.646
7	4	16	1.86	16.82	2.57	0.409
8	4	20	1.17	17.99	1.40	0.145
9	4	24	0.74	18.73	0.66	−0.183
10	4	28	0.47	19.20	0.19	−0.723
11
12	ΔYu =	0.19			Yu =	19.39
13					R ^ 2	0.9719
14	Yu0 =	19.20
15	Yu1 =	19.39			0.9719
16	Yu2 =	19.58			0.9922
17	Yu3 =	19.77			0.9984
18	Yu4 =	19.96			1.0000
19	Yu5 =	20.15			0.9996
20	Yu6 =	20.34			0.9982
21	Yu7 =	20.53			0.9964

By using 19.39 (Yu1) as the key estimated active Yu in Cell F12, we obtain R ^ 2 as 0.9719 in Cell F13. We record this number in Cell E15 (on the Yu1 line). By changing the key estimated active Yu in Cell F12 to 19.58 (Yu2), Cell F13 changes to 0.9922. We record this number in Cell E16. We sequentially change the Cell F12 value through Yu2, Yu3…Yu7 in Column B (B15: B21) and record the resulting R ^ 2 from Cell F13 in Column E (E15: E21), as shown in Table 10 (Worksheet 5). In the special case where the key estimated active Yu in Cell F12 is changed to 19.96 (Yu4), Cell F13 changes to 1.0000. We record this number in Cell E18, as shown in Table 10 (Worksheet 5). By changing the key estimated active Yu in Cell F12 to 20.53 (Yu7), Cell F13 changes to 0.9964. We record this number in Cell E21. By plotting Column B for Yu1 to Yu7 (B15: B21) versus the corresponding R² in Column E (E15: E21), we obtain Figure 4c, where an arrow points to the optimal upper asymptote. The upper asymptote Yu = 19.96, with the maximum R² of 1, is the final answer.

Table 10.

(Worksheet 5)

F13	fx	= CORREL (B3: B10, F3: F10) ^ 2
	A	B	C	D	E	F
1	(x)	X	(y)	Y	y = (Yu – Y)	qy
2
3	0	0	0	0	19.96	1.3002
4	4	4	7.37	7.37	12.59	1.0999
5	4	8	4.66	12.03	7.93	0.8993
6	4	12	2.94	14.97	4.99	0.6982
7	4	16	1.86	16.82	3.14	0.4964
8	4	20	1.17	17.99	1.97	0.2934
9	4	24	0.74	18.73	1.23	0.0884
10	4	28	0.47	19.20	0.76	−0.1197
11
12	ΔYu =	0.19			Yu =	19.96
13					R ^ 2	1.0000
14	Yu0 =	19.20
15	Yu1 =	19.39			0.9719
16	Yu2 =	19.58			0.9922
17	Yu3 =	19.77			0.9984
18	Yu4 =	19.96			1.0000
19	Yu5 =	20.15			0.9996
20	Yu6 =	20.34			0.9982
21	Yu7 =	20.53			0.9964

The last thing to do is to express the proportionality equation and graph using the optimal upper asymptote. By assigning the optimal upper asymptote Yu (Yu = 19.96) to Cell F12, we have all data ready for graphing. We first plot Column B (B3: B10) for X versus Column E (E3: E10) for (Yu – Y) in a linear by linear scale, as shown in transitional graph Figure 4d. By converting the vertical axis from linear into nonlinear logarithmic scale in this graph, we obtain Figure 4e without trendline equation and without coefficient of determination. This log-linear (semi-log) graph is the proportionality graph. The transitional graph is a plot of (Yu – Y) vs. X in a linear by linear scale. The proportionality graph is a plot of (Yu – Y) vs. X in a log by linear scale where the true value comparison is q(Yu – Y) vs. X.

To obtain the trendline equation and the coefficient of determination in the proportionality graph, we right-click on the data series (in Figure 4e), then → add a trendline → in Trendline Options, select “exponential” → select “Display Equation on chart” → select “Display R-squared value on chart” → close. We obtain Figure 4e with the regression equation y = 20e^-0.115x and the coefficient of determination R² = 1. When we select “exponential” in the Trendline Options, Excel gives a semi-log graph and reports the trendline as a base-e exponential equation. This is awkward because the Excel trendline does not provide a base-10 equation. The remedy is to convert e to θ (θ = 10) and to convert -0.115 to -0.05 by dividing by the conversion factor 2.303 (= ln 10; 0.115/2.303 = 0.05), as shown in Figure 4e, where y = 20θ^-0.05x. The equation y = Cθ^-KX, with C = 20 and K = 0.05, means (Yu – Y) = Cθ^-KX. Taking the log of both sides, we get q(Yu – Y) = -KX + qC; its differential form is d(q(Yu – Y)) = -KdX, meaning that the change of the nonlinear true value q(Yu – Y) is negatively proportional to the change of the linear true value X.
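The base conversion described above can be checked numerically. The following is a minimal Python sketch, not part of the Excel workflow; the function name `e_rate_to_base10` and the test point `x = 12.0` are our own illustrative choices.

```python
import math

def e_rate_to_base10(k_e: float) -> float:
    """Convert the exponent constant of C*e^(-k_e*x) into K of C*10^(-K*x)."""
    return k_e / math.log(10)  # ln(10) is about 2.303

k_e = 0.115                    # exponent reported by the Excel trendline
K = e_rate_to_base10(k_e)      # about 0.05

# Both forms should give the same value at any x (x = 12.0 is arbitrary).
x = 12.0
lhs = 20 * math.exp(-k_e * x)
rhs = 20 * 10 ** (-K * x)
print(round(K, 3), round(lhs, 4), round(rhs, 4))
```

The two printed values of (Yu – Y) agree, which is all the conversion claims: only the base of the exponential changes, not the curve.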

The following is a summary of the corresponding steps:

1. Use the last experimental data point as a reference upper asymptote Yu, e.g., Yu0.

2. Assign an active incremental Yu value, ΔYu, e.g., ΔYu = 0.19 (about 1% of Yu0), so that we can generate 7 to 10 estimated upper asymptotes covering a range of Yu, e.g., the estimated upper asymptotes Yu1, Yu2, Yu3, …, Yu7 in Column B (B15: B21).

3. Assign an estimated upper asymptote (e.g., Yu1) to a special Cell (i.e., Cell F12) for calculating the face value y = (Yu – Y) in Column E and the true nonlinear value qy = log (Yu – Y) in Column F.

4. Assign a special Cell (e.g., Cell F13) for calculating the coefficient of determination R ^ 2.

5. Calculate R ^ 2 in Cell F13 using the formula “=CORREL (B3: B10, F3: F10) ^ 2”.

6. Copy the R ^ 2 value from Cell F13 to Cell E15 (on the same row as the Yu1 value).

7. Go on to the next estimated Yu (e.g., Yu2) and repeat the last 4 steps (step 3 to step 6).

8. Plot the 7 estimated upper asymptotes against their coefficients of determination, (B15: B21) vs. (E15: E21), to locate the optimal asymptote, as shown in Figure 4c.
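The steps above can also be sketched outside Excel. The following Python sketch is an illustration under stated assumptions: it uses the rounded X and Y values printed in Table 10, and the helper names `r_squared` and `scan` are hypothetical. Because the inputs are rounded, the R ^ 2 values may differ slightly from the worksheet in the last digit.

```python
import math

def r_squared(xs, ys):
    """Square of the Pearson correlation, the analogue of CORREL(...)^2."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)

# Cumulative X and Y as printed in Table 10 (Columns B and D).
X = [0, 4, 8, 12, 16, 20, 24, 28]
Y = [0, 7.37, 12.03, 14.97, 16.82, 17.99, 18.73, 19.20]

yu0 = Y[-1]   # step 1: last data point as the reference asymptote
d_yu = 0.19   # step 2: about 1% of Yu0
candidates = [round(yu0 + k * d_yu, 2) for k in range(1, 8)]  # Yu1..Yu7

def scan(xs, ys, yus):
    """Steps 3 to 7: for each estimated Yu, compute R^2 of log(Yu - Y) vs. X."""
    results = []
    for yu in yus:
        qy = [math.log10(yu - y) for y in ys]
        results.append((yu, r_squared(xs, qy)))
    return results

results = scan(X, Y, candidates)
best_yu, best_r2 = max(results, key=lambda pair: pair[1])  # step 8
for yu, r2 in results:
    print(f"Yu = {yu:.2f}  R^2 = {r2:.4f}")
print("optimal Yu:", best_yu)
```

The scan only compares the listed candidates; a finer ΔYu around the best candidate would refine the estimate in the same way.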

The above example with Figure 4a to Figure 4e and its associated equation is similar to that of an example in arsenic toxicokinetic analysis presented in reference.⁶

The Full Analysis of Lung Cancer Mortality/Radon Relationship

In the following, we analyze the relationship between the lung-cancer mortality rates and the indoor radon levels. First, we present the primitive elementary graph of elementary y (mortality) versus X (radon intensity) for linearly grouped and nonlinearly grouped data. Second, we emphasize the presentation of cumulative data Y (cumulative of y for succession of X) using the primary graph. Third, based on the primary graph, we develop the reasoning for establishing the proportionality relationship between the cumulative Y and the cumulative X according to the physical law. Fourth, we demonstrate the use of a template to solve for the upper asymptote Yu of the proportionality equation. As a rule of thumb, we use a “10 by 1%” guideline (about 10 estimates at increments of about 1%) to solve for the optimal upper asymptote in the nonlinear proportionality equations, as shown in the previous section.

A. Data Presentation with Primitive Elementary Graphs

This author obtained a set of data from Professor Cohen on the lung-cancer mortality rate versus indoor radon level collected from 1,597 counties and territories of the U.S.A.¹¹ Because of the large number of data points involved, we chose to group the data for analysis. First, we use the grouped data to illustrate the noticeable difference between linearly grouped and nonlinearly grouped elementary y data when plotted in a primitive elementary graph. Either way, we should not use these elementary y data for mathematical analysis. Instead, we need to compare one cumulative data set with another cumulative data set, not an elementary number with a cumulative number (we need to compare oranges with oranges and apples with apples, but not apples with oranges). Notice that the X data are always monotonic cumulative data. Second, we emphasize the need to compare the cumulative Y data with the cumulative X data by plotting the primary graph, where we plot grouped cumulative Y data versus cumulative X data, followed by mathematical analysis, as will be shown in the next section.

A portion of Cohen’s original data, in ascending order of the mean radon levels r/r₀ (r₀ = 37 Bq m^-3 = 1.0 pCi L^-1), is given in Table 11. The first column (Column A) is the county code; there are 1597 (county and territory) sampling points for the U.S.A. The second column (Column B) is the indoor radon level X in pCi/L, in ascending order (monotonically increasing). The third column (Column C) is the mortality rate (y) for the county. The fourth column (Column D) is the cumulative Y calculated from the y values in Column C, where Cell D4 is D4 = D3 + C4, D5 = D4 + C5, etc.; we copy the cell down to the end to complete the column. There are 1597 data points in Excel. (Attention: we always reserve a row of blank cells above the data set for use in calculation; for example, we need Cell D2 to be a blank cell.)

Table 11.

Portion of Data for Lung Cancer Mortality Rate vs. Mean Radon Levels

D4	fx	= D3 + C4
	A	B	C	D
1	County ID No.	X	(y)	Cum Y
2
3	1237	0.261	1.299	1.299
4	547	0.272	1.585	2.884
5	819	0.381	1.323	4.207
6	805	0.392	1.206	5.413
7	560	0.396	0.648	6.061
8	1481	0.401	1.083	7.144
9	872	0.411	1.284	8.429
10	1472	0.417	1.281	9.710
11	180	0.432	0.884	10.593
12	809	0.434	1.183	11.776
13	1471	0.434	1.084	12.860
14	955	0.440	1.084	13.944
15	545	0.460	1.534	15.478
16	542	0.460	2.014	17.492
17	1144	0.461	1.136	18.628
18	501	0.465	1.429	20.057
19	939	0.470	1.054	21.111
20	1333	0.471	1.216	22.327
21	1343	0.474	1.235	23.561
22	546	0.481	1.466	25.028
23	181	0.482	0.893	25.920
24	1476	0.483	1.201	27.121
25	532	0.487	1.401	28.522
26	152	0.490	1.189	29.712
27	918	0.490	0.827	30.539
28	32	0.490	0.959	31.498
29	132	0.490	1.016	32.514
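The running sum in Column D (D4 = D3 + C4, and so on) can be reproduced outside Excel. The following is a minimal Python sketch using only the first five y values printed in Table 11; the variable names are our own.

```python
from itertools import accumulate

# First five elementary mortality values y from Column C of Table 11.
y = [1.299, 1.585, 1.323, 1.206, 0.648]

# Cumulative Y: each entry is the previous cumulative value plus the current y.
cum_y = [round(v, 3) for v in accumulate(y)]
print(cum_y)
```

The result matches the first five entries of Column D in Table 11.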
30	871	0.492	1.516	34.030

Table 11 has 3 sub-tables: Tables 12, 13, and 14. Tables 12 and 13 give linearly grouped data in which X increases at increments of 0.5 and 0.25, respectively. Table 13 starts at X = 0.25, where we do not have any mortality up to this level (i.e., y = 0). Table 14 gives nonlinearly grouped data in which X increases by a factor of 1.4, starting at X = 0.240 (starting from 0.240 with a factor of 1.4 gives 10 sampling points covering up to X = 6.942). All the sub-tables also have a column with the number of counties for the given X. Table 15 gives the linearly grouped mean radon levels and the corresponding mortality rates for each range (category). We select 12 linear groups, starting from X = 1.0 and increasing the radon level in linear increments of 0.5. The total mortality for radon levels between 0 and 1.0 is the sum of the mortality within this range; it is listed at the end point X = 1.0 (0 to 1.0) as y = 445.513. The total mortality for radon levels between 1.0 and 1.5 is the sum of the mortality within this range; it is listed at the end point X = 1.5 as y = 380.228, etc. The cumulative Y is the running sum of y for each successive increase of X, as shown in Columns 5 and 10 of Table 15. Table 15 is the same as Table 12, except that Table 15 has a Category column and Table 12 has a row of 0.

Table 12.

Linearly grouped 13 data points; X at an increment of 0.5

X	# of counties	y	Cum Y
0	0	0	0
1	408	445.513	445.513
1.5	386	380.228	825.741
2	265	260.145	1085.886
2.5	205	187.461	1273.347
3	136	119.133	1392.480
3.5	88	75.128	1467.608
4	47	41.059	1508.667
4.5	27	24.857	1533.524
5	19	15.921	1549.445
5.5	9	6.219	1555.664
6	3	2.898	1558.562
6.5	4	3.346	1561.908
(SUM)	1597	1561.908

Table 13.

Linearly grouped 26 data points; X at an increment of 0.25

X	# of counties	y	Cum Y
0.25	0	0.000	0.000
0.50	32	39.011	39.011
0.75	161	177.757	216.768
1.00	215	228.745	445.513
1.25	194	194.720	640.233
1.50	191	185.508	825.741
1.75	146	142.684	968.425
2.00	118	117.461	1085.886
2.25	100	93.824	1179.710
2.50	104	93.638	1273.347
2.75	75	65.647	1338.995
3.00	61	53.485	1392.480
3.25	52	42.357	1434.836
3.50	38	32.772	1467.608
3.75	21	19.373	1486.981
4.00	26	21.687	1508.667
4.25	16	14.234	1522.901
4.50	11	10.624	1533.525
4.75	15	11.913	1545.438
5.00	5	4.008	1549.445
5.25	3	2.350	1551.796
5.50	6	3.869	1555.664
5.75	2	1.356	1557.020
6.00	1	1.543	1558.563
6.25	2	2.151	1560.713
6.50	2	1.195	1561.908
(SUM)	1597	1561.908

Table 14.

Nonlinearly grouped 10 data points; X increasing by a factor of 1.4

X	# of counties	y	Cum Y
0.240
0.336	2	2.884	2.884
0.470	14	18.227	21.111
0.659	109	121.618	142.729
0.922	210	227.617	370.347
1.291	307	311.803	682.150
1.807	334	322.696	1004.846
2.530	303	282.214	1287.060
3.542	214	184.899	1471.958
4.959	69	77.487	1549.445
6.942	35	12.463	1561.908
(SUM)	1597	1561.908
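The nonlinear grouping in Table 14 can be sketched as follows. This is an illustrative Python sketch, not the procedure actually used to build the table: the helper names `geometric_edges` and `group` are hypothetical, the bins are assumed to be closed on the right, and only the first four rows of Table 11 are used as sample input.

```python
from bisect import bisect_left

def geometric_edges(start, factor, n_bins):
    """Bin edges that grow by a constant factor: start, start*factor, ..."""
    return [start * factor ** k for k in range(n_bins + 1)]

def group(xs, ys, edges):
    """Count points and sum y in right-closed bins (edges[i], edges[i+1]]."""
    counts = [0] * (len(edges) - 1)
    sums = [0.0] * (len(edges) - 1)
    for x, y in zip(xs, ys):
        i = bisect_left(edges, x) - 1   # index of the bin whose upper edge is >= x
        if 0 <= i < len(counts):
            counts[i] += 1
            sums[i] += y
    return counts, sums

edges = geometric_edges(0.240, 1.4, 10)
print([round(e, 3) for e in edges])     # end points of X as listed in Table 14

# First four rows of Table 11 as sample input (X and y).
xs = [0.261, 0.272, 0.381, 0.392]
ys = [1.299, 1.585, 1.323, 1.206]
counts, sums = group(xs, ys, edges)
print(counts[0], round(sums[0], 3))     # first bin: 0.240 < X <= 0.336
```

With the full 1597-point data set in place of the four sample rows, the same function would return one count and one summed y per nonlinear bin.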

Table 15.

Linearly grouped data (X increases linearly at an increment of 0.5) Mean radon levels vs. lung cancer mortality rates, (X vs. y or Y)

Category	# of counties	X	y	Y = Cum of y	Category	# of counties	X	y	Y = Cum of y
0 - 1.0	408	1.0	445.513	445.513	3.5 - 4.0	47	4.0	41.059	1508.67
1.0 - 1.5	386	1.5	380.228	825.741	4.0 - 4.5	27	4.5	24.857	1533.52
1.5 - 2.0	265	2.0	260.145	1085.89	4.5 - 5.0	19	5.0	15.921	1549.45
2.0 - 2.5	205	2.5	187.461	1273.35	5.0 - 5.5	9	5.5	6.219	1555.66
2.5 - 3.0	136	3.0	119.133	1392.48	5.5 - 6.0	3	6.0	2.898	1558.56
3.0 - 3.5	88	3.5	75.128	1467.61	6.0 - 6.5	4	6.5	3.346	1561.91

There are many ways to generate primitive elementary graphs from the above tables. However, there is only a single way to generate the cumulative-number-based primary graph. We generate various primitive elementary graphs below and the primary graph in the next section.

First, let us generate a primitive elementary graph using Table 15. In total, there are 12 linear groups in Table 15. Figure 5(A) plots category versus mortality as a column graph, giving the mortality within each linearly grouped category; this is one of the primitive elementary graphs. When we plot the data in Table 15 as mortality y versus mean radon level X in a linear by linear scale, using the 12 data points, we get the primitive elementary graph Figure 5(B). Both Figure 5(A) and 5(B) show the mortality rate decreasing as the radon level increases. These graphs are misleading, and they misled Prof. Cohen in assessing the LNT (linear no-threshold) theory. Appendix A shows one of Cohen’s misleading data analyses based on linearly grouped mortality rates y versus mean radon levels X (on different sets of data).¹

Figure 5.

(A). Column graph: mortality within linearly grouped category. (B). Primitive elementary graph: y vs. X, linear by linear scale. (B-1). Mortality versus mean radon level. (C). Mortality versus mean radon level. (D). Mortality versus mean radon level. (E). Column graph: mortality within nonlinearly grouped category. (F). Mortality versus mean radon level.

The decreasing line from the linearly grouped data in Figure 5(B) is similar in form to the primitive elementary data line originally presented by Cohen to represent the lung cancer rate. There is nothing wrong with presenting a primitive data line similar to Figure 5(B) as long as it stands alone and is not intended to relate y and X mathematically. The graph in Figure 5(B) is a primitive elementary graph, and as such, it should not be used for comparison with a cumulative primary graph or for modeling of data. For comparison and modeling, we need to use a primary graph, with either cumulative or demulative (opposite to cumulative) data. The confusion in Cohen’s analysis arises from 2 basic problems. First, the comparison of the primitive data line with the theoretical line is invalid because the theoretical data line is a cumulative data line, while Cohen’s data line is an individually grouped primitive data line. Comparing cumulative data with individual data is not an appropriate comparison. Second, Cohen failed to recognize the nonlinear nature of the relationship between the 2 nonlinear numbers and failed to connect the data to the origin or zero. We need to collect, group, and present the nonlinear data as cumulative data in a nonlinear fashion and address the origin or the zero. When we plot the data of Table 12 without the row of 0, we get essentially the same figure as Figure 5(B). In Table 15, we have only 12 data points, and we miss the importance of connecting to the origin. This shows that there is a serious mistake in Cohen’s analysis, simply due to the failure to include more data between the origin and the first data point or to include the origin (see graph in Appendix A).

Now, when we insert the data of Table 13 with 25 data points (minus the row of 0) into Figure 5(B), we get Figure 5(B-1). This graph indicates the importance of collecting more data close to the origin, or of using a smaller increment of X (0.25 in contrast to 0.5). Clearly, when there are only 12 data points, the simple curve tends to mislead practitioners like Prof. Cohen into interpreting or modeling the data with the LNT theory.¹ However, when there are 25 data points, the skewed bell curve becomes prohibitively difficult for them to model. It is understandable that some practitioners choose a shortcut to avoid collecting more data and to avoid the difficulties of modeling a complicated curve.

In addition to Figure 5(B-1), let us replot the data in Tables 12 and 13 to include (connect to) zero to get Figure 5(C), where we have 13 and 26 data points. Figure 5(C) gives 2 skewed bells, where the larger the increment of X (= 0.5) (and the smaller the number of data points), the larger the bell. The practitioners of “Hormesis” would interpret these curves as biphasic hormesis curves. They would interpret the 13-data-point curve as a linear no-threshold section of increasing y from zero to X = 1.5, followed by a section of decreasing y beyond X = 1.5. They would interpret the 26-data-point curve as a linear with-threshold section of increasing y followed by a section of decreasing y, because there is a threshold: y is zero up to X = 0.25 (from (X, y) = (0, 0) to (X, y) = (0.25, 0)). The conflict and confusion between LNT and linear with-threshold disappear when we make use of cumulative numbers, as shown in Figure 6A in the next section.

Figure 6.

(A). Primary graph: cum Y vs. cum X, 1597 data points and grouped data. (B). Leading graph (originated from primary graph Figure 6(A)): qY vs. X, Y in log scale and X in linear scale. (C). Locating the optimal upper asymptote Yu. (D). Transitional graph: (qYu − qY) vs. X, linear by linear scale. (E). Proportionality graph: q(qYu − qY) vs. qX, nonlinear by nonlinear scale. (F). Proportionality graph: q(qYu − qY) vs. qX, linear by linear scale.

Now, let us compare the primitive elementary curve by inserting the nonlinearly grouped data in Table 14 (for y vs. X) into Figure 5(B-1) to give Figure 5(D). Because the relationship between the mortality and the mean radon level is a nonlinear phenomenon, the nonlinear curve gives a uniform and better range of coverage, especially in the initial small X range. Figure 5(E) gives the corresponding column graph for nonlinearly grouped data with 10 data points. This well-behaved bell-shaped column graph is quite different from that of 12 points linearly grouped column graph in Figure 5(A). (We are using the same 1597 data points as base).

In addition to the above graphs (of X and y), let us examine the relationship among X, y, and the number of counties. By plotting the county data in Tables 12 and 14, we get Figure 5(F). This graph shows that the lines for the number of counties, whether linearly grouped or nonlinearly grouped, lie relatively close to the corresponding mortality lines, indicating they are closely related (maybe by coincidence). The nonlinearly grouped data line indicates that a greater number of counties have high mortality in the middle range of X, around X = 2, and that fewer counties have a low level of X (X close to 0.2) or a high level of X (X larger than 4 or 5).

Overall, the above primitive elementary graphs vary widely, and models based on curve fitting to them would be widely scattered and meaningless. Fortunately, we have a better and more consistent way to look at these same data with a simple nonlinear concept, where we simply compare the cumulative independent variable X with the cumulative dependent variable Y, as discussed next.

B. Data Presentation with Primary Graphs

When plotting cumulative mortality Y versus cumulative radon level X, we get a continuous sigmoid line with 1597 data points, as shown in the primary graph Figure 6a. In the graph, we also insert the nonlinearly grouped data from Table 14 and the linearly grouped data from Table 13 (with 26 data points). The county data are not shown in the graph. When we plot the linearly grouped or nonlinearly grouped county data in the same graph, all the data fall on the same sigmoid curve.

Comparing all the primitive elementary graphs in Figure 5 (Figure 5A to Figure 5F) with the primary graph in Figure 6 (Figure 6(A)), we see that the primitive elementary graphs are confusing and inconsistent, while the primary graph is simple and straightforward because it is consistent with the physical law. (The beauty of using the cumulative numbers is the application of monotonic continuous numbers.) We may say that the primitive elementary graphs are the presentation of “art” and the primary graph is the presentation of “science”. We will discuss how the law of nature dictates the relationship between the dependent variable (continuous cumulative Y) and the independent variable (continuous cumulative X) in the next sections.

C. Beware of the True Meaning of the Data and the Line in the Primitive Elementary Graph and the Primary Graph

It is important to recognize and distinguish the true meaning of the data and the line in the primitive elementary graph and in the primary graph. The primary graph gives the cumulative mortality Y versus the cumulative radon level X, such as the sigmoid curve in Figure 6(A). The essential information in this graph is the rate of change of the curve. For example, at the low level of X = 0.5, the change from X = 0.25 to X = 0.5 is relatively small (see Table 13), at around 39.01 (point A in the graph). In the middle range, the change from X = 1.0 to X = 1.25 is very large, at 194.72 (= 640.23 – 445.51) (point B in the graph). In the upper range, the change from X = 6.0 to X = 6.25 is very small, at 2.15 (= 1560.71 – 1558.56) (point C in the graph). In essence, the rate of change of the line at the beginning and near the end is small; that is, the slope of the curve is very small. In a certain middle range, the rate of change is very large; that is, the slope of the line is very steep. In the beginning, in the small X range, the line curves upward and the positive slope is increasing. Beyond a certain central point, toward the end of large X, the line curves downward and the positive slope is decreasing. It is unfortunate that the literature is full of misinformation. For example, Masters and Lindon misinterpreted a sigmoid curve similar to Figure 6(A) as showing high mortality at high radiation dose.¹² It is important to recognize the small slope at large X, rather than to look at the height of the curve and misinterpret it as high Y at large X.

On the other hand, the primitive elementary graph can provide direct reading of the elementary mortality numbers y. As shown in Figure 5C, at X = 0.5, the mortality is 39.01 (relatively low), shown as point A’ in the graph. At X = 1.0, the mortality is 228.75 (very high), shown as point B’ in the graph. At X = 6.0, the mortality is 1.54 (very low again), shown as point C’ in the graph.
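The two readings are connected by simple differencing: the change of the cumulative Y between successive X values recovers the elementary y. The following minimal Python sketch illustrates this with the first five rows of Table 13; the variable names are our own.

```python
# First five rows of Table 13: end points X and cumulative Y.
X = [0.25, 0.50, 0.75, 1.00, 1.25]
cum_Y = [0.000, 39.011, 216.768, 445.513, 640.233]

# Change of cumulative Y over each interval of X (the elementary y).
y = [round(b - a, 3) for a, b in zip(cum_Y, cum_Y[1:])]
print(y)

# Average slope of the primary graph over each interval, change of Y per unit X.
slopes = [round(dy / (x1 - x0), 3) for dy, x0, x1 in zip(y, X, X[1:])]
print(slopes)
```

The first list reproduces the y column of Table 13 for X = 0.50 to 1.25, including the change of 194.72 between X = 1.0 and X = 1.25 discussed above.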

D. Mathematical Analysis Based on Primary Graphs and Physical Law

Now, let us discuss the full analysis of data according to the physical law. When dealing with the relationship between the dependent variable (continuous cumulative Y) and the independent variable (continuous cumulative X), we can have several situations of comparison. We can compare (A): the ordinary nonlinear Y with the ordinary linear X, or (B): the ordinary nonlinear Y with the ordinary nonlinear X, or (C): the higher nonlinear order of nonlinear Y with the ordinary linear X, or (D): the higher nonlinear order of nonlinear Y with the ordinary nonlinear X, etc. We have discussed the simple case (A) in subsection (5). We will discuss the case (D) using the data in Figure 6 (A).

The lung cancer mortality/radon level case is a case of higher-order nonlinearity. Thus, we need to apply a higher-order nonlinear equation, either Eq. (2b) or Eq. (2c). For graphical interpretation, let us copy Figure 6(A) (with the sigmoidal line) into Figure 6(B) and convert the vertical axis from a linear into a nonlinear logarithmic scale. We get Figure 6(B), with a parabolic line in a log-linear graph. We call it a leading graph because it intuitively leads us to formulate the proportionality equation based on the continuously changing slope of the parabolic line. In the graph, the value of the curve is qY, and the curve approaches qYu as its upper asymptote.

In Figure 6 (B), the measurement of nonlinear face value is a measurement of vertical distance from asymptote, which is (qYu – qY), as indicated by a vertical solid arrow. The measurement of horizontal distance could be either a measurement of linear X or a measurement of nonlinear X, as indicated by a dashed arrow. If it is a linear X, then we have a nonlinear by linear phenomenon with measurement of X from linear zero, such as the relationship between the fructose concentration and the enzyme activity,^4,6 where we can apply Eq. (2b) for describing the curve. When it is a nonlinear X, then we measure X from its bottom asymptote Xb, (X – Xb), and we have a nonlinear by nonlinear phenomenon, where we shall apply Eq. (2c) for describing the curve, as will be described in this section.

The relationship between the 2 double-headed arrows in Figure 6(B) is that as the length of the vertical solid arrow gets bigger, the horizontal dashed arrow gets smaller, or vice versa, meaning that the 2 double-headed arrows have a negative proportionality relationship. For the above nonlinear by nonlinear phenomenon in Figure 6(B), we can use Eq. (2c) to describe its proportionality relationship. Its meaning is as follows: the nonlinear change of the nonlinear face value (qYu – qY) is negatively proportional to the nonlinear change of the nonlinear face value (X – Xb). In other words, the change of the true value q(qYu – qY) is negatively proportional to the change of the true value q(X – Xb).

Now, let us use a template-based approach to illustrate how to solve for the upper asymptote Yu. The bottom asymptote Yb is always assigned as zero and thus needs no calculation. We need only solve for the upper asymptote Yu. Table 16 (Template A) gives X, y, and Cum. Y in Columns A, C, and D for the nonlinearly grouped data listed in the first, third, and fourth columns of Table 14. Column B is for the calculation of qX, which is log(X – Xb) or q(X – Xb), assuming Xb = ϕ = (0).

To start with, we assign an active incremental Yu, ΔYu, as about 1% of the last Y in Column D of Table 16 (Template A). The last Y in Column D is Cell D12 = 1561.908. We assign this number as Yu0, Yu0 = 1561.908, in Cell B16. Then the active incremental Yu is ΔYu = 15.619 (1% of Yu0). We enter this number into the active Cell B14, Cell B14 = 15.619. Referring to Table 17 (Template B), let us calculate the estimated upper asymptotes Yu1 to Yu10. The formula for Yu1 in Cell B17 is given in the formula bar, “=B16 + $B$14”, so Yu1 = 1577.527. We can copy Cell B17 to B18 through B26 to complete the column. We also calculate qX in Column B (B3: B12), which is the log of X in Column A (A3: A12).

Table 16.

(Template A) Search for upper asymptote Yu (a)

B16	fx	= D12
	A	B	C	D	E	F
1	X	qX	(y)	Cum Y	qYu – qY	q(qYu – qY)
2	0.240
3	0.336		2.884	2.884
4	0.470		18.227	21.111
5	0.659		121.618	142.729
6	0.922		227.617	370.347
7	1.291		311.803	682.150
8	1.807		322.696	1004.846
9	2.530		282.214	1287.060
10	3.542		184.899	1471.958
11	4.959		77.487	1549.445
12	6.942		12.463	1561.908
13
14	ΔYu =	15.619		Yu =
15				R ^ 2
16	Yu0 =	1561.908
17	Yu1 =
18	Yu2 =
19	Yu3 =
20	Yu4 =
21	Yu5 =
22	Yu6 =
23	Yu7 =
24	Yu8 =
25	Yu9 =
26	Yu10 =

Table 17.

(Template B) Search for upper asymptote Yu (b)

B17	fx	= B16 + $B$14
	A	B	C	D	E	F
1	X	qX	(y)	Cum Y	qYu – qY	q(qYu – qY)
2	0.240
3	0.336		2.884	2.884
4	0.470		18.227	21.111
5	0.659		121.618	142.729
6	0.922		227.617	370.347
7	1.291		311.803	682.150
8	1.807		322.696	1004.846
9	2.530		282.214	1287.060
10	3.542		184.899	1471.958
11	4.959		77.487	1549.445
12	6.942		12.463	1561.908
13
14	ΔYu =	15.619		Yu =
15				R ^ 2
16	Yu0 =	1561.908		R ^ 2
17	Yu1 =	1577.527
18	Yu2 =
19	Yu3 =
20	Yu4 =
21	Yu5 =
22	Yu6 =
23	Yu7 =
24	Yu8 =
25	Yu9 =
26	Yu10 =
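The column of estimated upper asymptotes built in Templates A and B can be reproduced as follows. This is a minimal Python sketch of the same arithmetic (Yu0 plus multiples of ΔYu); the variable names are our own.

```python
yu0 = 1561.908                  # last cumulative Y (Cell D12), used as Yu0
d_yu = round(0.01 * yu0, 3)     # active increment, about 1% of Yu0 (Cell B14)

# Yu1..Yu10: same result as copying "=B16 + $B$14" down Column B.
candidates = [round(yu0 + k * d_yu, 3) for k in range(1, 11)]
for k, yu in enumerate(candidates, start=1):
    print(f"Yu{k} = {yu:.3f}")
```

The printed values run from Yu1 = 1577.527 to Yu10 = 1718.098, covering roughly 1% to 10% above Yu0.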

In the next step, referring to Table 18 (Template C), let us calculate Column E and Column F for the estimated upper asymptote Yu1. First, we assign the Yu1 value in B17 to the active Cell E14. The formula for Cell E3 is “=LOG($E$14)-LOG(D3)”. We can copy Cell E3 to Cells E4 through E12 to complete the column. Column F is the log of Column E, e.g., F3 = LOG(E3). Next, referring to Table 19 (Template D), let us calculate the coefficient of determination R ^ 2 in Cell E15. The formula for Cell E15 is “=CORREL(B3: B12, F3: F12) ^ 2”. R ^ 2 is 0.9699. We copy this value into Cell D17, on the same row as Yu1. Next, we calculate the R ^ 2 value for Yu2 by assigning 1593.146 to Cell E14. Once Cell E14 is changed, all the values in Column E, Column F, and Cell E15 change. Cell E15 changes to 0.9823 (not shown in the template). We copy this value into Cell D18, on the same row as Yu2.

Table 18.

(Template C) Search for upper asymptote Yu (c)

E3	fx	= LOG($E$14) – LOG(D3)
	A	B	C	D	E	F
1	X	qX	(y)	Cum Y	qYu – qY	q(qYu – qY)
2	0.240
3	0.336		2.884	2.884	2.7380
4	0.470		18.227	21.111
5	0.659		121.618	142.729
6	0.922		227.617	370.347
7	1.291		311.803	682.150
8	1.807		322.696	1004.846
9	2.530		282.214	1287.060
10	3.542		184.899	1471.958
11	4.959		77.487	1549.445
12	6.942		12.463	1561.908
13
14	ΔYu =	15.619		Yu =	1577.527
15				R ^ 2
16	Yu0 =	1561.908		R ^ 2
17	Yu1 =	1577.527
18	Yu2 =	1593.146
19	Yu3 =	1608.765
20	Yu4 =	1624.384
21	Yu5 =	1640.003
22	Yu6 =	1655.622
23	Yu7 =	1671.241
24	Yu8 =	1686.860
25	Yu9 =	1702.479
26	Yu10 =	1718.098

Table 19.

(Template D) Search for upper asymptote Yu (d)

E15	fx	= CORREL (B3: B12, F3: F12) ^ 2
	A	B	C	D	E	F
1	X	qX	(y)	Cum Y	qYu – qY	q(qYu – qY)
2	0.240
3	0.336		2.884	2.884	2.7380	0.4374
4	0.470		18.227	21.111	1.8735	0.2726
5	0.659		121.618	142.729	1.0435	0.0185
6	0.922		227.617	370.347	0.6294	−0.2011
7	1.291		311.803	682.150	0.3641	−0.4388
8	1.807		322.696	1004.846	0.1959	−0.7080
9	2.530		282.214	1287.060	0.0884	−1.0537
10	3.542		184.899	1471.958	0.0301	−1.5217
11	4.959		77.487	1549.445	0.0078	−2.1080
12	6.942		12.463	1561.908	0.0043	−2.3646
13
14	ΔYu =	15.619		Yu =	1577.527
15				R ^ 2	0.9699
16	Yu0 =	1561.908		R ^ 2
17	Yu1 =	1577.527		0.9699
18	Yu2 =	1593.146
19	Yu3 =	1608.765
20	Yu4 =	1624.384
21	Yu5 =	1640.003
22	Yu6 =	1655.622
23	Yu7 =	1671.241
24	Yu8 =	1686.860
25	Yu9 =	1702.479
26	Yu10 =	1718.098

We sequentially change the Yu value in Cell E14 from Yu1 to Yu10 and record the corresponding R ^ 2 in Column D, as shown in Table 20 (Template E). This template gives the calculation with Yu7 = 1671.241. Referring to Table 20 (Template E), by plotting R ^ 2 vs. Yu, that is, Column D (D17: D26) vs. Column B (B17: B26), we obtain Figure 6c. In Column D (D17: D26), we locate the maximum coefficient of determination R ^ 2 as 0.9926; its corresponding estimated Yu is Yu7 = 1671.241. This is also indicated in Row 23, with Yu7 = 1671.241 and R ^ 2 = 0.9926.

Table 20.

(Template E) Search for upper asymptote Yu (e)

E14	fx	=1671.241
	A	B	C	D	E	F
1	X	qX	(y)	Cum Y	qYu – qY	q (qYu – qY)
2	0.240
3	0.336		2.884	2.884	2.7380	0.4374
4	0.470		18.227	21.111	1.8735	0.2726
5	0.659		121.618	142.729	1.0435	0.0185
6	0.922		227.617	370.347	0.6294	−0.2011
7	1.291		311.803	682.150	0.3641	−0.4388
8	1.807		322.696	1004.846	0.1959	−0.7080
9	2.530		282.214	1287.060	0.0884	−1.0537
10	3.542		184.899	1471.958	0.0301	−1.5217
11	4.959		77.487	1549.445	0.0078	−2.1080
12	6.942		12.463	1561.908	0.0043	−2.3646
13
14	ΔYu =	15.619		Yu =	1671.241
15				R ^ 2	0.9926
16	Yu0 =	1561.908		R ^ 2
17	Yu1 =	1577.527		0.9699
18	Yu2 =	1593.146		0.9823
19	Yu3 =	1608.765		0.9877
20	Yu4 =	1624.384		0.9904
21	Yu5 =	1640.003		0.9918
22	Yu6 =	1655.622		0.9924
23	Yu7 =	1671.241		0.9926
24	Yu8 =	1686.860		0.9924
25	Yu9 =	1702.479		0.9920
26	Yu10 =	1718.098		0.9914
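The search in Templates C to E can be sketched outside Excel as follows. This Python sketch is an illustration under stated assumptions: it uses the rounded X and Cum Y values printed in Table 14, takes Xb as the nonlinear zero so that qX = log X, and the helper names `r_squared` and `r2_for` are hypothetical. The R ^ 2 values may differ from the worksheet in the last digit because of input rounding.

```python
import math

def r_squared(xs, ys):
    """Square of the Pearson correlation, the analogue of CORREL(...)^2."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)

# Columns A and D of the templates (nonlinearly grouped data, Table 14).
X = [0.336, 0.470, 0.659, 0.922, 1.291, 1.807, 2.530, 3.542, 4.959, 6.942]
Y = [2.884, 21.111, 142.729, 370.347, 682.150,
     1004.846, 1287.060, 1471.958, 1549.445, 1561.908]

qX = [math.log10(x) for x in X]                      # Column B

def r2_for(yu):
    """R^2 of q(qYu - qY) vs. qX for one estimated upper asymptote Yu."""
    col_e = [math.log10(yu) - math.log10(y) for y in Y]   # Column E
    col_f = [math.log10(e) for e in col_e]                # Column F
    return r_squared(qX, col_f)

yu0, d_yu = Y[-1], 15.619
scan = [(round(yu0 + k * d_yu, 3), None) for k in range(1, 11)]
scan = [(yu, r2_for(yu)) for yu, _ in scan]
best_yu, best_r2 = max(scan, key=lambda pair: pair[1])

for yu, r2 in scan:
    print(f"Yu = {yu:.3f}  R^2 = {r2:.4f}")
print("optimal Yu:", best_yu)
```

Because the maximum of R ^ 2 is quite flat around Yu6 to Yu8, the located optimum should be read as an estimate within the resolution of ΔYu.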

Next, we plot Column E vs. Column A for (qYu – qY) vs. X on a linear by linear graph, shown as Figure 6d. It is a transitional graph that plots mixed numbers (nonlinear numbers (qYu – qY) and linear number zero) in a linear scale. We then convert both axes from linear into nonlinear logarithmic scale to give Figure 6e, which is the proportionality graph without the trendline equation and R ^ 2.

By right-clicking on the data set in Figure 6e, followed by selecting Format Trendline → select Power in trendline options → select Display Equation on Chart → select Display R-squared value on Chart → then Close, we obtain Figure 6f with the trendline equation y = 0.532X^-1.62 and R ^ 2 = 0.9926. The equation y = 0.532X^-1.62 means (qYu – qY) = 0.532X^-1.62. Taking the log and the differential of both sides of the equation, we get

q(qYu − qY) = -1.62qX + q(0.532) and d(q(qYu − qY)) = -Kd(qX), with K = 1.62, C = 0.532

Figure 6f provides the physical meaning that the cumulative lung cancer mortality Y is negatively proportional to the cumulative radon intensity X; or specifically, the nonlinear change of nonlinear face value (qYu – qY) is negatively proportional to the nonlinear change of nonlinear face value (X – Xb).
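The constants K and C of the power trendline can be checked with an ordinary least-squares line through q(qYu – qY) vs. qX. The following Python sketch is an illustration, assuming the rounded data of Table 14, Yu = 1671.241, and Xb taken as the nonlinear zero; the helper name `fit_line` is hypothetical, and we assume the power trendline corresponds to this log-log fit.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept of ys against xs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, my - slope * mx

X = [0.336, 0.470, 0.659, 0.922, 1.291, 1.807, 2.530, 3.542, 4.959, 6.942]
Y = [2.884, 21.111, 142.729, 370.347, 682.150,
     1004.846, 1287.060, 1471.958, 1549.445, 1561.908]
yu = 1671.241                                    # optimal upper asymptote (Yu7)

qX = [math.log10(x) for x in X]
qqY = [math.log10(math.log10(yu) - math.log10(y)) for y in Y]

slope, intercept = fit_line(qX, qqY)
K = -slope                 # proportionality (rate) constant
C = 10 ** intercept        # position (integral) constant
print(f"K = {K:.2f}, C = {C:.3f}")
```

The fitted line has the form q(qYu – qY) = -K qX + qC, so K and C can be read directly from the slope and intercept.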

Discussions

In life and biomedical sciences, the data are mostly nonlinear and thus require careful nonlinear data collection and analysis. In principle, for a 2-variable analysis, we need to collect a minimum of 7 data points (in exceptional cases of excellent data, we may get by with 6 data points) to account for skewed-bell and sigmoidal data lines. In designing experiments, we need to select the independent variable X consistently, in either a linear or a nonlinear fashion, but not with a mix or at random.

We need to emphasize the use of the primary graph rather than the primitive elementary graph in data analysis. The primitive elementary graph can, at most, provide the peak information, but not the essential rate of change and proportional relationship between 2 variables. To obtain the essential rate of change and proportional relationship, we need to compare one cumulative number with another cumulative number. For cases with a higher order of nonlinearity, we can generate a series of 4 graphs to describe the complete nonlinear phenomenon: a primitive graph with a skewed-bell curve, a primary graph with a sigmoid curve, a leading graph with a parabolic curve, and a proportionality graph with a straight line. In other words, based on the primary graph, we may extend the analysis to get a simple straight line and a proportionality equation that represent the rate of change and the proportionality relationship of the 2 variables in a proportionality graph.

It is important for all researchers to learn how to distinguish between a primitive elementary graph and a primary graph. We cannot use a primitive elementary graph to build a dose-response mathematical relationship, because each elementary “y” has no mathematical connectivity, and one elementary number cannot mathematically relate to the other cumulative numbers.⁹ Instead, we must relate one cumulative number to the other cumulative numbers and use the primary graph for mathematical analysis, where both the continuous numbers Y and the continuous numbers X exist as cumulative Y and cumulative X. Cumulative numbers mean the existence of connectivity.
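As a small illustration of the step from elementary numbers to cumulative numbers, the sketch below builds cumulative Y from a column of elementary y by a running sum, which is how the Keywords section defines Y. All numbers are hypothetical and chosen only to give a skewed-bell shape; they are not taken from the Cohen data.

```python
import numpy as np

# Hypothetical elementary numbers y (skewed-bell shape), one per equal step in X.
x_step = np.array([1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0])
y_elementary = np.array([2.0, 9.0, 14.0, 10.0, 6.0, 3.0, 1.0])

# Cumulative numbers: each entry is the running total up to that row.
X_cumulative = np.cumsum(x_step)
Y_cumulative = np.cumsum(y_elementary)

for X, Y in zip(X_cumulative, Y_cumulative):
    print(X, Y)
```

Plotting y_elementary against X_cumulative gives the primitive elementary graph, while plotting Y_cumulative against X_cumulative gives the rising curve of the primary graph.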

Summary

We analyzed the relationship between the lung cancer mortality and the indoor radon intensity from the viewpoint of nonlinear mathematics. We conclude that their relationship is governed by the proportionality law where the cumulative lung cancer mortality Y is negatively proportional to the cumulative radon intensity X; or specifically, the nonlinear change of nonlinear face value (qYu – qY) is negatively proportional to the nonlinear change of nonlinear face value (X – Xb).

Traditional XY math is insufficient to describe nonlinear phenomena; we need to extend the XY math into the αβ Math to account for the existence of asymptotes, i.e., we need to extend XY = {(X),(Y)} into αβ = {α(Y, Yu, Yb), β(X, Xu, Xb)}. The αβ Math classifies continuous numbers into linear and nonlinear numbers. Nonlinear numbers are associated with asymptotes, and when measuring changes, we measure their changes relative to their asymptotes.

In data analyses, we must always account for the origin (either the linear zero or the nonlinear zero). In data collection, the independent variable X must be consistent: X needs to be either linear numbers or nonlinear numbers, but cannot be a mixed arrangement.

The Alpha Beta (αβ) Math is a science for connecting a straight line to parabolic, sigmoid, and various bell curves in biomedical and physical sciences. We provide examples for building Excel Templates to solve for upper asymptotes and building a straight-line proportionality equation.

Data in life and biomedical fields must obey the law of nature. The experimental law dictates that the response be either proportional to or negatively proportional to the dose. We can build a straight-line proportionality relationship for the dose-response data in log-linear and log-log graphs. This article demonstrates a straightforward methodology for solving the key upper asymptotes for the proportionality equation using Microsoft Excel via determining the “coefficient of determination”. All examples include a systematic demonstration of Excel data manipulation and extensive graphing.

Footnotes

Appendix A Cohen’s Data Analysis 1

Appendix B “Nonlinear Change” or “Change” in Proportionality Graph

The proportionality graph has a straight line that is expressible with an equation. Figure B1 gives the proportionality graph for Y = X in linear-linear scale, where the change of linear numbers Y is proportional to the change of linear numbers X. Figure B2 gives the proportionality graph for qα = 1/qβ in log-log scale, where the nonlinear change of face value (Y – Yb) is negatively proportional to the nonlinear change of face value (X – Xb). We obtain this graph by first plotting both face values in a linear-linear scale and then converting both scales from linear to logarithmic. As a result, the data take the true values q(Y – Yb) and q(X – Xb). Consequently, the equation is also read as: the change of nonlinear true value q(Y – Yb) is negatively proportional to the change of nonlinear true value q(X – Xb). Figure B3 gives the proportionality graph for y = 20θ^(-0.05X) in log-linear scale, where y = (Yu – Y), C = 20, and K = 0.05. We obtain this graph by first plotting the face value (Yu – Y) versus X in a linear-linear scale and then converting the vertical scale to a logarithmic scale. As a result, the data take the true value q(Yu – Y) versus X. Consequently, the equation is also read as: the change of nonlinear true value q(Yu – Y) is negatively proportional to the change of X.

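The general differential and integral equations of Figure B1 are

dY = K dX (Ba) and Y = KX + C

The general differential and integral equations of Figure B2 are

d(q(Y – Yb)) = -K d(q(X – Xb)) (Bb) and q(Y – Yb) = -K q(X – Xb) + qC

The general differential and integral equations of Figure B3 are

d(q(Yu – Y)) = -K dX (Bc) and q(Yu – Y) = -KX + qC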

For linear numbers, we read the differential equations as simple change. E.g. we read Eq. (Ba) as “the change of Y is proportional to the change of X”.

For nonlinear numbers, we have 2 ways to read the equations. We can address “d” first or address “q” first. The “q” implies the nonlinear change. In Eq. (Bb), we can address the “d” first by saying “the change of nonlinear true value q(Y – Yb) is negatively proportional to the change of nonlinear true value q(X – Xb)”. When we address the “q” first, we say “the nonlinear change of face value (Y – Yb) is negatively proportional to the nonlinear change of face value (X – Xb)”.
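The log-linear form of Figure B3 can also illustrate how an upper asymptote Yu might be located from the coefficient of determination. The sketch below is not the Excel template used in this article. It assumes synthetic data generated from y = 20θ^(-0.05X) with an assumed Yu of 100, tries a grid of candidate Yu values, and keeps the one for which q(Yu – Y) versus X is closest to a straight line. The function names and the candidate grid are hypothetical.

```python
import numpy as np

def r_squared(x, y):
    """Coefficient of determination of the least-squares line y = a*x + b."""
    slope, intercept = np.polyfit(x, y, 1)
    ss_res = np.sum((y - (slope * x + intercept)) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def scan_upper_asymptote(x, y, candidates):
    """Return the candidate Yu that makes log10(Yu - Y) versus X most linear."""
    scores = [r_squared(x, np.log10(yu - y)) for yu in candidates]
    return candidates[int(np.argmax(scores))]

# Synthetic data (assumption): Figure B3 equation with Yu = 100, C = 20, K = 0.05.
X = np.arange(0.0, 70.0, 10.0)                # 7 points: 0, 10, ..., 60
Y = 100.0 - 20.0 * 10.0 ** (-0.05 * X)

# Candidate asymptotes from 99.99 to 105.00; every one lies above max(Y).
candidates = np.arange(9999, 10501) / 100.0
Yu = scan_upper_asymptote(X, Y, candidates)

# With Yu fixed, the straight line gives K (from the slope) and C (from the intercept).
slope, intercept = np.polyfit(X, np.log10(Yu - Y), 1)
K, C = -slope, 10.0 ** intercept
print(Yu, round(K, 4), round(C, 2))
```

Every candidate must exceed the largest observed Y, otherwise the logarithm is undefined. With noisy data, the R-squared peak is flatter and a finer grid around the best candidate may be needed.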

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Ralph W. Lai

References

1. Cohen. Test of the linear-no threshold theory of radiation carcinogenesis for inhaled radon decay products. Health Phys. 1995;68(2):157–174.

2. Curry. Clinical Pharmacokinetics—The MCQ Approach. The Telford Press; 1987:39–40.

3. Stebbing ARD. Adaptive response account for the β-curve—hormesis is linked to acquired tolerance. Nonlinearity Biol Toxicol Med. 2003;1(4):493–511.

4. Hedayatkhah, et al. The Relationship Between the Fructose Concentration and the Enzyme Activity. Universiteit Van Amsterdam; 2017.

5. Lai. Get more information from flotation-rate data. Chem Eng. October 19:181–182. www.Researchgate.net

6. Lai. Unlock the mystery of dose-response and pharmacokinetics data analysis. 2020. www.Researchgate.net

7. Lai. A Text Book on Nonlinearity in Life and Biomedical Sciences. The Cornerstone Company; 2011:316. Hard copy book available at www.lulu.com

8. Lai, Richardson. Unified proportionality equation for modeling biological and pharmacological data. In: Proceedings, 11th IEEE Symposium on Computer-Based Medical Systems, Lubbock, TX, June 12-14, 1998:104–109.

9. Lai. The wonderful mathematical connectivity of the Alpha Beta (αβ) math. (Science of connecting straight lines to parabolic, sigmoid, and various bell curves in biomedical and physical sciences). 2015. www.Researchgate.net

10. Lai. What is the continuous nonlinear numbers, what is the nonlinear zero? 2014. www.Researchgate.net

11. Cohen. Data Set SHORT92. University of Pittsburgh; 1992.

12. Masters. Introduction to Environmental Science and Technology. John Wiley & Sons; 1974:315–316.