Abstract
This note starts by discussing what is regarded as the pioneering work in proportional–integral–derivative controller tuning by Ziegler and Nichols. The basic concepts in their paper, rather than the specific tuning rules they presented, are held to be their major contributions. It is then explained with some brief theory how the ideas have been developed and how with modern technology the loop cycling method has been automated.
I. Introduction
Proportional–integral–derivative (PID) or three-term controllers are still extensively used in the process industries although their technical design is far different from their early use around 80 years ago (see Edwards and Otterson 1 – Tech Talk for some basic material). For their use in a feedback control loop, the three parameters P, I and D need to be set to achieve a satisfactory performance, and the first major contribution to the topic was the study by Ziegler and Nichols. 2 Ziegler and Nichols, often referred to as Z and N, worked for the Taylor Instrument Companies in the United States, the former being in the Sales Engineering Department and the latter in the Engineering Research Department. The paper is well worth reading not only for the contribution of Z and N but, since the paper was presented at the annual meeting of the ASME in New York City in December 1941, also for the discussion comprising around 30% of the total, to which four other industrialists contributed – a topic sadly missing from today’s publications.
To quote first from the abstract ‘These units form the basis of a quick method for adjusting a controller on the job’ and also from the introduction ‘The paper will thus first endeavour to answer the question: “How can the proper controller adjustments be quickly determined on any control application”’ No process mathematical models are introduced in the paper, so I believe the paper might have resulted from a conversation between Ziegler and Nichols which might have gone along the lines. Ziegler – when I visit plants where our controllers are being used, the users are often not happy with the system response, what simple procedure can I suggest to them to improve the situation? Nichols – let me think about that and do some experiments. The result is the paper which presents two suggestions after describing some experiments.
Unfortunately, not a great deal is stated about the process on which the experiments were performed. It is stated that To simplify terminology we will take the most common type of control circuit in which a controller interprets the movement of its recording pen into a need for corrective action, and, by varying its output air pressure, repositions a diaphragm-operated valve.
P control is first discussed on what is then referred to as a ‘typical application’ which, apart from possible valve nonlinearity, is generally believed to have had a mathematical model near to that of first-order plus dead time (FOPDT). What would now be referred to as the ‘P’ gain is called ‘sensitivity’ and the ‘proportional band’ as ‘throttling range’. Pen recorder plots are given for both disturbance and load change responses as the sensitivity is changed, and the term amplitude ratio is used to refer to the relative magnitude of oscillations in the response, although it is not entirely clear whether it is overshoot to overshoot or overshoot to undershoot. An amplitude ratio of one is a continuous oscillation, and the sensitivity giving this is referred to as the ‘ultimate sensitivity
Z-N method 1 parameters
Z-N: Ziegler and Nichols; PID: proportional–integral–derivative.
Here, K is the controller gain, Ti is the integral time constant and Td is the derivative time constant, for an assumed ideal controller transfer function of
(Note that it is not entirely clear from the paper that this was the form of controller assumed to be fed by the error.)
The next step in the paper was to discuss how the parameters might be set from the ‘process reaction curve’, namely, from the step response of the process. They drew a ‘reaction curve’, which is defined as being approximately ‘S’ shape (bear in mind time cannot move backwards!). A tangent was then drawn to the curve at its maximum slope, which is designated, R, and the distance below the axis where this meets the time origin was denoted RL, so L is a delay measure for the response. From these experiments, the controller parameters of Table 2 were suggested.
Z-N method 2 parameters
Z-N: Ziegler and Nichols; PID: proportional–integral–derivative.
Note that in the above two tables, the ratio of the gains in the first column and the time constants are consistent and that for the PID, the ratio of the integral to the derivative time constant is taken to be 4. Furthermore, for the rules to be identical requires (as mentioned by Z and N but taken no further)
II. Relevance of the Z and N Paper
Viewed from today’s technology, one wonders why such imprecise results are still quoted. Although the wording is not used in the Z-N paper, they are today referred to as tuning rules, and they still provide the basis for ‘on the job’ controller settings. Both may be regarded as simple process ‘identification’ and then the setting of controller parameters based on the identification. The setting of the parameters is independent of the ‘identification’, and one could, therefore, use a different choice to Z-N.
The identification in the first approach, the loop cycling method, is of two process parameters namely,
Z-N loop cycling method parameters
Z-N: Ziegler and Nichols; PID: proportional–integral–derivative.
Note that
The second method, namely, obtaining the process step response to ‘identify’ it, was more familiar to control engineers at the time, and Z-N suggested one way of using this information. If the process model is taken as the FOPDT transfer function
then it is easy to show that
Z-N method 2 based on FOPDT parameters
Z-N: Ziegler and Nichols; FOPDT: first-order plus dead time; PID: proportional–integral–derivative.
There are obviously various alternatives which might be used with this method in terms of fitting other models to the values in Table 2 or obtaining models, FOPDT or others, to fit a step response by alternative approaches. In practice, one has to obtain the step response of the process, which may not always be possible, and noise in the measurement may be a problem.
III. Time Scaling and the FOPDT Process
Since there are many other tuning algorithms given in the literature, 3 many of which are based on the assumption of an FOPDT process model, it is probably worth a slight digression here to discuss time scaling and the FOPDT process.
Amplitude and time scaling were very familiar to users of an analogue computer, where the latter was very useful according to the measurement device, for example, a slow plotting table or an oscilloscope. If for the FOPDT transfer function of equation (2), a normalised s, sn, is taken equal to sT and with ρ = τ/T, then the transfer function becomes
which can be referred to as a normalised FOPDT transfer function. The normalised transfer function has a unit time constant and only two parameters Kp and ρ. The actual system of equation (2) has a step response which is T times slower and markings on its frequency response T times smaller than that of equation (3). If this normalised plant is controlled by the ideal PID controller of equation (1) and fed with the error signal, then this becomes
where
where Kn = KpK. This means that if the controller parameters are designed based on some property of the open- or closed-loop transfer function, the results will be of the form
Thus, for any FOPDT plant, the controller parameters must be of the form
for the same performance property (i.e. time response) to be maintained for all FOPDT plants. This will be referred to as consistent tuning. Method 2 of Z-N is easily seen to be consistent with a simple choice for the functions f1, f2 and f3 of being inversely proportional, proportional and proportional to ρ, respectively. It is also easy to show that method 1 is also consistent. Further discussion can be found on this topic in Atherton 4 but suffice to say here that not all the tuning algorithms given in O’Dwyer 3 are consistent.
IV. More on Loop Cycle Tuning and Critical Point Design
The loop cycling method has many desirable features, particularly the fact that the ‘identification’ is a closed-loop one, and the critical point provides useful information about the plant. Obviously, it is only applicable to processes where
Even if the loop were linear, the fact that many processes have very long time constants makes it extremely difficult and time-consuming to try and find the gain, Kc;
It can be dangerous if there is no satisfactory limiting effect in the loop as an adjustment to an overestimated value of Kc can result in a large oscillation;
The oscillation may be quite noisy making obtaining good measurements possibly difficult or time-consuming;
Many practical loops are nonlinear. Saturation is helpful in limiting the amplitude of the oscillation, but dead zone effects may make finding Kc even more difficult.
With the advent of microprocessor controllers, Astrom and Hagglund, 5 possibly influenced by discussions with Mike Somerville the Technical Director of Eurotherm, suggested a much more suitable method of practical implementation for estimating the critical point. The method, which has become known as relay autotuning, involves replacing the P term by an ideal relay function to obtain a limit cycle. It can then easily be shown using describing function (DF) analysis 6 that the frequency of the limit cycle, ωo, is approximately the critical frequency, ωc, and the critical gain, Kc, is given approximately by Kc = 4h/aπ, where 2h is the peak-to-peak amplitude of the relay output, and a is the fundamental frequency amplitude of the limit cycle at the relay input. It can be shown that the estimate for ω c will be better than for Kc and that the results will be better the nearer the limit cycle at the relay input is to a sinusoid. The error introduced by replacing a by half the peak-to-peak amplitude of the limit cycle, which is normally done in practice for ease of measurement, is usually quite small. The fact that the limit cycle amplitude can be controlled by the relay is useful in practice and also provides a mechanism for looking at the linearity of the process, provided time is available for repeated experiments.
Thus, with a more practical approach for estimating the critical point, it is appropriate to comment further on the use of the critical point in PID tuning. First, what is the principle of the Z-N method 1 in using the critical point for tuning? Since all that is known about the plant is its critical point, then all one can do theoretically in selecting the controller parameters is to place this frequency at a known point on the compensated open-loop frequency response locus. To do this, one only has the freedom to vary two parameters, and in the Z-N method, this is done by taking Ti equal to 4Td. This corresponds to the two zeros of the PID controller transfer function of equation (1) being real and equal. It is not entirely clear why this ratio of 4 for the two time constants was taken, although it seems to be a reasonable choice in many instances. My former colleague Mike Somerville, however, delighted in the fact that the two zeros being equal made sketching of the controller Bode diagram easy. It is easy to show that the point given by the Z and N tuning rules on the compensated locus is 0.5 arg − 180° for the P controller, 0.46 arg − 192° for the PI controller and 0.66 arg − 155° for the PID.
This concept is a useful design approach, and if felt appropriate, a different point can be chosen within the allowable range and an alternative to 4 can be selected for the time constant ratio. It is easy to show with Ti/Td = 4, that is, for the FOPDT plant, the controller parameters are obtained from the following equations for moving the critical point to g arg − (180 − ϕ)°
Critical point design is a useful concept since closed-loop performance is very dependent on the open-loop frequency response in the region of the Nyquist critical point (−1, 0), as clearly recognised by Z and N. Certainly, if one has little knowledge of the plant dynamics, it can be very useful as illustrated here for a plant with transfer function G(s) and a possible reduced order model Gr(s), for which the step responses are shown in Figure 1 . The difference between the step responses is very small, and it would be difficult to detect with noisy measurements. However, the frequency responses shown in Figure 2 are quite different around the Nyquist critical point; indeed, G(s) has a finite gain margin while that of Gr(s) is infinite. Hence, a major problem in using step response data for design if one does not have some idea of the process dynamics (note the G(s) chosen was third order but not a typical process plant mathematical model 4 ). The major problem with a step response is that the higher frequency information is contained in the early part of the response, and variations may be difficult to detect in the presence of noise.

Step responses of G(s) and Gr(s)

Frequency responses of G(s) and Gr(s)
Before concluding this section, it is probably worth mentioning that by including known networks in the loop with the relay, or using a relay with known hysteresis, additional information can be obtained on the process frequency response and/or more accurate measurements found. 6
V. Further Comments on PID Control
The number of papers and books written on PID control, mainly by academics, is huge, with many containing little of significance for the practising engineer. In practice, the ideal PID controller of equation (1) is never used, but its simple form allows ease of analysis and understanding. A pure D term is never used as theoretically it gives infinite gain at low frequency. It is normally replaced by
where a typical and easy design uses the derivative zero to ‘cancel’ a pole of the process.
The objective of selecting the PID parameters is to meet the specifications given for the control loop. These may, for example, be numerous, very simple or possibly difficult to translate into loop dynamic behaviour specifications. They might be easily met with many choices of parameters or very difficult to meet. They may refer to performance when regulating, typically with respect to disturbances or noise, or to changes in set point. With today’s microprocessor controllers, regulation parameters may be chosen differently from those for a set point change, and indeed, bearing in mind the Z and N statement regarding automatic reset, the integral term may be excluded except when near set point. Unlike other areas of control engineering, where typically it is expected that a good analytical designer can design a suitable controller using a mathematical model of the process with any one of many approaches, the PID field has become overburdened with so-called tuning rules. These are typically given whether the starting point is a mathematical model of the process or Z and N concepts, may be aimed at different specifications, which are often not clear, often compared inappropriately, and with dubious benefits. On the technical side, many controllers have one of several possible autotuning algorithms. Often they assume a model of a specific form for the process, so the manufacturer should be consulted before use on other than suggested processes.
VI. Concluding Comments
The purpose of this note has been to explain the valuable concepts in the early work of Z and N on PID tuning. Their formulae are regularly quoted but rarely is attention drawn to the simple experimental work with its imprecise details in terms of today’s technology. The contribution, however, is rightly regarded as very significant for the concept of the loop cycling method and the information it can yield and also for obtaining a possible mathematical model from simple tests on the process. This latter concept has now grown in control engineering into the huge field of identification.
Footnotes
Funding
The author(s) received no financial support for the research, authorship and/or publication of this article.
