Abstract

This article continues our series on probabilistic thinking and the evaluation of therapies, 1700–1900
The series will continue to appear as separate articles in the forthcoming issues of JRSM
Books by Lind, Gregory, Haygarth and Black quoted earlier in this series had all been translated into German by the end of the 18th century – but not into French (with the exception of Lind). I have been unable so far to find any of their methodological probabilistic passages referred to in the wider contemporary and early 19th-century German literature (German medicine was trapped for a time by the speculative philosophical systems of romantic medicine). 1 This had changed by the mid-1830s and Paris’s new hospital medicine attracted open-minded, frustrated German students after the end of the Napoleonic period. The emphasis was on clinical examination, including the ultra-modern auscultation, and Louis’s anatomo-clinical outlook based on large numbers of patients and bodies, respectively. Naturally, from the 1830s onwards, they came across the méthode numérique in one way or another, and some were also aware of the associated academic debates. That explains why we find references to these issues by German doctors from this latter period onwards, for instance by Jacob Henle.
But not only to fill the deficiencies of both parts should empiric and rational medicine be linked to each other, but to foster one another where both can be applied simultaneously. (Transl. from Henle 1844, p. 34).
Henle’s warning of the danger of falling into the post-hoc-ergo-propter-hoc fallacy by basing one’s practice on successful single cases was also raised by Blane (1819, p. 226) and Guy (1860, p. 554).
Henle moved to Heidelberg in 1844, the year that Gavarret’s landmark book came out in German. Two years later, Henle was the first German I have been able to trace so far who referred to it. And this was the only precisely quoted reference in Henle’s 19-page text ‘On doctors’ methods’ at the beginning of the introduction to the first volume of his Handbuch der rationellen Therapie (Handbook of rational therapy, 1846). In admirably worded sentences, he summarised the contemporary epistemological basis of therapeutics and, in a farsighted way, looked ahead. Of course, he also came to speak of Louis, whose Numerical or statistical method […was] the only one the application of which might let us expect some advantage of empirical medicine, for [and here he paraphrased Gavarret] claims derived from experience never feature logical certainty, but only a major or minor grade of probability, […]and even the so-called laws of nature have only the highest grade of probability. [As to therapeutics, Henle pointed out that] ‘numbers only determine the grade of probability with which we can deduce a given effect from a given cause and which may entitle us to prophesy the same effect from the same cause in the future (Transl. from Henle, 1846, pp. 12,15). Men, who are as familiar with the value as with the shortcomings of medical statistics, want[ed] to base their calculations on nothing else than upon hospital practice (p. 17).
Finally, he did not eschew … the cliffs that lay in empiric medicine … The less control a doctor is to be afraid of and the more splendid the rewards are in this world … the nearer is the danger that not only the superficiality of self-deception, but also true, mean fraud obfuscate facts so that the course of their successors is lead astray (p. 17).
Henle’s entire methodological introduction was written against the obviously prevailing strict separation of the empirical from the rational method – another age-old issue (Matthews, 2020a). It was one of the rare pleas for rational empiricism as it had been propagated by 18th-century British medical arithmetical observationists: ‘Both were made ready to amble henceforth friendly close side by side’ (Henle, p. 19).
Henle did not follow this track further. After all, he was a professor of anatomy and not a clinician. He later worked in Göttingen, and he was soon to acquire a worldwide reputation. Henle’s loops in the kidneys are just one example of his many contributions. But younger German clinicians (who might have read this early book of his during their studies), took up Gavarret. In Tübingen particularly, a network established itself from the mid-1840s around Carl Wunderlich.
Methodology for evaluation: A first Tübingen circle
If the numerical method, provided it is correctly used, may have some value … for the diagnostic and prognostic significance of some phenomena, it certainly is devoid of any use/profit, a drawback even, for the decision about pathological and therapeutic problems. … How can one altogether dare to determine a therapy with it? (Transl. from Wunderlich, 1844, pp. 41–42). … explicitly without any selection [he did not say how] and with undaunted tenacity until death. I could not help the impression that we live in times more barbaric than when criminals sentenced to death were used for [testing] operations or for physiological experiments. Medicine’s first duty is indeed scientific research; however, all his objects should be more holy to a doctor than to an entomologist, who transfixes his beetles without mercy (Transl. from Wunderlich, 1841, pp. 41–43).
Of course, he referred to Louis and criticised him for often having asked the wrong questions, for example concentrating, crudely, only on the final result, on ‘cure or death’. This also made clear that statistics had so far not achieved much. Wunderlich also criticised Gavarret for his quest for 400 cases, because this made ‘any application of statistics impossible’ (Wunderlich, 1851, p. 111). He might have understood all this from discussing the methods of evaluation research with an already potent former collaborator in his Tübingen clinic, Wilhelm Griesinger.
Griesinger was all in favour of numerical and statistical methods. Quoting Guy (Griesinger, 1848), he noted: … the “sometimes” of the prudent – […] – is the “often” of the sanguine, the “always” of the empiric and the “never” of the sceptic; the numbers 10, 100, 1000 [however] have the same meaning for everybody (p. 6).
But anyhow, figures, even mathematically calculated valid differences between groups à la Gavarret, were not everything; they needed interpretation: who died, what of, and when did he die? In that sense, smaller groups could also be valuable. And there was the problem of the relevance of mean values for the individual case. Much remained to be done (p. 22)!
Although approved of, Louis was also criticised by yet another friend of Wunderlich’s and Griesinger’s, Friedrich Oesterlen. All three had studied together in Tübingen and had visited Paris and Vienna in the 1830s.
Oesterlen had also read Gavarret. He now aimed to set the issue of statistics in a theoretical context by applying in medicine (pVI) the teaching of J. Stuart Mill’s System of Logic (1846). 2 Oesterlen’s book dealt with medical observation, the concepts of induction, deduction, generalisation, experiment, experience and statistics. But valid scientific results consisted in the discovery of causation, not just in the discovery of statistical correlations.
He wrote about the contribution of statistics in general terms as ‘the essential step in our research on truth based on experience’. This held as long as one kept to the rules of extremely precise observation, compared comparables, considered the natural course of diseases, collected large numbers to establish high grades of probability. He was very cautious about generalisations (Oesterlen, 1852, pp. 129–140) and hasty conclusions, as had been the case with Louis. These could lead to nonsense, and the general error of internal medicine was the post-hoc-propter-hoc fallacy. Comparison was needed (pp. 129–140).
This work would prove to be a major contribution to the methodological discussion in Germany. 3 However, neither Oesterlen nor Wunderlich mentioned the calculus of probabilities in this context, in contrast to their friend Griesinger. It was beyond their horizon at this time.
Having thus initiated probabilistic thinking into therapeutic evaluation early in their careers, these three Tübingen friends moved on to further responsibilities: Wunderlich went as chief of internal medicine to the University of Leipzig. Griesinger succeeded him in the Tübingen chair; he turned more and more to reforming psychiatric care. Oesterlen did not succeed academically in Germany. After much publishing on various issues he retired to private practice, eventually in Switzerland.
But the Archiv für Physiologische Heilkunde, edited after Wunderlich by Griesinger and now by another Tübinger, the physiologist Karl Vierordt, continued to open its pages for a major contribution to the field from
Schweig started his article by clarifying definitions: medical statistics were for him a special method for drawing conclusions (Schlussziehung) (pp. 307–309). He had clearly read Jacob Bernouilli, Poisson, and Gavarret (pp. 322–323), and he wrote at length on the establishment of arithmetic averages (means) of groups of cases. Such averages were only of any significance if compared with other averages. And for a valid comparison, it was necessary to know their ‘their limits of oscillation’ (according to Gavarret). These had to be as small as possible. But the exact determination of the sufficient number of cases or groups to achieve this (by the method of least-squares) was too complicated. Therefore ‘the probability is to approach certainty [simply] by further observations or experiments’ (pp. 330–331). Thus, finally, he set up the following rules:
Know the state-of-the-art and ask a precise and sharply limited question. Collect well-observed cases according to a plan defined by the question, yet do not select them in ways that are biased by a preconceived idea. Form groups and calculate averages (means). Draw conclusions based on calculations that accord with clearly stated conditions. As to the validity of a conclusion, be aware that it always depends only on a probability, and that it is provisional until other works performed under similar conditions achieve the same result (replication), wherewith it rapidly approaches certainty (pp. 351–355).
These rules were certainly clarifying, but they were not acknowledged by the medical world. Schweig was not quoted by a group of contemporaneous, yet somewhat younger mathematician-physicist-physiologist-physicians who advanced the methodology by developing tests of significance for assessing the meaning of differences between groups.
Testing the validity of comparisons, 1858–1877
All these theoretical contributions of the 1840s and early 1850s reflected probabilistic thinking in the unconscious (Wunderlich) and conscious, pre-mathematical modes (Henle, Griesinger, Oesterlen, Schweig). In the next two decades, two generations of younger men acted in compliance with the formal, mathematical mode.
A storm in a teacup ensued over the next few years. The opposition was led by physiologists of Radicke’s generation such as Karl Vierordt (*1818), the newly appointed professor of physiology at Tübingen), and Friedrich Wilhelm Beneke (*1824), who acted also as Kurarzt (spa doctor), was still a Privatdozent with vested interests, which added confusion.
They argued that effects were due to a ‘logic of determining facts’ and that probabilistic mathematics was a valid, but purely formalistic form of medical statistics without substance in the real world. This line of argument was later used also by Claude Bernard. The participants in this debate did not understand what Radicke’s approach was about. In the end, determinism prevailed; Radicke and his test disappeared from the German literature. 4
Mathematical aids are handily presented to medical researchers ready for use. […] Yet they still don't make comprehensive use of them. And, after quoting from puzzling Beneke at length, he added: Yes, even quite frequently, weighty voices have risen against them in principle (Transl. from Fick, 1866, p. 430). … a covenant about the degree of probability one wishes to require. A certain measure is naturally to be observed. Since probability is more or less to replace certainty one must not be satisfied with too scanty a probability, e.g. it would be completely senseless to ask for a probability of only ½ (pp. 430, 434 and 440).
That was a methodological advance, yet Fick did not contribute to solving the practical difficulty of the computation of hundreds of comparable cases. Consequently, the large number of patients required according to Gavarret (and Fick) continued to be criticised. Several ways to solve the problem were suggested. Wunderlich had, irrelevantly, proposed concentrating on the effects of a given remedy rather than on a disease because of the diagnostic uncertainties (Wunderlich, 1851, p. 111).
So, questions remained open. But new inputs were soon to be propounded by three physicians of an even younger generation born in the 1840s and then elaborated by an older, remarkably versatile colleague.
Therefore, [he said] it is also permitted to choose this stricter form of calculation, although the absolute numbers are not very large. … Fick’s formula is insufficient, for [the number of cases] is too few. [And he concluded] maybe it is time just now to open the doorway for analytic statistics for non-specialists. This science, albeit hardly existing today … will in the future solve problems of which we now have not the slightest idea (Transl. from Jürgensen, pp. 65–76, 129). … perhaps it is timely just now to popularise analytical statistics more generally … This science, albeit hardly existing nowadays, … would in the future solve problems of which we had not yet a clue (Transl. from Jessen, 1867, pp. 128, 136).
Tübingen again
Since his youth,
Now Liebermeister started working like a professional on the methodological issue that Jürgensen had dealt with a decade before in an amateurish way. In Basel he had wondered why the lethality of typhus patients was higher in his clinic when compared to Jürgensen’s. This had to be explained. But he also had in mind to find a mathematical solution to the meaning of a statistical difference between two therapies. For this he developed a test of significance for such a difference (1877). He lectured on the issue, and sent a manuscript to two professors of mathematics, former colleagues from the University of Basel, for critical examination. They approved. The ensuing publication bore a similar, yet more specific, title than Fick had chosen, namely Ueber Wahrscheinlichkeitsrechnung in Anwendung auf therapeutische Statistik. (On the calculus of probabilities applied to therapeutic statistics (Liebermeister, 1877). He named this mathematical solution a ‘four-table-test’. It was practicable for analysing very small groups (n=<10), but led to extensive calculations when larger groups were analysed. He included examples of both situations.6,7
Footnotes
Declarations
Acknowledgements
My heartfelt thanks to: Iain Chalmers, without whose unflinching encouragement, gentle whip, intellectual and unrenounceable practical help over the years, I would neither have begun nor ever terminated this work; Thomas Schlich, who critically and helpfully read all previous versions; Robert Matthews, whose help with mathematical matters was very welcome; Brigitte Wanner and Christian Wyniger of the Institute of Social and Preventive Medicine, Bern, who helped me, together with Patricia Atkinson, Oxford, with ever so many IT technicalities; my wife, Marie Claude, whose patient love is not probably, but absolutely true.
Provenance
Invited contribution from the James Lind Library.
Supplementary file
The references listed below are chosen as essential to the reading of the article. However, the full list of primary and secondary references is available online both on the Journal’s website as supplementary material, and with the original publication at
. Except when otherwise mentioned, translations into English are the author’s own
Author’s note
When not specifically referenced, biographical details stem from:
Bynum WF and Bynum H, eds. Dictionary of Medical Biography. Westport, CT and London: Greenwood, 2007. Dictionary of Scientific Biography. New York: Charles Scribner’s Sons. Hirsch A, ed. Biographisches Lexikon der hervorragenden Ärzte aller Zeiten und Völker, 2nd ed. Berlin, Wien: Urban & Schwarzenberg, 1929.
