Prediction of Molecular Properties with Mid-Infrared Spectra and Interferograms

Abstract

We built infrared spectroscopy-based partial least-squares (PLS) models for molecular polarizability using a 97-member training set and a 59-member independent prediction set. These 156 compounds span a very wide range of chemical structures. Our goal was to use this well-defined chemical property to test the breadth of application of a method ultimately intended for predicting poorly defined, environmentally important properties and activity parameters (e.g., microbial transformation rate constants). Separate models were built from gas-phase mid-infrared spectra and, alternatively, from their Fourier transforms (i.e., interferograms). The optimum spectrum- and interferogram-based models produced approximately the same error (root-mean-square deviation divided by the parameter value range) for the independent prediction set: 9.53% and 9.92%, respectively. With spectrum-based models, deresolving the spectra from a point spacing of 6 cm⁻¹ to about 40 cm⁻¹ produced much lower error (under leave-one-out cross-validation) when all 156 compounds were included, but much higher error when the model was built from a structurally narrow subset of the compounds (namely, 38 alkanes). Qualitative interpretation of the first PLS weight-loading vector from the spectrum-based model provided important information on the relationship between chemical structure and molecular polarizability.
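The error measure quoted in the abstract (root-mean-square deviation divided by the parameter value range, expressed as a percentage) can be illustrated with a minimal Python sketch. The function name `relative_rmsd_percent` and the sample numbers are hypothetical, and the sketch assumes that the "parameter value range" is the spread (maximum minus minimum) of the reference values in the set being evaluated; the abstract does not spell out that detail.

```python
import math


def relative_rmsd_percent(predicted, reference):
    """Return RMSD between predicted and reference values as a
    percentage of the reference value range (max - min).

    Assumption: the range is taken over the reference values passed in.
    """
    if len(predicted) != len(reference) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    value_range = max(reference) - min(reference)
    if value_range == 0:
        raise ValueError("reference values must not all be identical")
    # Mean of squared deviations, then square root -> RMSD
    mean_sq = sum((p - r) ** 2 for p, r in zip(predicted, reference)) / len(reference)
    rmsd = math.sqrt(mean_sq)
    return 100.0 * rmsd / value_range


# Illustrative (made-up) values, not data from the article
reference = [1.0, 2.0, 3.0, 5.0]
predicted = [1.5, 2.0, 2.5, 5.0]
error_pct = relative_rmsd_percent(predicted, reference)
```

Dividing by the range makes the figure dimensionless, so errors for properties measured in different units can be compared on the same percentage scale.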

Keywords

Property prediction; Partial least squares; Infrared spectroscopy; Fourier transform data processing; Spectral resolution

