Abstract
The biogeochemical fate of organic matter (OM) entering soils is an important issue that must be examined to better understand its roles in nitrogen cycling and as a natural modulator of soil-atmospheric carbon fluxes. Despite these critical roles, there are uncertainties in estimating the contribution of this feedback mechanism due in part to a lack of molecular-level information regarding the origin and labile and refractory inventories of OM in soils. In this study, we used a multi-analytical approach to determine molecular-level information for the occurrence and stabilization of OM in a bird guano concretion of the Late Miocene or Pliocene age in Jamaica. We determined the specific organic structures persisting in the concretion and the possible contribution of fossil organic matter to the OM pool in modern environments. Our results indicate that aliphatic species, presumably of a highly polymethylenic nature [(CH2)
Keywords
Introduction
Soil organic matter (SOM) is a complex heterogeneous mixture of various biomolecules predominantly of plant and microbial origin. 12 Traditionally, studies on SOM have focused on its role in soil fertility and other related soil characteristics; however, in recent decades, the role of SOM as a natural modulator of soil-atmospheric carbon fluxes has emerged as a critical area of research. Despite this shift in the conventional paradigm, knowledge of the biochemical composition and dynamics of SOM, including the carbon and nitrogen biogeochemical cycles, is based on the prerequisite that organic matter (OM) occurring in soils originates exclusively from current biomass. Conversely, the contribution and biochemical fate of fossil organic matter (FOM) to the modern SOM inventory is less well-documented and is poorly understood.3–6 Therefore, to provide more accurate estimates of the soil-atmospheric carbon flux as well as the size and availability of the labile nitrogen pool to plants for predicting fertilizer requirements, it is important to first examine the biochemical contribution of FOM to the SOM inventory. FOM entering soil, sediment, and river environments is often released by the weathering and erosion of sedimentary and metamorphic rocks.3,7,8
Given the uncertainties regarding the elusive “missing carbon sinks”, the size of labile and refractory SOM pools, and the dynamics of carbon and nitrogen in these pools, it is important to delineate the possible contribution of FOM to the SOM pool. Further, it has been suggested that increases in CO2 levels are expected to continue in the next decades, even if stringent cuts in fossil fuel emissions are implemented worldwide. 8 Therefore, accurate molecular-level information about the origin of FOM, its role in the biogeochemical cycling of carbon, and processes involved in its release and transformation is necessary to provide better estimates of soil-atmospheric CO2 fluxes. It is important to note that in addition to inputs, the composition and dynamics of biomolecules in soils are influenced by a wide range of biotic and abiotic factors. These include bioturbation, oxidation, microbial- and photo-degradation, hydrolysis, and weathering. 9
Many of the biomolecules present in OM are reactive and are subjected to readily discernible modifications to their original molecular structure as a consequence of degradation reactions, thus allowing biogeochemical reaction sequences to be examined.10,11 As an example, lipids demonstrate a strong preservation potential, structural specificity, and readily discernable structural modifications in OM,11,12 and may be used to trace the origin, chemical history, and biogeochemical dynamics of OM in soils. Similarly, Copard et al 3 used fossilized geochemical markers to confirm the contribution of fossilized organic carbon (FOC) in present-day soil and river environments. Together, this information may be used to provide clues about the processes underpinning carbon and nitrogen cycling.
When considering the structural aspects of OM, it is important to examine the contribution of the various compound classes. Here we employ Fourier transform infrared (FT-IR) spectroscopy, highresolution-magic angle spinning nuclear magnetic resonance (HR-MAS) spectroscopy, and gas chromatography mass spectrometry (GC-MS) to provide molecular-level information on the occurrence and stabilization of OM in a bird guano concretion of the Late Miocene or Pliocene age from the Manchester Parish in Jamaica. Particularly, we examined which specific organic structures persisted in the concretion and the possible contribution of FOM to the present-day OM pool. FT-IR spectroscopy has been successfully employed to study environmental matrices such as SOM 16 and compost, 17 and provides us with a nondestructive means to perform rapid molecular-level investigations of a sample. NMR spectroscopy is the single most powerful analytical technique that can be applied for the analysis of organic structures and their interactions in various environmental matrices, 18 and provides useful complementary and confirmatory data regarding specific biochemical components that persist and are available for release into the environment. Although advanced spectroscopic approaches such as HR-MAS NMR are used to yield vital data about predominant species,18–20 these methods provide less information about exact molecular composition. 2 Therefore, we also examined the chemical decomposition of OM to yield smaller units amenable to GC-MS, thus providing a means for performing more informative investigations about the biochemical nature of the OM pool in the sample.
Methods
Sample and Sample preparation
A guano concretion, previously described in Garrett et al, 21 was obtained from a goethite-rich bauxitic soil in Spitzbergen (18.1268°N, 77.4957 W and altitude 732 m) in the Manchester Parish, Jamaica. Upon receipt, the sample was thoroughly cleaned by consecutive washes with large amounts of distilled and deionized water and then oven-dried at 60 °C. Next, the concretion was subdivided, ground to a fine consistency, and sieved through a stainless steel sieve with a 2-mm aperture. Duplicate samples (15 g each) of the sieved homogenized guano concretion were exhaustively extracted with 0.1 M NaOH (each time using 30 mL), the extract filtered through 0.22-μm Durapore membrane filters (Millipore, Billerica, MA, USA) under nitrogen atmosphere, cation-exchanged [Amberlite (Rohm and Haas, Philadelphia, USA) IRC-50 (H)] to remove the Na+ and then freeze-dried.
FT-IR analysis
The FT-IR spectrum of the sample was recorded on a PerkinElmer FT-IR Spectrum GX Spectrometer (Waltham, MA, USA). Approximately 1.0 mg of freeze-dried OM extracted from the sample was homogenized in 100 mg of spectroscopic-grade KBr with a refractive index of 1.559 and a particle size of 5-20 μm (Sigma, St. Louis, MO, USA) and used to create a disc-like potassium bromide (KBr) pellet for analysis. Background KBr spectra were obtained and spectra were compared to the background. Spectra were recorded by accumulating 256 scans (to increase the signal-to-noise ratio) in the 4000 to 400 cm−1 mid-infrared spectral range in the absorbance mode with a resolution of 4 cm−1. Baseline correction was conducted using the automatic baseline correction method. Samples were analyzed immediately after preparation to minimize water absorption onto the hydroscopic KBr that may have obscured key signals from the sample or lead to misinterpretation of signals. 22
High-Resolution Magic Angle Spinning (HR-MAS)NMR
Thoroughly dried organic extract (∼15 mg) was placed in a 4 mm zirconium oxide rotor and 1 mL D2O was added and titrated to pH 13.1 using NaOD (40% by wt) to ensure complete solubility. It is essential to dry the samples thoroughly and use only ampules of NaOD or D2O to prevent a large water peak often centered at ∼3.3 ppm that can obscure many actual OM signals. After homogenization of the sample using a stainless steel mixing rod, the rotor was doubly sealed using a Kel-F (Bruker) sealing ring and a Kel-F rotor cap. 1H HR-MS NMR spectra were acquired using a Bruker 500 MHz Avance spectrometer (Billerica, MA, USA) fitted with a 4-mm triply tuned 1H−13C−15 N HR-MS probe fitted with an actively shielded Z gradient at a spinning speed of 10 kHz. 1H NMR was acquired while simultaneously decoupling both 13C 15N nuclei. Scans (256) were acquired with a 2-s delay between pulses, a sweep width of 20 ppm, and 8 K time domain points. Basic spectral assignments were verified by two-dimensional (2D) experiments such as heteronuclear single quantum coherence (HSQC) and 1H−1H total correlation spectroscopy (TOCSY) of actual biopolymers (data not shown). Spectral predictions were also carried out using Advanced Chemistry Development's ACD/SpecManager and ACD/2D NMR Predictor using Neural Network Prediction algorithms (version 10.02). Parameters used for prediction including line shape, spectral resolution, sweep width, and spectrometer frequency were set to match those of the real datasets as closely as possible. 18
Sequential Solvent extraction
Solvent extraction
Total solvent extraction of the concretion followed the protocol described by Otto et al 23 with some modifications. Powdered sample (15 g) was sonicated twice for 15 min, each time with 30 mL double-distilled water to remove highly polar water-soluble compounds. Water extracts were centrifuged at 1048 xg for 30 min, decanted, and the combined extracts freeze-dried and stored at −20 °C for further analyses. The water extracted residue was freeze-dried to remove excess water and then extracted with solvents as follows: samples were sonicated twice for 15 min with 30 mL methanol, then 30 mL dichloromethane:methanol (1:1; v/v), followed by 30 mL dichloromethane. The combined total solvent extracts (“free lipids”) were filtered through glass fiber filters (Whatman GF/A, Kent, UK) using a Buchner apparatus, concentrated by rotary evaporation, and dried completely in 2 mL glass vials under a constant stream of nitrogen gas. Solvent-extracted residues were air-dried in preparation for base hydrolysis.
Base hydrolysis
Air-dried solvent-extracted residues were refluxed for 3 h using 30 mL of 1 N methanolic KOH. The samples were allowed to cool, centrifuged for 30 min at 1048 xg, and the supernatant decanted into a glass round-bottom flask and kept at 4 °C. Each residue was then extracted twice by sonication for 15 min with 30 mL of dichloromethane:methanol (1:1; v/v). The suspensions were centrifuged as described above and the combined supernatants were acidified to pH 1 using 6 M HCl. The extracts were dried by rotary evaporation at 40 °C under vacuum and the dried extracts resuspended in 30 mL of Milli-Q water. Base hydrolyzed products were recovered from the water phase by liquid-liquid extraction in a separation funnel with diethyl ether. Anhydrous Na2SO4 was added to the combined ether phases to remove water. The extracts were concentrated by rotary evaporation, transferred to 2-mL glass vials, and dried under a nitrogen atmosphere. The base hydrolyzed residues were air-dried and stored at −20 °C. 24
Derivatization and gc-ms analysis
Derivatization of total solvent extracts was performed as described below. Dried solvent extracts were redissolved in 500 μL of dichloromethane:methanol (1:1; v/v) and aliquots of the reconstituted extracts (50 μL) were evaporated to dryness under nitrogen atmosphere. The dried residues were then converted to trimethylsilyl (TMS) derivatives in a 50 μL reaction using
Results and Discussion
FT-IR spectroscopy
Extractable OM represents a complex heterogeneous mixture of biomolecules. Therefore, unlike pure compounds with characteristic sharp absorption peaks, OM extracted from the guano concretion is characterized by a series of broad bands resulting from the overlap of the absorptions of many similar functional groups (Fig. 1). In the present study, spectral assignments were consistent with those reported in the literature and represent the predominant species (not all species) in a given region, which are outlined below.

FT-IR spectrum of fossil organic matter extracted from a bird guano concretion of Late Miocene or Pliocene age. Major spectral assignments are consistent with data from published literature and represent the predominant species (not all species) in a given region and are assigned in the text.
A strong broad band positioned at 3427-3308 cm−1 is attributed to the presence of an abundance of
HR-MAS NMR spectroscopy
FT-IR spectroscopy provides a good overview of the major biochemical components present in the sample (Fig. 1). However, due to the vast spectral overlap from the heterogeneity of compounds in natural organic matter, we have applied HR-MAS NMR to clarify and confirm the assignments made from the IR spectrum. The HR-MAS NMR spectrum of guano concretion-derived OM is presented in Figure 2 and provides substantial complementary details of the major biochemical components that have been preserved in the sample. Spectral regions in Figure 2 have been assigned.20,31 The regions highlighted in brackets represent the predominant species (similar contributions from other species are present in some areas) and are broadly defined as contributions from: (1) amide signals in peptides and signals from aromatic rings including aromatic amino acid residues, some amide signals in peptides may also resonate in this area; (2) overlap region of anomeric protons from carbohydrates, protons associated with esters, protons on α carbon in peptides and double bonds; (3) other protons in carbohydrates, protons α to an ester, ether, and hydroxyl in aliphatic chains will also resonate in this region; (4) signals from various substituted methylenes and methines β to a functionality in hydrocarbons, signals from some amino acid side chains will also resonate here; (5) CH2, main chain methylene in lipids; and (6) CH3 groups. When a spectrum has a very strong protein contribution, the CH3 signal is derived mainly from methyl rich amino acids residues such as leucine, isoleucine, and valine; however, when the sample is dominated by aliphatic chains, for example in Figure 2, a considerable contribution is from aliphatic substances.20,31

1-D 1H HR-MAS spectra of (A) fossil organic matter extracted from a bird guano concretion of Late Miocene or Pliocene age, with that of bovine serum albumin presented for comparison (B). General regions are highlighted in brackets and may be broadly defined as: (1) amide signals in peptides and signals from aromatic rings including aromatic amino acid residues, some amide signals in peptides may also resonate in this area; (2) overlap region of anomeric protons from carbohydrates, protons associated with esters, protons on α carbon in peptides and double bonds; (3) other protons in carbohydrates, protons α to an ester, ether, and hydroxyl in aliphatic chains will also resonate in this region; (4) signals from various substituted methylenes and methines β to a functionality in hydrocarbons, signals from some amino acid side chains will also resonate here; (5) CH2, main chain methylene in lipids; (6) CH3 groups; aa = amino acids; and (i) mainly aromatic residues.
It is evident that the 1-D NMR spectrum is dominated by signals from aliphatic lipids (regions 5 and 6), suggesting these compounds are selectively preserved and may be a major contributor to the soil-C pool. This enrichment of aliphatic lipids in the sample may be due in part to their hydrophobic nature, which prevents transport and as a result, lowers the contact with degrading microbes and other biotic and abiotic factors, allowing them to remain in the sample for longer periods of time. Additionally, aliphatic enrichment may be due in part to the degradability of the readily bioavailable dissolved or water-extractable OM fraction. This is consistent with the observation that alkyl-C, such as those in polymethylenic structures, is the most recalcitrant form of organic carbon found in non-living SOM.32,33 Such structures have also been found in the resistant biopolymers of algae. 34 Further, several recent studies focusing on the adsorption of OM to clay mineral surfaces have demonstrated the aliphatic nature of clay-organo interactions.35–38 Such interactions should not be excluded from the enrichment of aliphatic species in the sample. Although not as significant as aliphatic lipids, there is some evidence to suggest that carbohydrates have been preserved in the sample. This is somewhat surprising since the conventional view is that labile substrates such as carbohydrates are rapidly mineralized by degrading microbes. It is probable that the preservation of carbohydrates in the sample is linked to modifications by other biomolecules such as lipids or proteins. For instance, recalcitrant lipopolysaccharides are thought to be a large component of the surviving carbohydrates entering the SOM pool.2,18,31
Similarly, the presence of proteins/peptides (region 1) in the sample is somewhat unexpected. Proteins/peptides have been confirmed by the spectral similarity shown in Figure 2B (protein standard bovine serum albumin) to that of the 1D 1H NMR spectrum of the sample (Fig. 2A), for which contributions from protein/peptide structures such as amide (N-H), aromatic amino acids (aa), and α-protons are easily distinguished. Traditionally, proteins are thought to be labile in the environment; therefore, it is not immediately clear whether specific proteins presented here are resistant to degradation due to their intrinsic molecular structure, as is the case for membrane proteins, 39 or environmentally induced modifications of their quaternary structures and physico-chemical (charge and hydrophobicity/hydrophilicity) and biochemical characteristics. For example, the biorefractory nature of signals resonating from lipoproteins (region 4) has previously been demonstrated. 31 Additionally, protein preservation could include abiotic processes such as condensation reactions, which are known to reduce degradability. 42 Similarly, sorptive protection offered by mineral matrix may be an important factor in the long-term stabilization of proteins/peptides. It has been suggested that this form of association with clay minerals may result in OM being isolated in microenvironments where it could be protected from decomposers. 43 It is also intriguing to suggest that proteins/peptides survived as products of humification catalyzed clay-mediated reactions before concretion of the sample. This suggestion is favored since proteins/peptides are structurally and biochemically versatile biomolecules (they contain both cationic and anionic functional groups as well as hydrophilic and hydrophobic residues) and have demonstrated a strong affinity for clay mineral surfaces.37,38 Additionally, anoxic soil conditions may have also contributed to the overall stability of OM in the sample as fossilization occurred. Whatever the case, these results further suggest that FOM may be an important contributor to carbon and nitrogen pools in present-day SOM.
Lipid analysis
To further understand the nature of the dominant lipid fraction in the sample, we performed GC-MS analysis of silylated lipid extracts. The major classes of compounds contained in the fractions obtained from sequential extraction of the sample included a series of
Occurrence of identified compounds in fractions obtained from sequential extraction of a fossilized bird guano.

Representative GC-MS chromatograms (TIC) of fractions obtained from sequential solvent extraction of a bird guano concretion of Late Miocene or Pliocene age. (A) Free lipids and (B) bound lipids (base hydrolysate). * denotes
It has been hypothesized that, in the environment, the reactivity and therefore lability of aliphatic chains increases with their degree of unsaturation; hence, it is not surprising that we did not detect
Although not present in significant diversity or relative abundance, there is still sufficient evidence to support the recalcitrance of guano-derived lipids in this study. This recalcitrance may be due in part to the molecular structure, cross linkages, and various mechanisms of protection offered by minerals. The roles of specific mineral components (Fe, Al, and Si) in clay-organo interactions have been previously suggested.
38
Further, selective preservation of shorter chain FAs after degradation may be linked to chemical modification by polar cellular components, such as amino acids and carbohydrates, as is consistent with lipoproteins and lipopolysaccharides, respectively. Such biomolecules have been shown to resist decomposition in soil microbial biomass and leachates.
31
The biorefractory nature of
Conclusion
The biochemical contribution of FOM to the SOM inventory is important and must be considered in order to provide more accurate estimates of the soil-atmospheric carbon flux as well as size and availability of the labile nitrogen pool to plants so as to predict fertilizer requirements aimed at improving nitrogen use efficacy. Molecular-level investigations were carried out to determine the possible contribution of FOM to the OM pool in soils. The results indicate the occurrence of major biochemical components (proteins, carbohydrates, and various classes of lipids) in a guano concretion. Although not a quantitative approach, this study provides a useful molecular-level descriptor of FOM components preserved in a concretion synonymous with that of sedimentary rocks. Therefore, it is probable that fossil components preserved in the concretion are available for release by weathering and erosion, eventually becoming integrated into the recent SOM pool. These findings also further highlight uncertainties in our knowledge of SOM inventory and its role in regulating natural and anthropogenic changes to global carbon and nitrogen biogeochemical cycles. Therefore, if we are to realistically address these and related concerns, further studies are needed to decipher the fate of FOM in the environment and establish a link between FOM in soils, sediments, and rivers as well as the carbon and nitrogen biogeochemical cycles.
Footnotes
Acknowledgements
We are grateful to Charles Grant and Johann Antoine of the International Centre for Environmental and Nuclear Sciences (ICENS) and Dr. Sherene James-Williamson of the department of Geography and Geology, University of the West Indies, Mona for their efforts in acquiring the guano concretion. We thank the anonymous reviewers for their insights and help with the completion of this manuscript.
Author Contributions
Conceived and designed the experiments: AS. Analyzed the data: AS, REH, TJ, CR. Wrote the first draft of the manuscript: AS. Contributed to the writing of the manuscript: AS, REH, TJ, CR, RNA. Agree with manuscript results and conclusions: AS, REH, TJ, CR, RNA. Jointly developed the structure and arguments for the paper: AS, REH, CR. Made critical revisions and approved final version: AS, RNA. All authors reviewed and approved of the final manuscript.
Funding
Author(s) disclose no funding sources.
Competing Interests
Author(s) disclose no potential conflicts of interest.
Disclosures and Ethics
As a requirement of publication the authors have provided signed confirmation of their compliance with ethical and legal obligations including but not limited to compliance with ICMJE authorship and competing interests guidelines, that the article is neither under consideration for publication nor published elsewhere, of their compliance with legal and ethical guidelines concerning human and animal research participants (if applicable), and that permission has been obtained for reproduction of any copyrighted material. This article was subject to blind, independent, expert peer review. The reviewers reported no competing interests.
