Abstract
Thyroglobulin (Tg) is a tumour marker for differentiated thyroid cancer. Interpretation requires a knowledge of the current thyrotropin (TSH) concentration as secretion is TSH-dependent. While a raised serum Tg may be indicative of residual or recurrent thyroid cancer, trauma to the thyroid (e.g. surgical, biopsy or due to radioiodine treatment) also causes an increase. Tg may be measured when TSH is suppressed and also following recombinant TSH (rhTSH) stimulation. Interpretation of results in pregnancy and in children is discussed. Assay bias and interference by endogenous Tg antibodies (Abs) are the main confounders in the interpretation of results. Although there is an international standard for Tg, there are large differences in results and yet there are few assay-specific clinical decision limits. Patients should therefore be monitored with the same assay. Endogenous TgAbs may cause false-negative interference in immunometric assays and may cause false-positive results in radioimmunoassay. Although the measurement of TgAbs has been advocated for predicting interference, it is now clear that interference can still occur when TgAbs have not been detected, the effect being TgAb-assay-specific. Approaches to identifying those samples where there may be interference are discussed. The laboratory should have a protocol for the investigation of possible interferences and data on the bias of the Tg assay that they use. An appreciation of the clinical uses of the service is required as an understanding by endocrinologists, oncologists and endocrine surgeons of the analytical limitations of the service.
Introduction
Thyroid cancer, although rare, is the commonest endocrine cancer with over 2000 people diagnosed with the disease in 2007 in the UK and a men:women ratio of 1:3. 1 Patients typically present with thyroid nodules, with or without goitre, and the clinical need is to identify those nodules which represent thyroid cancer, of which differentiated thyroid cancer (DTC) is the commonest form. Prevalence rates for thyroid nodules are dependent on mode of detection (palpation versus ultrasonography), population, gender and age. In the survey of thyroid disease in the community of Whickham, the prevalence of thyroid nodules was found to be 3.2% against a background prevalence of goitre of 6.9% and 8.6% (palpable and visible and palpable but not visible, respectively). 2 In general, treatment consists of total thyroidectomy, radioiodine ablation of the thyroid remnant and treatment with thyroxine (T4) in doses sufficient to suppress serum thyrotrophin (TSH). Prognosis is good with a survival of 90% at 10 years. The American, British and European Thyroid Associations have provided guidelines on the treatment of thyroid cancer. 3–6
Thyroglobulin (Tg) is a large molecular weight glycoprotein (approximate molecular weight 660 kDa) found in thyroid cells and the follicular lumen, and is involved in the synthesis of thyroid hormones. In humans, the gene is found on chromosome 8 and transcription is controlled by a number of thyroid-specific transcription factors. Tg mRNA is very heterogeneous and, once translated, the protein undergoes glycosylation. The extent of glycosylation and iodination depends on the source of the protein, e.g. from normal thyroid, tumour tissue or the peripheral circulation. It is released into the peripheral circulation during normal hormone synthesis and after trauma to the thyroid. As the standard approach to treatment of DTC is thyroidectomy followed by radioactive ablation, successful treatment is reflected by an undetectable serum Tg. Thus, measurement of serum Tg can be used as a tumour marker for monitoring treatment of DTC and to detect disease recurrence, with detectable/rising serum Tg potentially indicating residual or recurrent disease. 7 This contrasts with a subject with an intact thyroid who will have serum Tg in the range of 3–40 μg/L. 8 It cannot be overemphasized that reference ranges and values for measured Tg are assay-dependent.
Serum Tg is currently measured by immunoassay and there are a number of analytical challenges, namely the requirement for stability of the assay over decades given the need for long-term clinical follow-up, differences in assay bias and interference in the immunoassays by endogenous Tg antibodies (Abs). 9,10 Concern remains that endocrinologists, endocrine surgeons and oncologists may not be aware of the analytical limitations of immunoassay for Tg and that laboratories may not be aware of the varied clinical situations in which results are used. The aim of this paper is to consider the use of serum Tg as a long-term marker of DTC and the pitfalls of interpretation of results by a multidisciplinary team.
Routine review of the patient with DTC
Serum Tg is one element of the assessment of the patient and interpretation of the result requires a full knowledge of, for example, clinical presentation, type of thyroid surgery, thyroid hormone replacement therapy and results of imaging. It is essential that serum TSH is measured at the same time as serum Tg as secretion of Tg is TSH-dependent. It is unlikely that the clinical biochemistry laboratory will always have sufficient information to ‘interpret’ a Tg result at the stage of result authorization, but knowledge of histological type of DTC, surgery, ablation and TSH will allow assessment of whether the Tg result is analytically and clinically plausible. It is inappropriate to quote a reference range for serum Tg determined from a population of subjects with intact thyroids.
Serum Tg may be requested at the following stages of patient care and these will be discussed:
At presentation with thyroid nodule or nodular goiter; Prethyroidectomy/preradioiodine ablation; A few weeks post-ablation of the thyroid remnant; On T4 replacement therapy; Off T4 replacement therapy; Following stimulation with recombinant TSH (rhTSH; Thyrogen®).
The thyroid nodule and nodular goitre
Thyroid nodules may be the presenting feature found on palpation of the thyroid or may be discovered incidentally on imaging of the head and neck. Depending on clinical presentation and findings, history and family history, it has been recommended 4 that only nodules >1 cm require evaluation as these may represent clinically significant cancers. Serum TSH should be estimated. A subnormal serum TSH with a normal or raised serum free T4, i.e. the biochemistry of overt or subclinical hyperthyroidism, is rarely associated with thyroid cancer. Measurement of serum Tg is not indicated as concentrations of Tg are elevated in most thyroid diseases and coexisting lymphocytic thyroiditis or Graves’ disease may be more prevalent in patients with DTC prior to surgery, 11,12 thus the test lacks specificity. 13
Diagnostic ultrasound (US) establishes the size and US nature of the nodule and may be accompanied by fine needle aspiration (FNA) biopsy. Results of FNA cytology (FNAC) can then be considered according to defined diagnostic categories. These can be summarized as follows: Thy1 – non-diagnostic; Thy2 – non-neoplastic; Thy3 – follicular lesion/suspected follicular neoplasm; Thy4 – suspicious of malignancy; and Thy5 – diagnostic of malignancy. 5
DTC is the commonest thyroid cancer comprising papillary cancer at 85% of cases, follicular 10% and Hürtle cell or oxyphil tumours 3%. Certain histological subtypes of papillary thyroid cancer may have a worse prognosis.
Metastatic disease – cervical lymphadenopathy
The neck represents the commonest site for DTC metastases and the presence of cervical metastases will determine the extent of primary surgery or the need for second surgery. Preoperative imaging may also identify cervical lymphadenopathy and indicate the possible presence of metastases. A number of studies have considered the measurement of Tg in aspirates from cervical nodes as indicating metastatic disease. There are a number of potential analytical pit-falls with this approach. There is reported variability in how the sample is collected, including dilution effects when collecting needle ‘washouts’ and possible contamination with serum and thyroid cells. Laboratories need to establish protocols for the exclusion of the high-dose hook effect and investigate the possible matrix effects of the needle washout fluid in their assay. Logistical difficulties in collecting needle washouts into serum-based diluents have been reported. 14 As an alternative, sodium chloride (9 g/L) has been used to collect the needle washout sample, despite the reported over-recovery of exogenous Tg of 100–140%. 14 Other studies have shown a wide range of results in such samples. Boi et al. 15 defined Tg cut-off values of 36 μg/L if the patient had an intact thyroid and 1.7 μg/L if there was no thyroid, to indicate that the lymph node that had been aspirated contained metastatic disease. An earlier study also showed overlap between patient groups (pre- and post-thyroidectomy). In both patient groups, there was a significant elevation in Tg in the samples from patients with metastatic disease compared with those with negative lymph nodes (post-thyroidectomy patients; range of aspirate Tg < 0.25–76,000 versus <0.25–5.0 μg/L and pre-thyroidectomy patients; range of aspirate Tg < 0.25–89,000 versus <0.25–20 μg/L for metastatic versus negative lymph node, respectively). 16
There are few data on the comparative bias of different Tg assays in aspirates, 17 and further work is required to determine the analytical limitations of analysing such samples particularly with regard to assay specific cut-offs. It is clear that cytology should always be performed on the aspirates as Tg will be undetectable in undifferentiated metastases.
Metastatic disease – unknown primary
As for many tumour markers, the measurement of serum Tg is not diagnostic. The common sites for metastatic disease for DTC, other than cervical lymph nodes, are the lungs and bone. The laboratory may receive requests for serum Tg in patients with such metastases with a primary of unknown origin. Although very high serum Tg concentrations may suggest metastatic thyroid cancer, moderately elevated or normal concentrations do not rule it out. In an early study, 18 18 of 40 patients with metastatic thyroid cancer had a serum Tg greater than 400 μg/L compared with two out of 55 with benign nodular disease. Cut-off values are not agreed and other co-existent thyroid diseases may cause similar elevations. 19
Preoperative/preradioiodine ablation Tg
The measurement of Tg preoperatively/preradioablation is not recommended as there are few data to suggest that this influences further treatment/actions. 3–6 Given that the patient has an intact thyroid it is not possible, with current assays, to determine the source of serum Tg, i.e. whether from normal thyroid tissue, tumour or metastases.
Early measurement of Tg post-thyroid remnant ablation
Patients with known metastases, extra-thyroidal tumour extension, primary tumours larger than 4 cm or other high-risk features will generally be offered radioiodine ablation. This may be carried out after thyroid hormone withdrawal or following administration of rhTSH. Initial assessment of the success of ablation is usually performed at six months. There is nothing to be gained by measurement of serum Tg less than six weeks post-ablation (or thyroidectomy) as any measured Tg may merely reflect the recent injury to the thyroid and subsequent clearance of released Tg. Assessment at six months may also include a 131I diagnostic whole body scan (WBS) to determine the success of ablation and extent of any residual disease in the neck or elsewhere.
Long-term follow-up
The frequency of follow-up after surgery and thyroid remnant ablation will depend on histological type and other risk factors. Initial review may be at six-month intervals and then yearly and is likely to be lifelong. 5 The laboratory should therefore endeavour to provide Tg results from the same assay over a period of decades for a given patient. Requests for serum Tg may be made when the patient is (1) taking T4; (2) off T4/T3, i.e. biochemically hypothyroid; or (3) after administration of rhTSH. It is important that serum Tg has a high negative predictive value, not only to reassure the patient of the absence of disease but also to optimize the frequency of follow-up. Review of the patient may include both serum Tg and a 131I-WBS.
Of the possible outcomes of the above approach, an undetectable serum Tg with no radioiodine uptake into the thyroid bed or elsewhere, and a detectable serum Tg with detectable isotope uptake, would be considered to be in agreement, indicating successful or unsuccessful thyroid ablation, respectively. The difficulties in interpretation may arise where there is a detectable serum Tg but no radioiodine uptake or an undetectable serum Tg with significant isotope uptake.
Serum Tg may be a less sensitive marker in patients with less differentiated tumours, with some tumours losing the ability to trap radioiodine and secrete Tg. The possibility of a falsely low serum Tg due to interference by endogenous TgAbs in the immunoassay should be excluded.
Where a serum Tg is detectable but with no radioiodine uptake on diagnostic scan, a false-positive interference in the Tg assay, for example reflecting heterophilic antibodies in an immunometric assay (IMA) or anti-TgAbs in a competitive radioimmunoassay (RIA), should be excluded (see ‘Is the result correct?’ and following sections).
Recombinant thyrotropin
Protocol
Recombinant human TSH (Thyrogen®; Genzyme Corp, Cambridge, MA USA) is licensed for use with or without radioiodine imaging and with measurement of serum Tg for the detection of thyroid remnant and thyroid cancer in patients post-thyroidectomy. 20 It is also licensed to increase the uptake of radioiodine for the ablation of remnant thyroid tissue post-thyroidectomy. The dose can be administered as two 0.9 mg intramuscular injections 24 h apart. With a single injection, the serum TSH reaches a peak 6–8 h post-injection of 280 mIU/L, falling to 140 mIU/L at 24 h and 25 mIU/L at 48 h. 21 Use of rhTSH is contraindicated in patients with hypersensitivity to human or bovine TSH and its use should be avoided in pregnancy and patients who are breast feeding.
Interpretation
Early reports of the use of rhTSH-stimulated serum Tg as a marker of residual or recurrent disease were based on whether the Tg was undetectable or detectable. These studies utilized an assay with a quoted functional sensitivity of 0.5 μg/L and also a coefficient of variation of 24.8% at a Tg of 0.44 μg/L. 22,23 A number of studies have reported on the sensitivity and specificity of the test though these are difficult to compare as some have assessed performance for the detection of metastatic disease, some for residual disease and yet others for both. 24 With serum Tg cut-off concentrations of between 1.0 and 3.0 μg/L, sensitivity varied from 56% to 100% and specificity from 82% to 100%. This may be due to the different populations studied, for example, the proportion of high- versus low-risk patients and whether studied at first or subsequent follow-up. In addition, some studies have excluded those patients who were found to have endogenous antibodies to Tg, which may cause interference in the Tg assay, thus influencing the estimates of sensitivity and specificity.
Cut-off values for serum Tg used to assess the performance of rhTSH stimulation in detecting metastatic and residual disease (adapted from Robbins and Robbins 24 )
Brahms Diagnostics, Berlin, Germany; Sanofi Diagnostics Pasteur, Mions, France; Kronus, Star, ID, USA; DPC (now Siemens Healthcare Diagnostics), Los Angeles, CA, USA; Nichols Institute Diagnostics, San Juan Capistrano, CA, USA
Tg, thyroglobulin; rhTSH, recombinant thyrotropin; IMA, immunometric assay; RIA, radioimmunoassay; ICMA, immunochemiluminometric assay
Since then, it has been suggested that patients can be monitored for persistent or recurrent disease without the need for TSH stimulation if a Tg assay with a functional sensitivity of at least 0.1 μg/L is used, though there might be a concomitant decrease in specificity of the test. 4 Smallridge et al., 32 using the Beckman Access IMA (Beckman Coulter, Brea, CA, USA), found that only a small number of patients (2.5% of their cohort) with an unstimulated Tg of <0.1 μg/L had a stimulated Tg greater than 2 μg/L and imaging evidence of local recurrence or distant metastases. They concluded that patients can be monitored with a serum Tg while on T4 suppressive therapy with periodic neck US and that a rising Tg or the finding of abnormal lymph nodes should prompt further investigation. The authors did note that in their practice, there was a requirement for extensive evaluation whenever there was a change in reagent lot (this comprised a 40-patient sample comparison in duplicate, linear regression analysis of the data showing no significant change in the intercept and no change in slope of the regression line greater than 10%) to ensure that there was no clinically significant effect at low Tg concentrations of the change in reagents. Other studies using the same assay have established different cut-off values. Malandrino et al., 33 studying patients on T4 suppression, found that a cut-off of 0.15 μg/L for serum Tg gave the best specificity and sensitivity for detecting disease recurrence, and others 34 have used a value of 0.27 μg/L.
The difficulty in establishing cut-off values for serum Tg for routine clinical practice is illustrated by a study 35 in which serum Tg was analysed by both Immulite IMA (Siemens Healthcare Diagnostics, Los Angeles, CA, USA; quoted functional sensitivity of 0.9 μg/L) and the Beckman Access IMA (experimentally determined functional sensitivity of 0.1 μg/L) on a proportion of the study samples. The authors present absolute values of serum Tg for 35 patients while on T4 suppression and post-rhTSH. Analysis of this published data, excluding data points below the relevant assay's lower limit of reporting, shows that the results from the Beckman assay are significantly lower (Wilcoxon test, P < 0.001) than that of the Immulite, despite traceability of both assays to the Certified Reference Material (CRM) 457. This contrasts with the data presented by Schlumberger et al. 36 who showed good agreement of these two assays when the CRM 457 was added to a basal sample. These differences in assay bias for patient samples but not ‘spiked’ samples might be due to differences in the molecular forms of Tg recognized by the difference assays and make comparison of functional sensitivities and cut-off values problematic.
Comparison of studies investigating the use of serum Tg while on T4 suppression with Tg post-rhTSH are further complicated by differences in the patient populations studied with respect to risk of recurrence, the method for excluding false-negatives due to TgAb interference and at what stage of the follow-up protocol neck ultrasonography is used. The implications of misclassification of patients as disease-free or with recurrent disease requiring further radioiodine treatment are significant and the guidelines emphasize the significance of a rising Tg and further investigations.
Whether a cut-off value of serum Tg has been determined for TSH-stimulated or TSH-suppressed patients, the main difficulty in extrapolating cut-off values determined for one study population to another is primarily the significant effect of assay bias, particularly at low concentrations of Tg in serum. In addition, of the published studies, many of the assays are no longer used or insufficient method detail is given, making assessment of their bias difficult. Although many assays are stated to be traceable to the CRM 457, 37,38 a wide scatter of results is obtained as noted above. When the CRM 457 was prepared from the normal thyroid tissue of nine patients undergoing thyroidectomy for a single adenoma, it was noted on electrophoresis that the preparation, as well as containing the intact Tg molecule, also contained smaller molecular weight forms. With the availability of recombinant human Tg, it is hoped that its use as a standard will result in a reduction in inter-assay variability. 39
In the meantime, the problems of assay variability are illustrated by External Quality Assessment Scheme data. For example, data from Distribution 47 UK National External Quality Assessment Service (UKNEQAS) 40 showed one sample with a range of measured Tg of 0.6–10.26 μg/L (assay mean for nine different IMAs) and for TgAbs of 1.4–217 U/mL (assay mean of eight assays), confirming four-fold differences in results previously reported. 8 These issues are beginning to be recognized in the clinical literature. 41
Biological variability
There is a significant literature on the biological variability of many analytes of clinical interest and recognition that the within-subject variability may differ in patients with disease as compared with healthy controls. 42 There are few data for serum Tg – within-subject variability in a group of healthy women has been reported to be 8.5% and 16.2% for TgAbs and Tg, respectively. 43 Thus there is little information to allow formal evaluation of critical differences or reference change values for serum Tg. However, a recent retrospective analysis of patients with treated papillary thyroid cancer (post-thyroidectomy and TSH suppression) has suggested that the serum Tg doubling time may be helpful in risk stratification. 44 Given that low concentrations of serum Tg measured when TSH is suppressed can be due to persistent disease and normal thyroid remnant, this approach requires further investigation. The reported values for the half-life of Tg vary from 6 to 96 h, which may reflect both heterogeneity of Tg structure and assay specificity. 45 Thus, interpretation of results of samples collected less than six weeks post-surgery or radioiodine ablation is problematic.
In pregnancy
The care of the pregnant patient with thyroid disease is particularly demanding given the implications for both patient and fetus. 46,47 The DTC patient may first present during pregnancy or have a known diagnosis requiring follow-up. Detailed advice is given in the Endocrine Society Guidelines covering the need for FNA cytology, timing of possible surgery and the absolute necessity of avoiding the use of radioactive isotopes, 48 and has recently been discussed. 49
Of note for the laboratory is the need for more frequent monitoring of thyroid function and review of the dose of T4 in the patient with treated DTC. Dose requirements for T4 are increased in pregnancy and change throughout pregnancy, the major clinical need being to avoid under-replacement and potential effects on the fetus. Laboratories should be encouraged to quote method and trimester-specific reference ranges for TSH and free T4 and should recognize the need for more frequent monitoring, for example at 4–6 weeks if T4 dosage adjustment is made. 50
Serum thyroglobulin (μg/L) in pregnancy in subjects with an intact thyroid
RIA, radioimmunoassay; IMA, immunometric assay
In children
Although thyroid cancer is not common in children, the prevalence of thyroid nodules found on clinical examination has been reported to be 1–1.5%, with a greater risk compared with adults for the presence of an underlying diagnosis of thyroid cancer if a nodule is present. In addition, thyroid cancer is reported to be a common secondary cancer in children treated with radiation therapy for another malignancy and it represents 0.5–3% of all childhood malignancies. DTC is less frequently inherited than medullary thyroid carcinoma but may be associated with the autosomal dominant conditions of Carney's complex, familial adenomatous polyposis and Cowden disease. Younger children are reported to have greater disease burden at presentation of DTC, with additional risk factors being identified such as male sex, large primary tumour size and extrathyroidal extension. The natural history of the disease in children is less well understood than in adults but the treatment pathway will be similar following FNAC: total thyroidectomy and TSH suppression with T4. However, the use of radioactive iodine remnant ablation in children is controversial. 55
The monitoring of treatment is based on the measurement of serum Tg, with similar problems of interpretation as those of adults. In a cross-sectional study, serum Tg was measured in children with an intact thyroid aged 7–18 years. There was no significant difference in serum Tg found between the genders (mean serum Tg was 28.0 μg/L [girls] and 25.3 μg/L [boys]), but a negative correlation with age was demonstrated. The authors concluded that there was a decrease in serum Tg with chronological age in children and adolescents which might be explained by changes in serum TSH. 56 As for adults, it is inappropriate to use a reference range determined in euthyroid subjects for the interpretation of results on children treated with DTC.
Other indications for the measurement of serum Tg
There are two other circumstances where serum Tg may be measured outside the context of treated thyroid cancer, namely in the investigation of congenital hypothyroidism and factitious thyrotoxicosis.
In the neonate, serum Tg has been reported to rise in the first six hours postnatally and for the first postnatal week, with concentrations falling thereafter in both term and preterm neonates. Reported concentrations vary between studies, with mean concentrations during the first week of life ranging from 41.9 to 180 μg/L and may be affected by study group characteristics such as iodine status and serum TSH concentrations. 57–60
The UK guidelines on screening for congenital hypothyroidism 61 indicate that, if radiological imaging indicates that there is no thyroid present, measurement of serum Tg may be desirable, to exclude false-negative scans due to iodine uptake defects such as that associated with mutations in the sodium iodide symporter (NIS) gene.
In patients with congenital hypothyroidism, it might be expected that in the absence of a thyroid gland (athyreosis or thyroid aplasia), serum Tg would be undetectable or low, while for other causes with a confirmed raised TSH (such as transient hypothyroidism), serum Tg would be elevated. 62 In dyshormonogenesis, concentrations of serum Tg can be low if there is a defect in Tg synthesis or intracellular processing or high if the defect is in other pathways. In an early study of congenital hypothyroidism, no significant differences in serum Tg were found between neonates with thyroid aplasia confirmed by thyroid scintigraphy (range = 15–290 μg/L) and a control group of euthyroid neonates (range = 34–700 μg/L). 63 In the neonates with congenital hypothyroidism, irrespective of cause, the range was <2.0–1100 μg/L. A more recent study 64 found a range of serum Tg of <1.0–18.7 μg/L in athyreosis while those with an ectopic gland had a range of 4.5–123 μg/L. Given this overlap in serum Tg, scintigraphic imaging and high-resolution colour Doppler US may be considered alternatives. 65
In the case of factitious thyrotoxicosis with a suppressed serum TSH, serum Tg will also be suppressed (having excluded analytical interference causing a false-negative result). 66 In contrast, in thyrotoxicosis, serum Tg will be increased despite a suppressed TSH due to release from the diseased thyroid.
Is the result correct?
Given the possibility of interference by endogenous antibodies in the Tg immunoassay, the laboratory plays a key role in developing strategies to identify and minimize the clinical impact of any such interference. This requires close collaboration with clinical staff and reporting such uncertainties is not straight forward. Different approaches will be considered such as the measurement of TgAb concentrations and Tg RIA-IMA discordance.
Endogenous antibodies to Tg
Endogenous antibodies to Tg may interfere in immunoassays for Tg. If there is antibody interference, it is thought generally to result in positive interference in competitive immunoassays and negative interference in IMAs, though this is not always the case. 67
The analytical limitations of TgAb assays themselves have been reported with respect to thyroid autoimmunity, 68 but are not as well appreciated in the area of thyroid oncology. Agglutination assays lack the required sensitivity and should not be used. However, there is significant between-method variability in results with the immunoassays for TgAbs. 68,69 Different immunoassay formats are used, either competitive or reagent excess, with all based on the binding of the endogenous TgAbs to either solid phase Tg or labelled Tg. These details of reagent type and preparation are likely to be critical given the variable epitope recognition of TgAbs. Although there is an International Reference Preparation available (IRP 65/093), variability in results may represent differences in antibody specificity and epitope expression in different disease states. In patients with DTC, antibodies with both restricted and broad specificities have been described, and interference in a Tg IMA has been shown to be related more to the number of epitopes recognized than to the pattern of epitope recognition. 70 In addition, high concentrations of serum Tg may result in under-recovery of TgAbs when TgAbs are measured by immunoassay. 71
Further confusion may result from the use of different concentrations or cut-off values for defining ‘positive’ and ‘negative’ TgAb results. Such definitions are used either to define positive/negative for the detection of disease recurrence (for thyroid autoimmunity or for DTC) or positive/negative in terms of predicting interference in Tg assays. Even in defining ‘positive/negative’ for autoimmune thyroid disease, manufacturers’ cut-off values were found to vary between 80 and 325 kU/L 68 and for defining recurrence of DTC to vary between 27.8 and 100 U/mL. 72–76
Increasingly the limitations of TgAb assays in identifying those samples showing interference in Tg IMAs are being recognized. In part, these limitations may be due to the use of manufacturers’ cut-off values for positive/negative derived for the investigation of thyroid autoimmunity. Comparison of four automated TgAb assays and receiver operator curve analysis enabled Algeciras-Schimnich et al. 77 to derive cut-off values which gave comparable performance for identifying false-negative Tg IMA results. These were: Beckman Access <4 IU/mL, Siemens Centaur (Siemen Diagnostics, Los Angeles, CA, USA) 44 IU/L and Roche (West Sussex, UK) 22 IU/L, though a comparable cut-off could not be derived for a fourth assay. Of course, these cut-off values may be specific for interference in their Tg assay alone. Spencer et al. 78 recommended the use of the TgAb assay analytical sensitivity, rather than the reference range. Even so, they reported that for two of the assays investigated, interfering antibodies were not identified in 20–30% of cases. A similar lack of concordance has been reported for TgAb assays currently in use in the UK, 79 although the concordance achieved using the manufacturers’ reference data was 74% and could be increased to 90% by adjusting the cut-off values. Thus, the re-assurance of a negative TgAb and thus exclusion of a false-negative serum Tg has to be uncertain. Similarly, some laboratories have taken the approach of not reporting Tg results in patients who are TgAb ‘positive’ though this may give false reassurance when TgAbs are not detected and the possibility of assay interference remains.
TgAbs can also be used as a marker of disease recurrence. 4,6,75 The previously reported prevalence of TgAbs in DTC patients of 10–25% is higher than that of the general population. TgAbs decline after surgery with approximately half of patients in remission becoming TgAb ‘negative’ at 1.5 years, 80 though the decline may continue for up to three years post-surgery. 81 Interpretation may be complicated by the presence of thyroiditis at the time of diagnosis and thyroid remnant ablation. Thus, a persistently detected or rising concentration of TgAbs may be more indicative of remaining disease or of recurrence than a single result. What is clear from a consideration of the analytical limitations of TgAb assays is that results from different assays cannot be used interchangeably.
Recovery of added Tg
Under-recovery of added Tg to a patient sample may indicate interference by TgAbs. Recovery has been reported to be dependent on the type and concentration of added Tg and on incubation conditions 8 and thus has not been universally recommended. However, given the limitations now noted for TgAb assays themselves, measurement of recovery may aid interpretation in difficult cases. 82
IMA-RIA discordance
Given that TgAb interference, if present, is generally reported as giving rise to false-positive results by RIAs, and false-negative result in IMAs, a demonstrated discordance in results would indicate the possibility of assay interference in either or both assays. Any definition of discordance needs to take into account the bias of both assays in the absence of interference and importantly that the bias may change with changes in reagent lot. Of particular clinical significance is the discordance of IMA-RIA results at low concentrations. A falsely undetectable Tg by IMAs may result in a delay in detection of disease recurrence. A false detectable Tg by an RIA may prompt further unnecessary investigation. Thus, knowing the effect of assay bias at low concentrations is critical in deciding whether there is IMA-RIA discordance.
To address this, Spencer et al. 78 have estimated the ratio of serum Tg measured by IMA (Beckman Access) to that measured by an in-house RIA in 24 samples that were negative for TgAbs by four methods (RSR [Cardiff, Wales], Roche, Immulite and Access), i.e. samples where TgAb interference was unlikely. The mean IMA:RIA ratio was 90% (95% confidence interval = 75–110%). Thus, discordance could be defined if the ratio fell outside that range.
An alternative is to take the regression line for the comparison of the two methods with TgAb-negative samples and calculate the standard error of that line. Discordance would be indicated if the results fell outside of those limits. However, the regression line is likely to be unduly influenced by high Tg concentrations and may be less useful at low concentrations where the intercept of the regression line might have a significant impact.
The analytical or functional sensitivity of an assay is commonly used as a basis for clinical cut-off values, whether as quoted by the manufacturer or determined by experiment. This forms the basis of classification of a result as detectable/undetectable and of IMA-RIA discordance. As noted by Ross et al., 83 bias (and changes in bias) near the zero point for automated Tg IMAs may be ‘hidden’ as instrument settings do not permit a measured signal to be reported below a certain level. They used an alternative approach of determining upper reference limits in a group of DTC patients who had undetectable TgAbs and undetectable Tg when TSH was suppressed during T4 therapy. In order to assess positive and negative bias near zero concentrations, they used an in-house computer programme for calibration which allowed the calculation of ‘negative’ concentrations. Thus, negative bias could be determined. They used a one-sided 99.9% confidence interval for the Tg-negative serum samples to determine an upper reference limit for each of six IMAs. This approach may minimize the effects of assay bias at low concentrations and allow more accurate determination of IMA-RIA discordance.
The question remains as to how these approaches to the investigation of antibody interference can be used in routine laboratory practice. A pragmatic approach is to indicate the potential error in either or both assays and continue to follow up with both assays and with TgAb monitoring for a change or trend. In some countries, there may be limited availability of an RIA. Differences in calibration, bias and functional sensitivity may make the use of IMA-RIA discordance more difficult at low Tg concentrations.
Heterophilic antibodies
Heterophilic antibody (HAB) interference in Tg IMAs generally results in false elevations. A prevalence of 3% was found in patients with an initial serum Tg ≥ 1 μg/L using the Beckman Access or Access 2 and the authors recommended the routine use of HAB blocking tubes to overcome this effect. 84 This type of interference and its prevalence is likely to be assay-dependent, having been reported for the Brahms Tg-plus immunoradiometric assay, 85 Immulite assay 86 and the CisBio (Bedford, MA, USA) and Access Tg IMA. 87 Thus, if a laboratory using an IMA obtains detectable serum Tg results that are inconsistent with the clinical picture or unexpected changes with time, linearity checks and/or treatment with HAB blocking tubes should be used.
The problems of TgAb and HAB interference and the care required in interpretation are exemplified in a recent study 82 of 288 consecutive DTC patients, 47 of whom were found to have an undetectable serum Tg by IMAs but had residual 131I uptake on post-ablation WBS. To investigate the possibility of antibody interference, the authors measured TgAbs by two different assays, performed Tg recovery experiments, pretreated samples with HAB blocking tubes and measured Tg by two other IMAs. Of the 47 patients, 10 were found to have TgAbs and one HAB causing a negative interference. Of the remaining 36 patients, 18 were found to have a detectable Tg by a second IMA and 18 a detectable Tg by a third IMA. Thus, after extensive laboratory investigation, of the original 47 patients with an initial undetectable serum Tg with residual 131I uptake, only nine were confirmed to have an undetectable serum Tg with no evidence of assay interference. The authors noted that their investigations were extensive and expensive such that the treatment of such difficult DTC patients should be transferred to specialist referral centres.
Given the limitations of both Tg and TgAb assays and the costs of laboratory investigations, it is likely that no single analytical strategy will necessarily identify all samples exhibiting Tg assay interference and consideration of other imaging techniques and the broader clinical picture is essential. As a minimum, it is suggested that all reports should comment on the possibility of assay interference. There are then a number of options: (1) measurement of TgAbs; (2) measurement of Tg by another IMA if the first assay is an IMA; (3) measurement of Tg by an RIA if the first assay is an IMA; (4) measurement of Tg by an IMA if the first assay is an RIA and a false-positive is suspected; (5) measurement of TgAbs by another method; (6) recovery of added Tg experiments; (7) investigation of possible HAB interference; and (8) assessment of linearity on dilution. The optimal strategy is likely to be dependent on the assays available and the feasibility and costs involved.
Conclusions
Serum Tg is a tumour marker used for monitoring treatment of DTC. Measurement is of little value pre-thyroidectomy and samples should not be collected until at least six weeks post-thyroidectomy or radioiodine ablation. Interpretation of serum Tg results requires knowledge of the extent of thyroid surgery and concurrent serum TSH concentrations. Results should not be related to a reference range obtained from euthyroid subjects with an intact thyroid.
There are important analytical considerations, namely of assay bias and interference by endogenous TgAbs, which clinical staff should be made aware of. Most assays are reported to be calibrated against the international standard CRM 457; yet, there is a significant lack of agreement between assay results. Patients should be monitored using the same assays for serum Tg. Between-assay variability of TgAbs determines that the same TgAb assay should be used.
There are limitations in classifying samples as TgAb-positive or -negative (with respect to predicting interference in the Tg assay) and a numerical result should be reported, though interference in the Tg assay cannot be ruled out on the basis of TgAb status alone. Analysis by another method (IMA or RIA) and investigation for heterophilic antibodies are alternative approaches. Each laboratory should establish their own protocol for the investigation of possible interference.
DECLARATIONS
