Factors Affecting Thyroid Cancer in Patients with Thyroid Nodules Using Logistic Regression in Interval Censored Data


avatar Jamshid Yazdani-Charati 1 , avatar Ozra Akha 2 , avatar Farideh Khosravi 3 , *

Department of Biostatistics, Health Sciences Research Center, Faculty of Health, Mazandaran University of Medical Sciences, Sari, IR Iran
Diabetes Research Center, Mazandaran University of Medical Sciences, Sari, IR Iran
Department of Biostatistics, Occupational and Environmental Health Research Center, Shahroud University of Medical Sciences, Shahroud, IR Iran

how to cite: Yazdani-Charati J, Akha O, Khosravi F. Factors Affecting Thyroid Cancer in Patients with Thyroid Nodules Using Logistic Regression in Interval Censored Data. Int J Cancer Manag. 2018;11(3):e9111. doi: 10.5812/ijcm.9111.



Thyroid nodules in the majority of cases are symptom-free; but, their clinical significance is due to being differentiated from malignancy. The outbreak of thyroid cancer (TC) is gradually soaring because of lifestyle turning into westernization in developing nations such as Iran. Thus, the current research has been performed pursuing the goal to identify the most critical risk factors in the thyroid cancerous individuals with thyroid nodules in northern Iran during a decade.


This study is of historical cohort type. A total of 33,530 files of the patients referring to the medical clinic through 2003 to 2013 have been examined and 1,817 thyroid nodule afflicted patients have been diagnosed and the data about these patients includes 70 recorded variables The data have been analyzed by logistic regression in interval Censored data at significance level 0.05 applying SPSS and Strata software.


A total of 1 594 (87.7%) of the patients consist of women in the mean age bracket 41.07 ± 13.81. Also, the mean age of those with TC is 38.15 ± 13.52. Among the most significant factors identified in thyroid malignancies in the patients with thyroid nodules, we can note severe obesity, bilateral nodules, abnormal thyroid stimulating hormone (TSH) and T4 hormones, phosphorus, calcium, and thyroglobulin.


Regarding the incidence raise of TC, it is required to identify the malignancy risk factors. It is recommended to carry out some studies on the role of abnormal T4 hormone, phosphorus, calcium, and thyroglobulin in thyroid malignancies in the patients suffering from thyroid nodules.

1. Background

Any sort of swelling, observable, and palpable lump in thyroid refers to thyroid nodule (1). Thyroid nodule is a common disease and in physical examination, it is discovered among around 3% to 7% adults (2,3). Plethora of studies indicate this issue that half of the people in society, in case of being examined via sonography, will come up with thyroid nodule and this level increases by aging (4). These nodules may be single or multiple and with performance or without performance. The prevalence of these nodules is high in the iodine deficient regions among women and the elderly. The diameter of the majority of palpable nodules is more than 1 cm. However, the palpability of a nodule depends on its location inside the gland (superficial or deep), the patient’s neck anatomy, and the examiner’s adroitness (5).

Cases such as nodule growth, voice hoarseness, vocal cords paralysis, palpable neck lymph nodes, and nodule adhesion to the surrounding tissues suggest malignancy. In addition, the patients with the record of their whole body exposed to radiation due to bone marrow transplantation or the head and neck radiation exposure are at higher risk for malignancy (6). As the reports suggest, in 2014, totally 1,665,540 new cases of cancer infection and 585,720 cancer induced deaths occurred in the US. Also, TC affliction and its induced death rate have been reported as 62,980 and 1,890 individuals, respectively (7,8).

Based on a review conducted in Italy, the outbreak of TC is rising worldwide and it has been recognized as the most prevalent endocrine cancer (1% - 1.5%) annually in the US (9,10).

One of the most important specifications of survival analysis is censored data. The censoring of the survival studies is of different types. Interval censoring occurs when an event is not observable directly and happens in a random time range. In fact, it can be asserted that a person in the variable T is an interval censored if only we know that a < T < b. The interval censored data often occurs in longitudinal studies and clinical trials, which requires more follow-up (11). The prevalence of interval censored data is rising in medical studies given the biomarkers increase in a disease development. Interval censoring results from periodically monitoring a disease progress. Logistic regression is of applied techniques for classified data analysis. In binary logistic regression, the response variable has 2 modes. One of the methods to estimate the parameters is error least squares, in which the goal is to find the regression coefficients in order to minimize the error least squares. Another approach is maximum likelihood pursuing the goal behind the model likelihood maximization is to address the model parameters (12). The logistic regression model equals:

Because of such patients frequently referring, doing check-ups, sonography, recording the data in the files, the possibility of the thyroid nodules turning into cancer and concerning the thyroid gland performance being so vital in human body, in metabolism process and its impairment significant effect on other organs, we have decided to study this issue.

2. Objectives

The current research has been performed pursuing the goal to identify the most critical risk factors in the thyroid cancerous individuals with thyroid nodules in northern Iran during a decade.

3. Methods

The current research is of historical cohort type and targets to detect the most important factors influencing thyroid cancer in the patients with thyroid nodules, referring to Endocrinology and Metabolism diseases clinic from July 2003 to mid-March 2013. The statistical community consists of the patients with glands or symptoms like pain in nodule area or suffocation, hoarseness, swelling in the neck referring to the endocrinologist, introduced by another physician or referring due to other reasons such as diabetes. The question event in the research is thyroid cancer affliction in thyroid nodule patients. Out of 33,530 files surveyed, 1,817 files of the thyroid nodule sufferers have been recorded including 169 patients with different TCs. The data about the patients cover 70 variable types. Due to the missing data being massive in the study, expectation-maximization (EM) algorithm has been employed to compute the missing data. This study was approved by forth meeting of ethics committee of Research at Mazandaran University of Medical Sciences (July 8th, 2014).

When all the data are interval censored, one of the methods used is logistic regression. In data analysis with logistic regression, the time interval has been taken as 6 months and in case of the individual referring during the considered interval, 1 and otherwise 0 has been recorded. Under the following conditions, this is useful:

- Having the sufficient number of events in each range;

- The distribution determination f (t) not being important.

For instance, the logistic model with 3 indicator variables (defined for the intervals) and an explanatory variable is as it follows:

P (y = 1) equals the probability that the event occurs in a certain range provided that it did not occur in the prior intervals (13).

Applying univariate logistic regression, some meaningful variables have entered the final logistic regression model along with the variables having P < 0.3 and the final analyses have been performed. The patients' files have been examined fully confidentially and considering the obtained results in general, the patients’ privacy has been appreciated. The data have been analyzed with SPSS and Stata software at significance level 0.05.

4. Results

The following tables include the descriptive data about some of the most principal variables existing in the study (Table 1).

Table 1. Descriptive Statistics of Some Variables in Patients with Thyroid Nodules
VariableNo. (%)VariableNo. (%)
SexT3 hormone
Female1594 (87.7)Normal464 (25.5)
Male323 (12.3)Abnormal204 (11.2)
AgeTSH hormone
< 30448 (24.7)Normal1193 (65.7)
30 - 601202 (66.2)Abnormal492 (27.1)
> 60160 (8.8)
Marital statusHemoglobin
Single1525 (84)Normal678 (37.3)
Married153 (8.4)Abnormal454 (25)
HypothyroidismHb A1C
Yes196 (10.8)Normal43 (2.4)
No1621 (89.2)Abnormal71 (3.9)
HyperthyroidismVitamin D
Yes93 (5.1)Normal193 (10.6)
No1724 (94.9)Abnormal144 (7.9)
Yes69 (3.8)Normal401 (22.1)
No1748 (96.2)Abnormal34 (1.9)
Body Mass IindexPhosphorus
< 18.526 (4.1)Normal366 (20.1)
18.5 - 25117 (6.4)Abnormal18 (1)
25 - 30174 (9.6)Thyroglobulin
30 - 35136 (7.5)Normal49 (2.7)
> 3510 (0.6)Abnormal34 (1.9)
CholesterolThyroglobulin antibodies
Normal478 (26.3)Normal33 (1.8)
Abnormal431 (23.7)Abnormal10 (0.6)
HDLRight lobe nodule
Normal313 (17.2)Yes1130 (62.3)
Abnormal150 (8.3)No687 (37.7)
LDLLeft lobe nodule
Normal320 (17.6)Yes1000 (55)
Abnormal246 (13.5)No617 (45)
TriglycerideBilateral nodules
Normal664 (36.5)Yes618 (34)
Abnormal194 (10.7)No1199 (66)
Fasting blood sugarMulti nodular goiter
Normal983 (58.3)Yes1102 (60.6)
Abnormal314 (17.3)No715 (39.4)
Blood pressureSleeping disorder
Normal748 (41.2)Yes86 (4.7)
Abnormal279 (15.4)No1731 (95.3)
T4 hormoneSore throat and irritation
Normal1486 (81.8)Yes105 (5.8)
Abnormal164 (9)No1702 (94.2)

Out of the cancer sufferers, 141 (83.4%) were women and 28% (16.6%) were men. The patients' age, when referring, has been recorded quantitatively. The patients' mean age has been yielded as 41.07 ± 13.81. Moreover, the TC suffering patients' mean age has been 38.15 ± 13.52. Their mean education years number has been 10.96 ± 4.25. The graph below illustrates annual TC incidence in thyroid nodule patients by age separation (Figure 1):

Annual TC Incidence in Thyroid Nodule Patients by Age Separation

Twenty-two (22) 6-month time intervals have been defined in the research that in case of the patient referring at that time interval has been viewed as 1 and otherwise 0.These time intervals have been considered as independent variables along with other variables in the analysis. The results from the logistic regression model fitness have been given in the table below (merely the significant variables have been reported) (Table 2):

Table 2. The Results From the Logistic Regression Model in Patients with Thyroid Nodules
VariableOdds RatioParent StatisticsS.DSig. Level
1nd half of 20050.535.980.2250.014
2nd half of 20050.459.550.2690.002
2nd half of 20070.645.070.1940.024
2nd half of 20111.45.590.1430.018
2nd half of 20121.364.510.1440.034
2nd half of 20131.456.490.1470.011
Hyperthyroidism0.2356.550.196< 0.001
Hypothyroidism0.13163.020.161< 0.001
Diabetes0.2626.620.267< 0.001
Right lobe nodule0.065179.20.205< 0.001
Left lobe nodule0.2496.930.146< 0.001
Bilateral nodule2.388.950.3480.003
Multi Nodular goiter0.09552.050.326< 0.001
Cholesterol0.4540.860.124< 0.001
T30.4312.940.237< 0.001
TSH2.98181.950.081< 0.001
Calcium4.9648.870.229< 0.001
Phosphorus4.7717.060.378< 0.001
Vitamin D0.4114.20.237< 0.001
Thyroglobulin12.9582.20.283< 0.001
Sleep disorders0.0449.491.02< 0.001
Sore throat0.865.81.020.002
Severely obese people1.3411.790.0860.016

According to the achieved results, the odds to get the cancer in the first half of 2005, if not happening in the prior intervals, equals 0.53. This odds is 0.45, 0.64, 1.4, 1.36, and 1 .45 in the second half of the years 2005, 2007, 2011, 2012, and 2013, respectively. Given the above tables, the odds ratio logarithm for the variables having bilateral nodules has been gained 2.38 so that it can be stated that the cancer infection odds in the bilateral thyroid nodules patients is 2.38 times than other sufferers. The cancer odds in the bilateral thyroid nodules with abnormal T4 hormone is 1.29 times than other sufferers .This odds has been obtained as 2.98, 4.96, 4.77, and 12.95 in the patients with abnormal TSH hormone, calcium, phosphorus, and thyroglobulin. Also, in the overweight individuals, based on the logistic regression model fitness, this odd has been gained as 1.34 times than the other patients. Other variables reported in the above table have odds ratio less than 1.

5. Discussion

In recent years, as diagnosing thyroid nodules increase, thyroid cancer incidence has been increasing across the world (14-16). The most common thyroid cancers are papillary and, then, follicular TCs that, as the studies indicate, include 85% to 90% and 10%to 15% of total TCs, respectively (17); they cover 94.1% and 3% of total cancers, respectively. As the studies in the US suggest, the raise of TCs outbreak in the US has been 6.6% in both genders from 1997 to 2009. In the present research, until 2006, the age bracket has been 30 to 60 years and in 2003, those over 60 years have had the highest TCs. According to a review paper by Pellegriti et al. in 2004 to 2008, in men, the highest TCs outbreak has been seen in the one over 65 years and in women, in 55 to 64 years (10). According to the results from the logistic regression finesse, it can be concluded that between 2011 and 2013, the patients have had the maximum TCs odds, which is true regarding the increasing trend of cancer thorough the world in recent years. TSH serves as an independent risk factor in predicting thyroid nodules malignancy (18). Various experiments on animals suggest the raise of iodine deficiency induced TCs as TSH goes up.

In the current study, the patients with high TSH will come up with 2.98% times further TC odds. The results extracted from several prospective studies display the increased risk of TC in obese individuals, consistent with this research derived results (10,19). In the present study, the TCs odds in severely obese individuals can be 1.34 times than others. The ones with multi-nodular and single nodules goiters face the same risk in thyroid malignancy; on the other hand, the prevalence of malignancy may be lower in the individuals with multi-nodular goiter, compatible with his study results (20,21). Having bilateral nodule has been identified of other risk factors behind thyroid cancers affliction, in line with the current research (5). According to the results of this study, the patients with bilateral nodules are exposed to 2.38 times more odds to get TCs than other ones.

Analyzing other variables has not been proved in TC malignancy so far. Of such variables gained as meaningful in the final fitted model and with the odds ratio higher than 1, we can mention calcium, phosphorus, thyroglobulin, and T4 hormone. It is hoped that through conducting more studies in this field in future, we access more fulfilling results on thyroid cancers risk factors in patients with thyroid nodules.

5.1. Conclusions

The incidence of TCs follows a rising trend; thus, in order to identify the risk factors in getting this disease, it is essential to identify the most critical factors. Of such factors identified in thyroid malignancy in thyroid nodules patients, severe obesity, bilateral nodules, abnormal TSH hormone, T4 hormone, calcium, phosphorus, and thyroglobulin can be pointed out.



  • 1.

    Ghani H, Akbari Jami M. Diagnostic value of FNAc in Thyroidal nodules in Kashan from 1998 to 2000. KAUMS J. 2001;5(2):71-5.

  • 2.

    Esteghamati AR, Yousefizadeh A, Rakhshan M, Mehrdad R. ‍‍Diagnostic value of FNA in thyroid nodules. IranJ Endocrinol Metab. 2001;3(3):193-203.

  • 3.

    Gharib H, Papini E, Valcavi R, Baskin HJ, Crescenzi A, Dottorini ME, et al. American Association of Clinical Endocrinologists and Associazione Medici Endocrinologi medical guidelines for clinical practice for the diagnosis and management of thyroid nodules. Endocr Pract. 2006;12(1):63-102. doi: 10.4158/EP.12.1.63. [PubMed: 16596732].

  • 4.

    Larijani B, Pajouhi M, Bastanhagh MH, Sadjadi A, Sedighi N, Eshraghian MR. Evaluation of suppressive therapy for cold thyroid nodules with levothyroxine: double-blind placebo-controlled clinical trial. Endocr Pract. 1999;5(5):251-6. doi: 10.4158/EP.5.5.251. [PubMed: 15251662].

  • 5.

    Kasper D, Fauci A, Hauser S, Longo D, Jameson J, Loscalzo J. Harrison's principles of internal medicine. 2015.

  • 6.

    Curtis RE, Rowlings PA, Deeg HJ, Shriner DA, Socie G, Travis LB, et al. Solid cancers after bone marrow transplantation. N Engl J Med. 1997;336(13):897-904. doi: 10.1056/NEJM199703273361301. [PubMed: 9070469].

  • 7.

    Lee JI, Kim SH, Tan AH, Kim HK, Jang HW, Hur KY, et al. Decreased health-related quality of life in disease-free survivors of differentiated thyroid cancer in Korea. Health Qual Life Outcomes. 2010;8:101. doi: 10.1186/1477-7525-8-101. [PubMed: 20840792].

  • 8.

    Siegel R, Desantis C, Jemal A. Colorectal cancer statistics, 2014. CA Cancer J Clin. 2014;64(2):104-17. doi: 10.3322/caac.21220. [PubMed: 24639052].

  • 9.

    Curado MP, Edwards B, Shin HR, Storm H, Ferlay J, Heanue M, et al. Cancer incidence in five continents, Volume IX. IARC Press, International Agency for Research on Cancer; 2007.

  • 10.

    Pellegriti G, Frasca F, Regalbuto C, Squatrito S, Vigneri R. Worldwide increasing incidence of thyroid cancer: update on epidemiology and risk factors. J Cancer Epidemiol. 2013;2013:965212. doi: 10.1155/2013/965212. [PubMed: 23737785].

  • 11.

    Lim HJ, Sun J. Nonparametric Tests for Interval-Censored Failure Time Data. Biometr J. 2003;45(3):263-76. doi: 10.1002/bimj.200390011.

  • 12.

    Bashiri MKRR. Parameter estimation for improving communication using binary logistic regression. J Manufactur Operat. 2011;2(1):135-54.

  • 13.

    Mosure DJ, Berman S, Kleinbaum D, Halloran ME. Predictors of Chlamydia trachomatis infection among female adolescents: a longitudinal analysis. Am J Epidemiol. 1996;144(10):997-1003. [PubMed: 8916511].

  • 14.

    Howlader N, Noone AM, Krapcho M, Neyman N, Aminou R, Waldron W, et al. SEER Cancer Statistics Review, 1975–2008. National Cancer Institute; Bethesda, MD: 2011. Bethesda: National Cancer Institute; 2011.

  • 15.

    Hundahl SA, Fleming ID, Fremgen AM, Menck HR. A National Cancer Data Base report on 53,856 cases of thyroid carcinoma treated in the U.S., 1985-1995 [see commetns]. Cancer. 1998;83(12):2638-48. [PubMed: 9874472].

  • 16.

    Leenhardt L, Grosclaude P, Cherie-Challine L, Thyroid Cancer C. Increased incidence of thyroid carcinoma in france: a true epidemic or thyroid nodule management effects? Report from the French Thyroid Cancer Committee. Thyroid. 2004;14(12):1056-60. doi: 10.1089/thy.2004.14.1056. [PubMed: 15650358].

  • 17.

    Wang F, Wang Y, Wang L, Wang X, Sun C, Xing M, et al. Strong association of high urinary iodine with thyroid nodule and papillary thyroid cancer. Tumour Biol. 2014;35(11):11375-9. doi: 10.1007/s13277-014-2397-8. [PubMed: 25119588].

  • 18.

    Zhang Y, Hua LEI, Huang J. A Spline-Based Semiparametric Maximum Likelihood Estimation Method for the Cox Model with Interval-Censored Data. Scandinavia J Statistics. 2010;37(2):338-54. doi: 10.1111/j.1467-9469.2009.00680.x.

  • 19.

    Duick DS, Klopper JP, Diggans JC, Friedman L, Kennedy GC, Lanman RB, et al. The impact of benign gene expression classifier test results on the endocrinologist-patient decision to operate on patients with thyroid nodules with indeterminate fine-needle aspiration cytopathology. Thyroid. 2012;22(10):996-1001. doi: 10.1089/thy.2012.0180. [PubMed: 22873825].

  • 20.

    Cooper DS, Doherty GM, Haugen BR, Kloos RT, Lee SL, Mandel SJ, et al. Management guidelines for patients with thyroid nodules and differentiated thyroid cancer. Thyroid. 2006;16(2):109-42. doi: 10.1089/thy.2006.16.109. [PubMed: 16420177].

  • 21.

    Ravangard R, Arab M, Rashidian A, Akbarisari A, Zare A, Zeraati H. Comparison of the results of Cox proportional hazards model and parametric models in the study of length of stay in a tertiary teaching hospital in Tehran, Iran. Acta Med Iran. 2011;49(10):650-8. [PubMed: 22071639].

Copyright © 2018, Cancer Research Center (CRC), Shahid Beheshti University of Medical Sciences. This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/) which permits copy and redistribute the material just in noncommercial usages, provided the original work is properly cited