Reproducibility of CT-based opportunistic vertebral volumetric bone mineral density measurements from an automated segmentation framework

Bodden, Jannis; Prucker, Philipp; Sekuboyina, Anjany; El Husseini, Malek; Grau, Katharina; Rühling, Sebastian; Burian, Egon; Zimmer, Claus; Baum, Thomas; Kirschke, Jan S.

doi:10.1186/s41747-024-00483-9

Original article
Open access
Published: 01 August 2024

Reproducibility of CT-based opportunistic vertebral volumetric bone mineral density measurements from an automated segmentation framework

Jannis Bodden ORCID: orcid.org/0000-0001-5997-203X¹^na1,
Philipp Prucker¹^na1,
Anjany Sekuboyina^2,3,
Malek El Husseini¹,
Katharina Grau¹,
Sebastian Rühling¹,
Egon Burian⁴,
Claus Zimmer^1,5,
Thomas Baum¹ &
…
Jan S. Kirschke^1,5

European Radiology Experimental volume 8, Article number: 86 (2024) Cite this article

136 Accesses
Metrics details

Abstract

Background

To investigate the reproducibility of automated volumetric bone mineral density (vBMD) measurements from routine thoracoabdominal computed tomography (CT) assessed with segmentations by a convolutional neural network and automated correction of contrast phases, on diverse scanners, with scanner-specific asynchronous or scanner-agnostic calibrations.

Methods

We obtained 679 observations from 278 CT scans in 121 patients (77 males, 63.6%) studied from 04/2019 to 06/2020. Observations consisted of two vBMD measurements from Δdifferent reconstruction kernels (n = 169), Δcontrast phases (n = 133), scan Δsessions (n = 123), Δscanners (n = 63), or Δall of the aforementioned (n = 20), and observations lacking scanner-specific calibration (n = 171). Precision was assessed using root-mean-square error (RMSE) and root-mean-square coefficient of variation (RMSCV). Cross-measurement agreement was assessed using Bland-Altman plots; outliers within 95% confidence interval of the limits of agreement were reviewed.

Results

Repeated measurements from Δdifferent reconstruction kernels were highly precise (RMSE 3.0 mg/cm³; RMSCV 1.3%), even for consecutive scans with different Δcontrast phases (RMSCV 2.9%). Measurements from different Δscan sessions or Δscanners showed decreased precision (RMSCV 4.7% and 4.9%, respectively). Plot-review identified 12 outliers from different scan Δsessions, with signs of hydropic decompensation. Observations with Δall differences showed decreased precision compared to those lacking scanner-specific calibration (RMSCV 5.9 and 3.7, respectively).

Conclusion

Automatic vBMD assessment from routine CT is precise across varying setups, when calibrated appropriately. Low precision was found in patients with signs of new or worsening hydropic decompensation, what should be considered an exclusion criterion for both opportunistic and dedicated quantitative CT.

Relevance statement

Automated CT-based vBMD measurements are precise in various scenarios, including cross-session and cross-scanner settings, and may therefore facilitate opportunistic screening for osteoporosis and surveillance of BMD in patients undergoing routine clinical CT scans.

Key Points

Artificial intelligence-based tools facilitate BMD measurements in routine clinical CT datasets.
Automated BMD measurements are highly reproducible in various settings.
Reliable, automated opportunistic osteoporosis diagnostics allow for large-scale application.

Graphical Abstract

Background

Osteoporosis is a systemic disease, that destabilizes the bone by demineralization of the osseous tissue and deterioration of the trabecular microstructure [1, 2]. The diagnosis is frequently delayed, because patients remain symptom free, until fragility fractures occur. Those occur in the absence of adequate trauma, but inherit significant morbidity, mortality and enormous socioeconomic consequences [3]. Demographic change aggravates this issue. In the USA, approximately 54 million people over the age of 50 were estimated to suffer from low bone mass or osteoporosis by 2010, and the number is projected to reach 71 million by 2030 [4].

Dual x-ray absorptiometry and quantitative computed tomography (CT) suited to screen for and diagnose osteoporosis are available [5, 6]. However, osteoporosis remains vastly underdiagnosed [2, 6, 7]. Part of the problem is that both methods rely on specialized tools (e.g., calibration phantoms), and, importantly, necessitate a dedicated exam, which inherits a substantial organizational effort.

Opportunistic approaches aim to overcome the limitations by determining bone mineral density (BMD) from exams performed for other indications [8]. The abundance of CT data underscores the possible impact of CT-based opportunistic BMD screening approaches: annual scans in the USA surpassed 278 per 1,000 inhabitants by 2019 [9]. However, the need for exact manual segmentations to determine trabecular volumetric BMD (vBMD) from routine clinical multidetector CT scans limited the viability of this approach so far.

Deep learning-based convolutional neural network frameworks have recently been developed to master this challenge [10]. Such neural networks automatically perform all steps of vertebral body segmentation. The conversion of asynchronous CT-based density values measured in HU into vBMD and corrections for the intravenous contrast media phase can be performed automatically [11,12,13]. However, to render this approach suitable for mass application, reproducibility has yet to be determined.

Thus, this study aimed to provide comprehensive information on the reproducibility of vBMD measurements performed by a fully automated convolutional neural network framework, utilizing routine clinical thoracoabdominal CT, obtained in different contrast media phases on a diverse set of scanner systems, with varying settings, asynchronously calibrated as well as using a manufacturer-generic, kVp-based calibration.

Methods

Study population

Patients who received at least two consecutive routine thoracoabdominal CT scans with an interscan interval of up to one month and matching regions of interest between April 2019 and June 2020 were identified from the local picture archiving and communication system. The maximum interscan interval of one month was selected to maximize the number of patients eligible for inclusion, while simultaneously aiming to rule out longitudinal changes in vBMD, which have been reported to reach 2% per year in a healthy cohort [14]. To cover the widest possible range of CT scanning systems in this study, scans performed at other institutions but that were imported into our system were also included. All scans were manually checked for lumbar spine coverage by a neuroradiology resident (P.P., 3 years of experience in spine imaging). Scans not covering the lumbar spine, as well as scans with severe beam hardening artifacts or high noise level at the lumbar spine (e.g., due to implants or other foreign material), were excluded. Further exclusion parameters were the presence of inflammatory and neoplastic lesions at the lumbar spine.

Dataset acquisition, scanner calibration, and automated vertebral body segmentation and vBMD extraction

In-house contrast-enhanced scans were performed with a bodyweight-adjusted dosage (≤ 80 kg, 80 mL; 80–100 kg, 90 mL; > 100 kg, 100 mL) of iodined contrast media (Imeron 300, Bracco Imaging Deutschland GmbH, Konstanz, Germany). Tube voltage was 120 kVp (n = 176), or 100 kVp (n = 3) for in-house scanners, with an average tube load of 200 mAs.

In-house scans were obtained on a set of four scanners (Philips Brilliance iCT 256, Philips IQon Spectral CT, and Philips Ingenuity, Philips Medical Systems, Hamburg, Germany; Siemens Somatom Definition AS + , Siemens Healthineers, Erlangen, Germany). External scans were performed on one of eight scanner types (Canon Aquilion, and Canon Aquilion PRIME, Canon Medical Systems, Amstelveen, Netherlands; Siemens Biograph, Siemens Somatom Emotion 16, Siemens Somatom Definition AS, Siemens Somatom Force and Siemens Somatom Emotion 6, Siemens Healthineers, Erlangen, Germany; and Philips Ingenuity Core 128, Philips Medical Systems, Hamburg, Germany).

All reformations with a spatial resolution of ≤ 3 mm craniocaudally and of 5 mm left-right or anterior-posterior were included. In-house scanners were asynchronously calibrated using a commercially available anthropomorphic spine phantom (QRM QSA-717 Phantom; Quality Assurance in Radiology and Medicine GmbH, Möhrendorf, Germany).

In all available reconstructions, automated spine detection, vertebral labelling, and trabecular compartment segmentation steps were performed automatically (SpineQ, version 1.0, Bonescreen GmbH, Munich, Germany, Fig. 1). HU-to-BMD calibration was performed automatically using linear conversion factors based on kVp and scanner type. Automated correction for the contrast media phase was performed using a two-dimensional DenseNET model [12]. Mean trabecular vBMD was calculated across the L1-4 lumbar vertebra (vBMD_L1-4) for each observation. Vertebrae considered as unmeasurable according to the ACR criteria [15] were excluded from this mean on a per-patient basis. Specifically, vertebrae with fractures or Modic type 3 changes affecting > 25% of the vertebrae were excluded. For quality assurance, a neuroradiologist trained in spine imaging (P.P., 3 years of experience) reviewed vertebral body segmentation, automatic contrast media phase detection, and supervised in-/exclusion of vertebrae, documenting any manual corrections, if necessary.

Group definitions

Acquisition parameters such as kVp, reconstruction kernel, and slice thickness, as well as scanner model, scan positioning, intravenous contrast media phase, and calibration status are well-known confounders of vBMD measurements [12, 16,17,18,19]. To quantify each factor’s impact on reproducibility, we defined a set of six groups with an expected increase in variance. Observations were assigned to groups based on the following criteria:

I.
Δ_recon: both measurements derived from a single acquisition but using different reconstruction kernels and / or slice thicknesses;
II.
Δ_contrast: measurements obtained from different contrast media phases, obtained in consecutive acquisitions during a single scan session;
III.
Δ_session: measurements derived from different scanning sessions at a single calibrated scanner, but both scans had the same contrast media phase;
IV.
Δ_scanner: measurements assessed in the same contrast media phase, but at two different calibrated scanners;
V.
Δ_all: measurements obtained from two different scanners, in different contrast media phases;
VI.
_{scanner-agnostic}: measurements obtained from datasets from two different scanners, at least one of which was not asynchronously calibrated, and only a kVp-specific, but scanner-independent, calibration was applied.

Groups I-V consisted of observations from asynchronously calibrated scanners only. Reconstruction kernels, slice thicknesses, and reconstruction planes were not controlled for and varied randomly as per acquisition protocol.

Statistical analysis

All statistical analyses were performed using STATA software version 13.1 (StataCorp LLC, College Station, Texas, USA).

Absolute interscan differences in vBMD_L1-4 were calculated by subtracting the vBMD_L1-4 measurement 2 from the vBMD_L1-4 measurement 1, in each observation, respectively. Relative interscan differences were calculated as percent gain or loss in vBMD_L1-4 between measurement 1 and measurement 2, in all observations. Mean absolute vBMD differences and relative vBMD differences and respective standard deviations were calculated groupwise (I–IV).

Root mean square error (RMSE) and root mean square coefficients of variation (RMSCV) were calculated as measure of variance, for each group. To further investigate agreement of both measurements on single observation level, Bland Altman plots were created including 95% confidence intervals (95% CIs) of limits of agreement (mean difference × 1.96 standard deviation of the difference). The statistical significance of group differences in coefficients of variation was assessed using unifactorial ANOVA and Tukey post-hoc test.

Observations with poor agreement, defined as absolute difference exceeding the inner boundaries of the 95% CI limits of agreement, were retrospectively reviewed by an experienced neuroradiologist, specialized on spine imaging (J.S.K., 22 years of experience), and possible factors influencing vBMD measurements were recorded manually.

Results

Group statistics

Following the criteria for inclusion, 970 observations were derived from 146 patients (61 females, 41.8%), aged 63.4 ± 15.0 years (mean ± standard deviation), for a total of 292 patient scans total (146 × 2) (Fig. 2). After excluding observations without evaluable lumbar vertebrae, 679 observations were assigned to groups (Table 1). Across all observations, mean vBMD_L1-4 was 173.4 mg calcium hydroxylapatite (CaHA) / cm³ (range 55.4–302.3) at baseline and 173.4 mg CaHA/cm³ (45.3–313.4) at follow-up. Slightly lower values were noted at measurement 2 compared to measurement 1: -0.0 ± 8.8 mg CaHA/cm³; -0.1%, p = 0.962. Observation numbers were greatest in the Δ_recon group and decreased through groups II–VI. Notably, the 133 observations assigned to Δ_contrast derived from 43 patients only, while 123 observations in the Δ_session group derived from 55 patients.

Table 1 Cohort demographics

Full size table

Reproducibility measurements

As expected, RMSCV values increased along the groups from a minimum of 1.3% in measurements obtained from identical scans (Δ_recon) to a maximum of 5.9% in the Δ_all cohort (Table 2). The RMSCV increased significantly between Δ_recon and Δ_contrast (p < 0.001) and Δ_contrast and Δ_session (p < 0.001), while the increases between Δ_session and Δ_scanner as well as Δ_scanner and Δ_all were not statistically significant (p ≥ 0.770, respectively). The RMSCV difference between Δ_all and _{scanner-agnostic} did barely not reach statistical significance (p = 0.054). As reflected in the RMSCV, vBMD_L1-4 values for both measurements in the Δ_recon group (n = 169) were extremely similar, with an average absolute difference of 0.1 ± 3.1 mg CaHA/cm³ and a relative difference of 0.1 ± 2.3%. Consequently, the group showed an overall low absolute error with an RMSE of 3.1 mg CaHA/cm³.

Table 2 Reproducibility of fully automated vBMD_L1-4 measurements in each group

Full size table

Observations containing measurements from different contrast media phases showed slightly greater dispersion, but overall, the absolute and relative errors remained low (Δ_contrast RMSE = 6.1 mg CaHA/cm³; RMSCV = 3.1%) (Fig. 3a). While measurements obtained from different scanning sessions (but the same scanner) had a greater absolute error of RMSE = 11.0 mg CaHA/cm³, the RMSCV remained below 5%. Measurements from different scanners showed greater differences (Δ_scanner absolute difference = -3.8 ± 10.0 mg CaHA/cm³; relative difference = -3.1 ± 7.8%), accompanied by increased errors (RMSE = 9.4 mg CaHA/cm³; RMSCV = 5.7%) (Fig. 3b). The relative error increased further in Δ_{all-observations}, to 5.9%. Notably, measurements in scanner-agnostic scans showed acceptable absolute (2.0 ± 10.0 mg CaHA/cm³) and relative (2.0 ± 5.9%) differences, reflected in the mean absolute (RMSE = 9.6 mg CaHA/cm³) and relative error (RMSCV = 4.2%).

Bland Altman plot analysis showed good agreement for all groups, except in Δ_session. In this group, 12 observations exceeded the lower 95% CI of the upper limit of agreement (25.5 mg CaHA/cm³) or the upper 95% CI of the lower limit of agreement (-20.15 mg CaHA/cm³) (Fig. 4). Manual review showed that all those patients were scanned twice within a short period due to severe illness with signs of hydropic decompensation, resulting in new or increasing pleural effusion (n = 11), anasarca (n = 10), mesenterial fluid injection or ascites (n = 5) and pulmonary septal thickening (n = 3). Two of the patients were intubated between baseline and follow-up and had received new abdominal drainages (Fig. 5). Exclusion of the identified outliers resulted in lower value dispersion and in improvement of absolute and relative precision errors (Δ_{session_no-outliers} n = 111; absolute difference = 0.6 ± 9.5; relative difference 0.3 ± 5.9; RMSE = 9.0; RMSCV = 4.1).

Discussion

This study investigated the reproducibility of vBMD assessments from routine clinical CT scans using a fully automated convolutional neural network framework in various settings, including cross-scanner and cross-center settings. Reproducibility was excellent for all comparisons derived from a single scanning session. This demonstrates that neither the input convolution kernel nor slice orientation or thickness diminish reproducibility of this approach, and that the contrast media phase can be effectively corrected for. Precision errors for measurements derived from two different scanning sessions or different scanners were higher, but acceptable. While measurement errors may be due in part to changes in patient positioning, we also found evidence that measurement errors may also be driven up by short-term changes in tissue water content of severely ill patients and recommend introducing hydropic decompensation as a general exclusion criterion for quantitative CT measurements.

The Δ_recon group showed excellent reproducibility of vBMD_L1-4 measurements with precision errors of approximately 1.5% or 3 mg CaHA/cm³. Similar precision has been shown for dual X-ray absorptiometry in repeated measurements without and with repositioning [20,21,22]. Since the two measurements in this group were derived from reformations created using different reformations of a single scan, the precision error is most likely attributable to slight differences in the segmentation process caused by the different reconstruction kernels, slice orientations and thicknesses, resulting in slightly different mean HU values or volume-of-interest placements by the convolutional Bonescreen neural network framework [10, 23]. Overall, the findings underline the previously reported robustness of the automated segmentation algorithm and HU-to-vBMD conversion following asynchronous calibration [10, 21, 24].

The influence of the contrast media phase on vBMD measurements is well known, especially in setups with external calibration, as the contrast medium augments the attenuation of vascularized body parts but does not influence the reference values obtained in the external phantom [8, 25, 26]. Therefore, a set of studies investigated correction methods for the contrast phase, and Rühling et al recently developed an automated model for contrast media phase detection and correction in a single-scanner setting [12, 25, 27, 28]. The study reported precision errors of 9.5 CaHA/cm³ in arterial and 4.0 mg CaHA/cm³ in portal-venous phase [12]. Contrast phase correction using phase-dependent correction factors performed similarly well in the current dataset, derived from various scanners, with absolute and relative precision errors of 6.1 mg CaHA/cm³ and 2.9% in the Δ_contrast group. Moreover, precision errors increased only slightly in the Δ_all group, compared to the Δ_session and Δ_scanner groups. This indicates that the implemented contrast media phase correction may work similarly well in cross-scanner comparisons. Modern advancements in CT, like spectral imaging, can further minimize the impact of intravenous contrast or other foreign materials on vBMD measurements and has been shown to yield promising results for vBMD measurements and in fracture prediction with good reproducibility [29,30,31]. While there is barely any use case for spectral CT in mass opportunistic osteoporosis screening due to the limited scanner availability, it may prove to be a pivotal advancement in osteoporosis diagnostics in the future.

Patient positioning and table height are known to severely impact attenuation values in multidetector CT due to x-ray field inhomogeneities and beam hardening effects being highly dependent on the scanner’s isocenter, and thereby also affect vBMD measurements [16,17,18,19]. This may in part explain the higher RMSCV and RMSE values in the Δ_session, Δ_scanner, and Δ_all groups, as all observation pairs in these groups were obtained in different scanning sessions and to a certain extent, differences in patient positioning and table height can be expected between sessions. To encounter this problem, internal calibration has been proposed as an alternative calibration method. Internal calibration uses tissue-specific HU values, e.g., fat and muscle, to calculate a scan-specific conversion factor from HU to BMD [32]. While the method showed promising results in the past, studies have found that it is not superior to asynchronous calibration [27].

Across groups with patient repositioning, retrospective examination of cases with the greatest vBMD-variability revealed that the measurements were partially derived from severely ill patients from the intensive care units of our hospital, who had fluctuating levels of intra-abdominal as well as interstitial fluid and pleural effusion between scans. This bias is particularly difficult to correct for since hydration status in intensive care unit patients may substantially fluctuate daily. We regard this finding particularly important, as hydration status is not a reported confounder for quantitative CT measurements, which thus ought to be critically revised in severely ill patients. However, since our study was not specifically designed to investigate this topic, it warrants further investigation.

Data on cross-session reproducibility for asynchronous vBMD measurements is scarce, even more so for cross-scanner settings. Previous reports of reproducibility measures for asynchronous quantitative CT in single-scanner settings showed precision errors of 3−4 mg CaHA/cm³ or 2.2−3.7% [33, 34]. Sollmann et al compared opportunistically assessed vBMD from a set of six different scanners with QCT and documented different degrees of variation per scanner; an approach that seems somewhat comparable to cross-scanner results [11]. However, the authors did not measure cross-scanner reproducibility directly. With respect to the possible bias of the hydration status, precision errors across Δ_session, Δ_scanner and Δ_all may be regarded as acceptable, with a maximum relative error of 5.9% in the Δ_all group and a maximum absolute error of 11 mg CaHA/cm³ in Δ_session. _{Scanner-agnostic} observation pairs showed similar reproducibility to asynchronously calibrated measurements from different sessions, and _{scanner-agnostic} yielded better results than Δ_all, and did only barely not reach statistical significance. In fact, both the RMSE of 10.1 mg CaHA/cm³ and the RMSCV of 3.7% were slightly lower in _{scanner-agnostic} compared to the Δ_session group and the RMSCV was markedly lower in _{scanner-agnostic} compared to Δ_all. Both results suggest that scanner-specific phantom measurements may not necessarily be needed for asynchronous calibration if kVp-specific calibration factors are available for the scanner type.

We acknowledge some limitations of our study. First, we extended the generalizability of our results by including several scanners from outside of our institution. However, despite our efforts, we did not achieve an equal distribution across all main vendors. This may have an adverse impact on the external validity of the determined reproducibility. However, from our point of view, this issue can only be solved in multicentric studies, preferably with scanner-specific asynchronous calibration. Nonetheless, we demonstrated, that even scanner-agnostic kVp-based calibration yields acceptable reproducibility, regardless of the scanner combination. Second, cross-session reproducibility was limited, as we included many severely ill patients. It remains unclear, whether the observed increases in precision errors were attributable to patient positioning or caused by pathophysiological changes to the body composition like changes in the hydration status. Since this issue has not been reported on in the literature and this study was not designed to further investigate this finding, it necessitates further investigation. However, this problem appears to be difficult, as it seems to be inherent in the typical design for this type of study, because healthy individuals would rarely receive two thoracoabdominal CT scans within a single month.

To summarize, the automatic vBMD measurements by a convolutional neural network-based tool with asynchronous calibration and automated correction for the contrast media phase investigated in this study showed good reproducibility. The slightly lower precision in cross-session and cross-scanner settings may be related to patient positioning, but also short-term changes to the patients’ body compositions, necessitating further investigations. Notably, precision was similar in cross-session settings and the group without scanner-dedicated, asynchronous phantom-based calibration. Patient positioning and body composition may thus be of interest as major determinants of reproducibility for further studies.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

BMD:: Bone mineral density
CaHA:: Calcium hydroxylapatite
CI:: Confidence interval
RMSCV:: Root mean square coefficient of variation
RMSE:: Root mean square error
vBMD:: volumetric BMD

References

NIH Consensus Development Panel on Osteoporosis Prevention, Diagnosis, and Therapy (2001) Osteoporosis prevention, diagnosis, and therapy. JAMA 285:785–795. https://doi.org/10.1001/jama.285.6.785
Article Google Scholar
Office of the Surgeon General (US) (2004) Bone health and osteoporosis: A report of the surgeon general. US Department of Health and Human Services, Office of the Surgeon General, Rockville, MD
Hernlund E, Svedbom A, Ivergård M et al (2013) Osteoporosis in the european union: Medical management, epidemiology and economic burden. A report prepared in collaboration with the international osteoporosis foundation (iof) and the european federation of pharmaceutical industry associations (efpia). Arch Osteoporos 8:136. https://doi.org/10.1007/s11657-013-0136-1
Article CAS PubMed PubMed Central Google Scholar
National Osteoporosis Foundation (2002) America’s bone health: The state of osteoporosis and low bone mass in our nation. National Osteoporosis Foundation, Washington (DC)
Link TM (2012) Osteoporosis imaging: state of the art and advanced imaging. Radiology 263:3–17. https://doi.org/10.1148/radiol.2631201201
Article PubMed PubMed Central Google Scholar
Ensrud KE, Crandall CJ (2018) Osteoporosis. Ann Intern Med 168:306–307. https://doi.org/10.7326/L17-0587
Article PubMed Google Scholar
Chesnut III CH (2001) Osteoporosis, an underdiagnosed disease. JAMA 286:2865–2866. https://doi.org/10.1001/jama.286.22.2865
Article Google Scholar
Engelke K, Chaudry O, Bartenschlager S (2023) Opportunistic screening techniques for analysis of ct scans. Curr Osteoporos Rep 21:65–76. https://doi.org/10.1007/s11914-022-00764-5
Article PubMed Google Scholar
OECD (2023) Computed tomography (ct) exams (indicator). Available via https://www.oecd-ilibrary.org/social-issues-migration-health/computed-tomography-ct-exams/indicator/english_3c994537-en. Accessed April 19 2023. https://doi.org/10.1787/3c994537-en
Sekuboyina A, Husseini ME, Bayat A et al (2021) Verse: A vertebrae labelling and segmentation benchmark for multi-detector ct images. Med Image Anal 73:102166. https://doi.org/10.1016/j.media.2021.102166
Article PubMed Google Scholar
Sollmann N, Löffler MT, El Husseini M et al (2022) Automated opportunistic osteoporosis screening in routine computed tomography of the spine: Comparison with dedicated quantitative ct. J Bone Miner Res 37:1287–1296. https://doi.org/10.1002/jbmr.4575
Article CAS PubMed Google Scholar
Rühling S, Navarro F, Sekuboyina A et al (2022) Automated detection of the contrast phase in mdct by an artificial neural network improves the accuracy of opportunistic bone mineral density measurements. Eur Radiol 32:1465–1474. https://doi.org/10.1007/s00330-021-08284-z
Article PubMed Google Scholar
Löffler MT, Jacob A, Scharr A et al (2021) Automatic opportunistic osteoporosis screening in routine ct: Improved prediction of patients with prevalent vertebral fractures compared to dxa. Eur Radiol 31:6069–6077. https://doi.org/10.1007/s00330-020-07655-2
Article CAS PubMed PubMed Central Google Scholar
Block JE, Smith R, Glueer CC, Steiger P, Ettinger B, Genant HK (1989) Models of spinal trabecular bone loss as determined by quantitative computed tomography. J Bone Miner Res 4:249–257. https://doi.org/10.1002/jbmr.5650040218
Article CAS PubMed Google Scholar
Expert Panel on Musculoskeletal I, Ward RJ, Roberts CC et al (2017) Acr appropriateness criteria((r)) osteoporosis and bone mineral density. J Am Coll Radiol 14:S189–S202. https://doi.org/10.1016/j.jacr.2017.02.018
Article Google Scholar
Bligh M, Bidaut L, White RA, Murphy WA, Stevens DM, Cody DD (2009) Helical multidetector row quantitative computed tomography (qct) precision. Acad Radiol 16:150–159. https://doi.org/10.1016/j.acra.2008.08.007
Article PubMed Google Scholar
Brett AD, Brown JK (2015) Quantitative computed tomography and opportunistic bone density screening by dual use of computed tomography scans. J Orthop Translat 3:178–184. https://doi.org/10.1016/j.jot.2015.08.006
Article PubMed PubMed Central Google Scholar
Hofmann P, Lenhart B, Sedlmair M, Schmidt B, Flort T, Mahnken A (2016) Assessment of the effects of scanning and reconstruction parameters on bone densitometry results for single energy quantitative ct (qct) scans in a phantom study. Paper presented at the European Congress of Radiology 2016, Vienna, Austria. https://doi.org/10.1594/ecr2016/C-2260
Suetens P (2017) Fundamentals of medical imaging. Cambridge university press, Cambridge Medicine. https://doi.org/10.1017/9781316671849
Tothill P, Hannan WJ (2007) Precision and accuracy of measuring changes in bone mineral density by dual-energy x-ray absorptiometry. Osteoporos Int 18:1515–1523. https://doi.org/10.1007/s00198-007-0382-4
Article CAS PubMed Google Scholar
Link TM, Kazakia G (2020) Update on imaging-based measurement of bone mineral density and quality. Curr Rheumatol Rep 22:13. https://doi.org/10.1007/s11926-020-00892-w
Article CAS PubMed PubMed Central Google Scholar
Engelke K, Gluer CC, Genant HK (1995) Factors influencing short-term precision of dual x-ray bone absorptiometry (dxa) of spine and femur. Calcif Tissue Int 56:19–25. https://doi.org/10.1007/BF00298739
Article CAS PubMed Google Scholar
Bonescreen. Available via www.bonescreen.de. Accessed 05 2024.
Löffler MT, Sollmann N, Mei K et al (2020) X-ray-based quantitative osteoporosis imaging at the spine. Osteoporos Int 31:233–250. https://doi.org/10.1007/s00198-019-05212-2
Article PubMed Google Scholar
Bauer JS, Henning TD, Müeller D, Lu Y, Majumdar S, Link TM (2007) Volumetric quantitative ct of the spine and hip derived from contrast-enhanced mdct: conversion factors. AJR Am J Roentgenol 188:1294–1301. https://doi.org/10.2214/ajr.06.1006
Article Google Scholar
Pickhardt PJ, Lauder T, Pooler BD et al (2016) Effect of iv contrast on lumbar trabecular attenuation at routine abdominal ct: correlation with dxa and implications for opportunistic osteoporosis screening. Osteoporos Int 27:147–152. https://doi.org/10.1007/s00198-015-3224-9
Article CAS PubMed Google Scholar
Kaesmacher J, Liebl H, Baum T, Kirschke JS (2017) Bone mineral density estimations from routine multidetector computed tomography: A comparative study of contrast and calibration effects. J Comput Assist Tomogr 41:217–223. https://doi.org/10.1097/rct.0000000000000518
Article PubMed PubMed Central Google Scholar
Baum T, Muller D, Dobritz M et al (2012) Converted lumbar bmd values derived from sagittal reformations of contrast-enhanced mdct predict incidental osteoporotic vertebral fractures. Calcif Tissue Int 90:481–487. https://doi.org/10.1007/s00223-012-9596-3
Article CAS PubMed Google Scholar
Nickoloff EL, Feldman F, Atherton JV (1988) Bone mineral assessment: New dual-energy ct approach. Radiology 168:223–228. https://doi.org/10.1148/radiology.168.1.3380964
Article CAS PubMed Google Scholar
Wesarg S, Kirschner M, Becker M, Erdt M, Kafchitsas K, Khan MF (2012) Dual-energy ct-based assessment of the trabecular bone in vertebrae. Methods Inf Med 51:398–405
Article CAS PubMed Google Scholar
Gruenewald LD, Koch V, Martin SS et al (2022) Diagnostic accuracy of quantitative dual-energy ct-based volumetric bone mineral density assessment for the prediction of osteoporosis-associated fractures. Eur Radiol 32:3076–3084. https://doi.org/10.1007/s00330-021-08323-9
Boden SD, Goodenough DJ, Stockham CD, Jacobs E, Dina T, Allman RM (1989) Precise measurement of vertebral bone density using computed tomography without the use of an external reference phantom. J Digit Imaging 2:31–38. https://doi.org/10.1007/bf03168013
Article CAS PubMed Google Scholar
Wang L, Su Y, Wang Q et al (2017) Validation of asynchronous quantitative bone densitometry of the spine: Accuracy, short-term reproducibility, and a comparison with conventional quantitative computed tomography. Sci Rep 7:6284. https://doi.org/10.1038/s41598-017-06608-y
Article CAS PubMed PubMed Central Google Scholar
Brown JK, Timm W, Bodeen G et al (2017) Asynchronously calibrated quantitative bone densitometry. J Clin Densitom 20:216–225. https://doi.org/10.1016/j.jocd.2015.11.001
Article CAS PubMed Google Scholar

Download references

Acknowledgements

No large language models were used for this manuscript.

Funding

The study was funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG, Grant Number 432290010; awarded to JSK and TB). Open Access funding enabled and organized by Projekt DEAL.

Author information

Jannis Bodden and Philipp Prucker contributed equally to this work.

Authors and Affiliations

Department of Neuroradiology, TUM School of Medicine, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany
Jannis Bodden, Philipp Prucker, Malek El Husseini, Katharina Grau, Sebastian Rühling, Claus Zimmer, Thomas Baum & Jan S. Kirschke
Department of Informatics, TUM School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
Anjany Sekuboyina
Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland
Anjany Sekuboyina
Department of diagnostic and interventional Radiology, University Hospital of Ulm, Ulm, Germany
Egon Burian
TUM-Neuroimaging Center, Klinikum rechts der Isar, Technical University of Munich, 81675, Munich, Germany
Claus Zimmer & Jan S. Kirschke

Authors

Jannis Bodden
View author publications
You can also search for this author in PubMed Google Scholar
Philipp Prucker
View author publications
You can also search for this author in PubMed Google Scholar
Anjany Sekuboyina
View author publications
You can also search for this author in PubMed Google Scholar
Malek El Husseini
View author publications
You can also search for this author in PubMed Google Scholar
Katharina Grau
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Rühling
View author publications
You can also search for this author in PubMed Google Scholar
Egon Burian
View author publications
You can also search for this author in PubMed Google Scholar
Claus Zimmer
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Baum
View author publications
You can also search for this author in PubMed Google Scholar
Jan S. Kirschke
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JB, PP, and JSK contributed to study conception and design. JB and PP drafted the manuscript. JB, PP, and JSK acquired the dataset analyzed in this study. JB and JSK performed statistical analyses and created the figures. AS, MEH, and JSK created the software for vBMD calculation. All authors substantively revised the manuscript, approved the final version of the manuscript, and agreed to be personally accountable for the author’s own contributions and to ensure that questions related to the accuracy or integrity of any part of the work, even ones in which the author was not personally involved, were appropriately investigated, resolved, and the resolution documented in the literature.

Corresponding author

Correspondence to Jannis Bodden.

Ethics declarations

Ethics approval and consent to participate

This retrospective study was approved by the local institutional review board (Ethikkommission der TU München, 27/19 S-SR, 04/22/2024) and written informed consent was waived.

Competing interests

JSK, AS, MEH, and SR are co-founders of Bonescreen, Munich, Germany (www.bonescreen.de). The remaining authors declare no relationships with any companies, whose products or services may be related to the subject matter of the article.

Consent for publication

Not applicable.

Additional information

Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bodden, J., Prucker, P., Sekuboyina, A. et al. Reproducibility of CT-based opportunistic vertebral volumetric bone mineral density measurements from an automated segmentation framework. Eur Radiol Exp 8, 86 (2024). https://doi.org/10.1186/s41747-024-00483-9

Download citation

Received: 02 April 2024
Accepted: 23 May 2024
Published: 01 August 2024
DOI: https://doi.org/10.1186/s41747-024-00483-9

Reproducibility of CT-based opportunistic vertebral volumetric bone mineral density measurements from an automated segmentation framework

Abstract

Background

Methods

Results

Conclusion

Relevance statement

Key Points

Graphical Abstract

Background

Methods

Study population

Dataset acquisition, scanner calibration, and automated vertebral body segmentation and vBMD extraction

Group definitions

Statistical analysis

Results

Group statistics

Reproducibility measurements

Discussion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Competing interests

Consent for publication

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords