Skip to main content
  • Original article
  • Open access
  • Published:

Image quality and metal artifact reduction in total hip arthroplasty CT: deep learning-based algorithm versus virtual monoenergetic imaging and orthopedic metal artifact reduction

Abstract

Background

To compare image quality, metal artifacts, and diagnostic confidence of conventional computed tomography (CT) images of unilateral total hip arthroplasty patients (THA) with deep learning-based metal artifact reduction (DL-MAR) to conventional CT and 130-keV monoenergetic images with and without orthopedic metal artifact reduction (O-MAR).

Methods

Conventional CT and 130-keV monoenergetic images with and without O-MAR and DL-MAR images of 28 unilateral THA patients were reconstructed. Image quality, metal artifacts, and diagnostic confidence in bone, pelvic organs, and soft tissue adjacent to the prosthesis were jointly scored by two experienced musculoskeletal radiologists. Contrast-to-noise ratios (CNR) between bladder and fat and muscle and fat were measured. Wilcoxon signed-rank tests with Holm-Bonferroni correction were used.

Results

Significantly higher image quality, higher diagnostic confidence, and less severe metal artifacts were observed on DL-MAR and images with O-MAR compared to images without O-MAR (p < 0.001 for all comparisons). Higher image quality, higher diagnostic confidence for bone and soft tissue adjacent to the prosthesis, and less severe metal artifacts were observed on DL-MAR when compared to conventional images and 130-keV monoenergetic images with O-MAR (p ≤ 0.014). CNRs were higher for DL-MAR and images with O-MAR compared to images without O-MAR (p < 0.001). Higher CNRs were observed on DL-MAR images compared to conventional images and 130-keV monoenergetic images with O-MAR (p ≤ 0.010).

Conclusions

DL-MAR showed higher image quality, diagnostic confidence, and superior metal artifact reduction compared to conventional CT images and 130-keV monoenergetic images with and without O-MAR in unilateral THA patients.

Relevance statement

DL-MAR resulted into improved image quality, stronger reduction of metal artifacts, and improved diagnostic confidence compared to conventional and virtual monoenergetic images with and without metal artifact reduction, bringing DL-based metal artifact reduction closer to clinical application.

Key points

• Metal artifacts introduced by total hip arthroplasty hamper radiologic assessment on CT.

• A deep-learning algorithm (DL-MAR) was compared to dual-layer CT images with O-MAR.

• DL-MAR showed best image quality and diagnostic confidence.

• Highest contrast-to-noise ratios were observed on the DL-MAR images.

Graphical Abstract

Background

Total hip arthroplasty (THA) is widely used as an effective treatment for pain in patients with osteoarthritis or inflammatory arthritis [1]. More than 1.8 million THA procedures are performed annually worldwide, and this number is expected to increase in the coming years [2]. Computed tomography (CT) is often the imaging modality of choice for follow-up of these patients because CT is fast, relatively inexpensive, and well suited for visualizing bone effects [3]. In addition, there is a large group of patients with THA who undergo CT scans of the pelvis or abdomen for clinical indications other than THA follow-up. However, metal hip implants introduce metal artifacts in these CT images due to scattering, beam hardening, and photon starvation effects which are typically observed as dark and bright streaking artifacts throughout the image, hampering the assessment of bone and soft tissue [4, 5].

Metal artifacts can be reduced by increasing tube voltage or by repositioning of the patient but also by more advanced reconstruction-based metal artifact reduction techniques such as orthopedic metal artifact reduction (O-MAR), which rely on the detection and replacement of projection data that is affected by metal artifacts. Another method for metal artifact reduction is the use of high-energy virtual monoenergetic images (monoE) that are useful for reduction of beam hardening artifacts and can be generated by dual energy CT or photon counting CT [5,6,7]. High-energy monoE are useful for the reduction of relatively mild artifacts but come at the cost of reduced overall image contrast. Although the optimal energy is dependent on the implant material, size, and shape, monoE of 130 keV are generally effective in reducing these mild metal artifacts in CT images with hip prostheses [8]. High-energy monoE are of limited use when dealing with more severe metal artifacts. In these cases, reconstruction-based metal artifact reduction techniques should be used [4, 5]. However, these reconstruction-based techniques may introduce secondary artifacts [6, 9,10,11].

With the emergence of deep learning (DL), numerous DL-based techniques have been developed for detection, segmentation, classification, and prediction purposes within radiology [12]. Deep learning-based metal artifact reduction techniques (DL-MAR) for CT have also shown promising results [13,14,15,16,17]. These models are typically trained using simulated CT data which allows for the synthesis of CT images with and without metal artifacts [13,14,15,16,17]. However, studies reporting on external evaluation of DL-MAR using clinical CT data are limited. Recently, a first quantitative evaluation in patients after sacroiliac joint fusion showed strong reduction of metal artifacts by a generic DL-MAR, outperforming O-MAR [18]. To date, no qualitative or quantitative evaluation of DL-MAR has been performed in THA patients and the ability of DL-MAR to reduce metal artifacts in CT images of THA patients compared to O-MAR, and high-energy monoE is unknown. Therefore, the aim of this study was to compare conventional CT with and without O-MAR, 130-keV monoE with and without O-MAR, and DL-MAR of patients with unilateral THA in terms of image quality and metal artifacts with regard to diagnostic confidence. We hypothesize that CT images corrected with DL-MAR will result in less metal artifacts, higher image quality and higher diagnostic confidence than conventional CT with and without O-MAR, and 130-keV monoE with and without O-MAR.

Methods

Patient population

Thirty-one consecutive patients with a unilateral hip prosthesis, scheduled for a CT scan in Isala Hospital Meppel, were prospectively included between August 2022 and April 2023. One patient had an additional metal implant in the arm and was therefore excluded for analysis. The study was approved by our institutional review board (number 20220412). The need for informed consent was waived.

Image acquisition and reconstruction

Patients were scanned on a Philips Spectral CT 7500 system (Philips Healthcare, Best, the Netherlands) (Table 1). Since patients were consecutively included for varying indications, slightly different scan protocols were used. Conventional polychromatic images and 130-keV monoE of the 30 patients were reconstructed with and without O-MAR. In addition, a generic DL-MAR was applied to the conventional image without O-MAR. Details about the development of this DL-MAR algorithm such as data, ground truth, data partitioning, model, training, and evaluations are described in a previous study [18]. In summary, the DL-MAR algorithm was developed using simulated data, which allowed for the synthesis of paired CT images containing metal implants with and without metal artifacts. A total of 105,163 paired images were used to train the DL-MAR algorithm using a deep residual U-NET architecture. The network was trained using a combined loss function existing in a SSIM loss, L1 loss, and L2 loss.

Table 1 Acquisition and reconstruction parameters

Subjective evaluation

All conventional images with and without O-MAR, 130-keV monoE with and without O-MAR, and the DL-MAR images were jointly assessed by two radiologists from two different hospitals to the one where DL-MAR was developed (C.F.v.D., 25 years of experience; M.M., 28 years of experience) to reach consensus per reconstruction per patient on the following questions:

  1. 1.

    How would you rate image quality in general?

  2. 2.

    How would you rate the diagnostic confidence for bone structures?

  3. 3.

    How would you rate the diagnostic confidence for pelvic organs?

  4. 4.

    How would you rate the diagnostic confidence for soft tissue adjacent to the prosthesis?

  5. 5.

    How would you rate the metal artifacts in the image?

For questions 1–4, a 5-point scale with a score from 1 to 5 was used, including very poor (1), poor (2), fair (3), good (4), and excellent (5). For question 5, a 5-point scale including severe (1), pronounced (2), moderate (3), mild (4), and none (5) was used. The CT images of two patients were used for a training session of the radiologists, and the CT images of the other 28 patients were used for the actual qualitative scoring.

MeVisLab (MeVis Medical Solutions AG, Fraunhofer MEVIS, Bremen, Germany) was used to develop an interface to present the anonymized CT images in random order alongside the scoring form. The radiologists assessed CT images in axial view in bone window (width 1,600, level 400 HU) and soft tissue window (400 and 40 HU, respectively).

Objective evaluation

Three regions of interest (ROIs) were drawn by an experienced researcher (MS) on the conventional images without O-MAR on the axial slice at the middle of the prosthesis’ head using the “Fiji” open-source platform [19]. One circular ROI with a diameter of 30 mm was placed in the bladder at the medial side of the hip prosthesis. Another circular ROI with a diameter of 19 mm was placed at the lateral side of the hip prosthesis in muscle in the area with most pronounced artifacts. A third circular ROI with a diameter of 19 mm was placed in gluteal subcutaneous fat at the side of the hip prosthesis. Only in patients with a relatively thin gluteal subcutaneous fat the diameter of this ROI was carefully resized to assure that only fat was measured. Figure 1 shows the locations where the ROIs were placed (see supplemental materials 1 for specific examples). The ROIs were copied to the exact same locations on the other reconstructed images for each patient. Mean CT values in Hounsfield units (HU) and noise as the standard deviation in HU were measured in all ROIs. Contrast-to-noise ratios (CNRs) between the bladder and fat and between muscle and fat were calculated by dividing the difference in CT values between the tissues by the mean noise values of both tissues. To assess interobserver reliability of ROI placement, ROIs were also placed in the bladder, muscle, and fat on the conventional images without O-MAR of 15 randomly selected patients by an experienced radiologist (M.F.B., 14 years of experience).

Fig. 1
figure 1

Conventional (a), 130-keV monoE (b), conventional with O-MAR (c), 130-keV monoE with O-MAR (d), and DL-MAR (e) images of a patient with unilateral total hip arthroplasty at the middle of the prosthesis’ head. Three regions of interest were drawn on the conventional CT image and subsequently copied to the other reconstructed images to measure CT values and noise as the standard deviation in HU. Window: width 1,600, level 400 HU. CT Computed tomography, O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

Statistical methods

The scores on the questionnaire were considered as continuous variables and analyzed for each question separately. The eyeball test in combination with the Shapiro-Wilk test showed that the data was not normally distributed. Wilcoxon signed-rank tests were used to test for differences between conventional images, conventional images with O-MAR, 130-keV monoE, 130-keV monoE with O-MAR, and DL-MAR. Adjusted p-values were calculated using the Holm-Bonferroni method to adjust for multiple comparisons. A significance level of 5% was used. In addition, the interobserver reliability between the two observers was calculated using the intraclass correlation coefficient for absolute agreement using a two-way mixed model. IBM SPSS software Version 27.0 (IBM Corporation, Armonk, NY, USA) was used for all statistical analyses.

Results

Subjective evaluation

Lowest scores were observed for the conventional images without O-MAR and 130-keV monoE images without O-MAR on the questions concerning image quality, diagnostic confidence for bone, diagnostic confidence for pelvic organs, diagnostic confidence for soft tissue adjacent to the prosthesis, and metal artifacts (Table 2).

Table 2 Median and interquartile range (IQR) for conventional images, 130-keV monoE, conventional images with O-MAR, 130-keV monoE with O-MAR, and DL-MAR images

The two radiologists scored image quality; diagnostic confidence for bone, pelvic organs, and soft tissue adjacent to the prosthesis; and metal artifacts significantly higher on conventional images with O-MAR and 130-keV monoE with O-MAR than on conventional images and 130-keV monoE without O-MAR (p ≤ 0.001 for all comparisons; see supplemental materials 2 for p-values of all pairwise comparisons). DL-MAR was also rated significantly higher on image quality, diagnostic confidence in all studied areas, and metal artifacts in comparison to conventional images without O-MAR and 130-keV monoE without O-MAR (p < 0.001).

Responses to questions on image quality, diagnostic confidence for bone, diagnostic confidence for soft tissue adjacent to the prosthesis, and metal artifacts on DL-MAR were significantly higher than on conventional images with O-MAR and 130-keV monoE with O-MAR (p ≤ 0.014 for all comparisons; Figs. 2, 3 and 4). No significant difference was observed when comparing scores for diagnostic confidence for pelvic organs on DL-MAR to conventional images with O-MAR (p = 0.063; Fig. 5) and 130-keV monoE with O-MAR (p = 0.013, but not significant after Holm-Bonferroni correction).

Fig. 2
figure 2

Conventional (a), 130-keV monoE (b), conventional with O-MAR (c), 130-keV monoE with O-MAR (d), and DL-MAR (e) images of a patient with unilateral total hip arthroplasty at level of the acetabulum. Artifacts on the conventional CT image and 130-keV monoE conceal a bladder cyst to the right of the prosthesis cup that is visible on the conventional image O-MAR, 130-keV monoE with O-MAR, and DL-MAR (solid yellow arrow). Although not observed in all patients, some artifacts remain between the prosthesis cup and prosthesis head on DL-MAR (open yellow arrow). Window: width 400, level 40 HU. CT Computed tomography, O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

Fig. 3
figure 3

Conventional (a), 130-keV monoE (b), conventional with O-MAR (c), 130-keV monoE with O-MAR (d), and DL-MAR (e) images of a patient with unilateral total hip arthroplasty at height of the prosthesis’ femoral stem. In this patient, metal artifacts are reduced on the images with O-MAR, but secondary artifacts are introduced resulting in degradation of cortical bone anterior to the femoral stem (yellow arrow). DL-MAR reduces artifacts without introducing these secondary artifacts. Window: width 1,600, level 400 HU. CT Computed tomography, O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

Fig. 4
figure 4

Conventional (a), 130-keV monoE (b), conventional with O-MAR (c), 130-keV monoE with O-MAR (d), and DL-MAR (e) images of a patient with unilateral total hip arthroplasty at height of the acetabulum. Severe metal artifacts on the conventional CT image and 130-keV monoE that are strongly reduced by O-MAR. Even stronger reduction can be observed on the DL-MAR images. Window: width 400, level 40 HU. CT Computed tomography, O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

Fig. 5
figure 5

Conventional (a), 130-keV monoE (b), conventional with O-MAR (c), 130-keV monoE with O-MAR (d), and DL-MAR (e) images of a patient with unilateral total hip arthroplasty at height of the acetabulum. Although not observed in all patients, some artifacts are remaining in the lesser pelvis on the DL-MAR image (yellow arrow). O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

No statistical differences between were observed when comparing the radiologists’ scores on conventional images without O-MAR to 130-keV monoE without O-MAR for image quality, diagnostic confidence for bone, pelvic organs and soft tissue adjacent to the prosthesis, and metal artifacts. Also, no significant differences were observed when comparing conventional images with O-MAR to 130-keV monoE with O-MAR for image quality, diagnostic confidence in all studied areas, and metal artifacts.

Objective evaluation

An intraclass correlation coefficient of 0.841 was calculated for CT values measured in ROIs placed in the bladder, indicating good interobserver reliability [20]. For ROIs placed in muscle and fat, intraclass correlation coefficients of 0.960 and 0.944, respectively, were observed indicating excellent interobserver agreement [20].

HU values of the bladder and muscle were significantly higher on conventional images with O-MAR and 130-keV images with O-MAR compared to conventional images and 130-keV images without O-MAR (p < 0.002 for all comparisons; Fig. 6; see supplemental materials 3 for p-values of all pairwise comparisons). DL-MAR images showed significantly higher HU values of the bladder and muscle compared to all other reconstructions (p < 0.001 for all comparisons). HU values of fat were highest on 130-keV monoE with and without O-MAR compared to the other reconstructions (p < 0.002 for all comparisons). HU values of fat on DL-MAR images were significantly higher compared to conventional images with O-MAR (p = 0.004).

Fig. 6
figure 6

Boxplots of HU in the bladder (a), muscle (b), and fat (c) for conventional, 130-keV monoE, conventional with O-MAR, 130-keV monoE with O-MAR, and DL-MAR images of patients with unilateral total hip arthroplasty. All comparisons between reconstructions were statistically significant (p ≤ 0.05) except for those indicated by n.s. O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

Noise in the bladder and fat was lower on 130-keV monoE without O-MAR compared to conventional images without O-MAR (p < 0.001 for all comparisons; Fig. 7). Conventional images with O-MAR showed lower noise in the bladder and muscle compared to conventional images without O-MAR and 130-keV images without O-MAR (p < 0.002 for all comparisons). Noise in fat was lower on conventional images with O-MAR compared to conventional images without O-MAR (p < 0.001), while noise in fat was higher on conventional images with O-MAR compared to 130 keV without O-MAR (p < 0.001). Noise values in the bladder, muscle, and fat were lower on 130-keV monoE with O-MAR compared to conventional images with and without O-MAR and 130-keV monoE without O-MAR (p < 0.002 for all comparisons). DL-MAR showed lower noise in the bladder compared to conventional images without O-MAR and 130-keV monoE without O-MAR (p < 0.001 for all comparisons) and lower noise in muscle and fat than all other reconstructions (p < 0.002 for all comparisons).

Fig. 7
figure 7

Boxplots of noise as the standard deviation in HU in the bladder (a), muscle (b), and fat (c) for conventional, 130-keV monoE, conventional with O-MAR, 130-keV monoE with O-MAR, and DL-MAR images of patients with unilateral total hip arthroplasty. All comparisons between reconstructions were statistically significant (p ≤ 0.05) except for those indicated by n.s. O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

CNR between muscle and fat and CNR between the bladder and fat were significantly higher on conventional images with O-MAR than on conventional images without O-MAR and 130-keV monoE without O-MAR (p < 0.001; Fig. 8). Monochromatic 130-keV images with O-MAR showed a significantly higher CNR between muscle and fat compared to conventional images without O-MAR (p = 0.003) and 130-keV monoE without O-MAR (p = 0.005). Monochromatic 130-keV images with O-MAR also showed a significantly higher CNR between the bladder and fat compared to conventional images without O-MAR (p = 0.016) and 130-keV monoE without O-MAR (p = 0.002).

Fig. 8
figure 8

Boxplots of contrast-to-noise ratios (CNR) between muscle and fat and the bladder and fat for conventional, 130-keV monoE, conventional with O-MAR, 130-keV monoE with O-MAR, and DL-MAR images of patients with unilateral total hip arthroplasty. All comparisons between reconstructions were statistically significant (p ≤ 0.05) except for those indicated by n.s. O-MAR Orthopedic metal artifact reduction, DL-MAR Deep learning-based metal artifact reduction

CNR between muscle and fat was significantly higher on DL-MAR images compared to all other reconstructed images (p < 0.001 for all comparisons). CNR between the bladder and fat was also significantly higher on DL-MAR compared to conventional images without O-MAR (p < 0.001), 130-keV monoE without O-MAR (p < 0.001), conventional images with O-MAR (p = 0.010), and 130-keV monoE with O-MAR (p < 0.001). No significant difference in CNR was observed when comparing conventional images without O-MAR to 130-keV monoE without O-MAR and when comparing conventional images with O-MAR to 130-keV monoE with O-MAR.

Discussion

In this study, we compared a novel generic DL-MAR algorithm for the reduction of metal artifacts to an established metal artifact reduction technique, O-MAR, in CT scans with unilateral THA. Strongest reduction of metal artifacts, highest image quality, and highest diagnostic confidence in bone and soft tissue adjacent to the prosthesis were observed on the DL-MAR images compared to conventional images and 130-keV monoE with or without O-MAR. This study is the first to perform a prospective evaluation of DL-MAR in patients with unilateral THA, thereby moving the field of DL-based MAR closer to the clinical application of deep learning-based methods for metal artifact reduction.

Although DL-MAR can greatly reduce metal artifacts resulting in improved diagnostic confidence and image quality, some metal artifacts were still present. In particular, metal artifact reduction was not shown consistently in the lesser pelvis on DL-MAR images. This may explain why there was no significant difference in diagnostic confidence for pelvic organs between DL-MAR images and conventional images with O-MAR and 130-keV monoE with O-MAR. However, the lack of a significant difference may also be due to the relatively small study population. In particular, severe metal artifacts sometimes remain present on DL-MAR images, for example, between the cup and head of the prosthesis. This may be explained by the fact that the conventional image is used as the input for DL-MAR. When artifacts on the conventional image are so severe that they completely obscure anatomical structures, then there is no information in the conventional image available that DL-MAR can use to reduce metal artifacts. In contrast to DL-MAR, O-MAR is able to depict the prosthesis cup accurately, probably because O-MAR has access to the projection data. This indicates what has already been suggested in simulation studies, i.e., that the use of projection data may further improve the ability of DL-MAR to reduce metal artifacts [13, 14, 21,22,23]. However, the development and clinical application of such an algorithm are challenging as this requires the cooperation of CT vendors to provide access to the projection data.

Conventional images with O-MAR were rated significantly higher by two experienced radiologists in terms of image quality, diagnostic confidence, and reduction of metal artifacts compared to conventional images without O-MAR, which is consistent with results of previous studies [24,25,26,27,28,29]. Our finding that 130-keV images with O-MAR were of higher image quality, diagnostic confidence showed stronger reduction of metal artifacts in comparison to 130-keV image without O-MAR is also in line with previous studies [6, 29, 30]. We did not find statistical differences between 130-keV monoE with O-MAR and conventional images with O-MAR or 130-keV monoE without O-MAR and conventional images without O-MAR, while several previous studies did report a significant difference between these images [27, 29, 30]. Laukamp et al. found stronger reduction of metal artifacts and improved visibility of bone, muscle, and pelvic organs when monoE of 110, 150, and 200 keV were compared to conventional images [27]. Yoo et al. found improved visibility of periprosthetic bone when comparing a set of multiple high-energy monoE ranging from 50 to 200 keV to conventional images [29]. Furthermore, Neuhaus et al. observed stronger reduction of metal artifacts and improved visibility of pelvic organs and bone in 180- and 200-keV monoE images with O-MAR compared to conventional images with O-MAR [30]. The discrepancy between the results of our study and the studies of Laukamp et al. [27], Yoo et al. [29], and Neuhaus et al. [30] may be explained by the fact that they included patients with bilateral prostheses that introduce more severe metal artifacts than the unilateral prostheses included in our study.

Studies comparing DL-based metal artifact reduction with widely used metal artifact reduction techniques are scarce. One quantitative study reported stronger reduction of metal artifacts by DL-MAR compared to O-MAR in patients scanned before and after sacroiliac joint fusion [18]. Other studies investigating DL-based metal artifact reduction focused on development of the algorithm rather than clinical evaluation. They typically evaluated one or two cases without comparison to commercial metal artifact reduction techniques or high-energy monoE [13, 16, 17, 22, 31,32,33]. Further studies are needed to evaluate the potential of DL-MAR in clinical practice, preferably using paired pre-surgery and post-surgery CT-images with and without metal because CT images without metal provide the most optimal reference within patients. These studies should focus not only on THA or sacroiliac joint fusion but also on other implants because DL-MAR was trained on a wide variety of metal implants. Furthermore, future research focuses on individual pathologies to investigate the clinical utility of DL-MAR.

This study has several limitations. First, the shape, size, and material of hip prosthesis affects the severity of metal artifacts [5, 8]. However, to investigate this further would require a much larger cohort of patients and, given our prospective design, a much longer inclusion period. We chose to analyze consecutive patients which reflect the presentation of hip prosthesis in CT images in clinical practice. Second, the images were scored by two radiologists, while scoring by more radiologists could be a more comprehensive approach. However, the differences in image quality and metal artifacts between the reconstructions studied are large, and both radiologists were experienced MSK radiologists in the assessment of CT images with metal artifacts. Therefore, we expect similar results if more radiologists were asked to evaluate the images. Finally, the radiologists assessed the images in axial view, whereas the coronal and sagittal views are also provided in clinical practice. However, we expect similar results if the other views had been provided because the image quality, metal artifacts, and diagnostic confidence for bone, pelvic organs, and soft tissue adjacent to the prosthesis could be assessed very well on the axial images.

Based on our results, we conclude that DL-MAR yields superior image quality and diagnostic confidence, and strongest reduction of metal artifacts compared to conventional images and 130-keV monoE with or without O-MAR in patients with unilateral THA.

Availability of data and materials

The datasets and material used during the current study are available from the corresponding author on reasonable request.

Abbreviations

CNR:

Contrast-to-noise ratio

CT:

Computed tomography

DL:

Deep learning

DL-MAR:

DL-based metal artifact reduction

MonoE:

Monoenergetic images

O-MAR:

Orthopedic metal artifact reduction

ROI:

Region of interest

THA:

Total hip arthroplasty

References

  1. Crawford RW, Murray DW (1997) Total hip replacement: indications for surgery and risk factors for failure. Ann Rheum Dis 56:455–457. https://doi.org/10.1136/ARD.56.8.455

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Pabinger C, Lothaller H, Portner N, Geissler A (2018) Projections of hip arthroplasty in OECD countries up to 2050. Hip Int 28:498–506. https://doi.org/10.1177/1120700018757940

  3. Boomsma MF, Warringa N, Edens MA, et al (2016) Quantitative analysis of orthopedic metal artefact reduction in 64-slice computed tomography scans in large head metal-on-metal total hip replacement, a phantom study. Springerplus. https://doi.org/10.1186/s40064-016-2006-y

  4. Wellenberg RHH, Hakvoort ET, Slump CH, Boomsma MF, Maas M, Streekstra GJ (2018) Metal artifact reduction techniques in musculoskeletal CT-imaging. Eur J Radiol 107:60-69. https://doi.org/10.1016/j.ejrad.2018.08.010

  5. Gjesteby L, De Man B, Jin Y et al (2016) Metal artifact reduction in CT: where are we after four decades? IEEE Access. https://doi.org/10.1109/ACCESS.2016.2608621

    Article  Google Scholar 

  6. Selles M, Stuivenberg VH, Wellenberg RHH et al (2021) Quantitative analysis of metal artifact reduction in total hip arthroplasty using virtual monochromatic imaging and orthopedic metal artifact reduction, a phantom study. Insights Imaging 12:171. https://doi.org/10.1186/S13244-021-01111-5

    Article  PubMed  PubMed Central  Google Scholar 

  7. Rau A, Straehle J, Stein T et al (2023) Photon-counting computed tomography (PC-CT) of the spine: impact on diagnostic confidence and radiation dose. Eur Radiol 1:1–9. https://doi.org/10.1007/S00330-023-09511-5/TABLES/4

    Article  Google Scholar 

  8. Wellenberg RHH, Boomsma MF, van Osch JAC, et al (2017) Quantifying metal artefact reduction using virtual monochromatic dual-layer detector spectral CT imaging in unilateral and bilateral total hip prostheses. Eur J Radiol. https://doi.org/10.1016/j.ejrad.2017.01.002

  9. Selles M, Korte JH, Boelhouwers HJ, et al (2022) Metal artifact reduction in computed tomography: Is it of benefit in evaluating sacroiliac joint fusion? Eur J Radiol 148. https://doi.org/10.1016/J.EJRAD.2022.110159

  10. Bolstad K, Flatabø S, Aadnevik D, et al (2018) Metal artifact reduction in CT, a phantom study: subjective and objective evaluation of four commercial metal artifact reduction algorithms when used on three different orthopedic metal implants. Acta Radiol https://doi.org/10.1177/0284185117751278

  11. Pessis E, Sverzut JM, Campagna R, et al (2015) Reduction of metal artifact with dual-energy CT: virtual monospectral imaging with fast kilovoltage switching and metal artifact reduction software. Semin Musculoskelet Radiol. https://doi.org/10.1055/s-0035-1569256

  12. Montagnon E, Cerny M, Cadrin-Chênevert A et al (2020) Deep learning workflow in radiology: a primer. Insights Imaging 11:1–15. https://doi.org/10.1186/S13244-019-0832-5/TABLES/2

    Article  Google Scholar 

  13. Zhou B, Chen X, Zhou SK, et al (2022) DuDoDR-Net: dual-domain data consistent recurrent network for simultaneous sparse view and metal artifact reduction in computed tomography. Med Image Anal 75. https://doi.org/10.1016/J.MEDIA.2021.102289

  14. Lin WA, Liao H, Peng C, et al (2019) DuDoNet: Dual domain network for CT metal artifact reduction. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

  15. Ghani MU, Karl WC (2019) Fast enhanced CT metal artifact reduction using data domain deep learning. IEEE Trans Comput Imaging. https://doi.org/10.1109/tci.2019.2937221

  16. Bauer DF, Ulrich C, Russ T, et al (2021) End-to-end deep learning CT image reconstruction for metal artifact reduction. Appl Sci 2022 12:404. 12:404. https://doi.org/10.3390/APP12010404

  17. Zhang Y, Yu H (2018) Convolutional neural network based metal artifact reduction in X-ray computed tomography. IEEE Trans Med Imaging 37:1370–1381. https://doi.org/10.1109/TMI.2018.2823083

    Article  PubMed  PubMed Central  Google Scholar 

  18. Selles M, Slotman DJ, van Osch JAC et al (2023) Is AI the way forward for reducing metal artifacts in CT? Development of a generic deep learning-based method and initial evaluation in patients with sacroiliac joint implants. Eur J Radiol 163:110844. https://doi.org/10.1016/j.ejrad.2023.110844

    Article  PubMed  Google Scholar 

  19. Schindelin J, Arganda-Carreras I, Frise E et al (2012) Fiji: an open-source platform for biological-image analysis. Nat Methods 9:676–682. https://doi.org/10.1038/nmeth.2019

    Article  CAS  PubMed  Google Scholar 

  20. Koo TK, Li MY (2016) A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. https://doi.org/10.1016/j.jcm.2016.02.012

  21. Wang T, Xia W, Huang Y, et al (2021) DAN-Net: dual-domain adaptive-scaling non-local network for CT metal artifact reduction. Phys Med Biol 66. https://doi.org/10.1088/1361-6560/AC1156

  22. Gjesteby L, Shan H, Yang Q et al (2019) A dual-stream deep convolutional network for reducing metal streak artifacts in CT images. Phys Med Biol 64:235003. https://doi.org/10.1088/1361-6560/AB4E3E

    Article  CAS  PubMed  Google Scholar 

  23. Arabi H, Zaidi H (2021) Deep learning–based metal artefact reduction in PET/CT imaging. Eur Radiol 31:6384–6396. https://doi.org/10.1007/S00330-021-07709-Z/TABLES/2

    Article  PubMed  PubMed Central  Google Scholar 

  24. Andersson KM, Norrman E, Geijer H et al (2016) Visual grading evaluation of commercially available metal artefact reduction techniques in hip prosthesis computed tomography. Br J Radiol. https://doi.org/10.1259/bjr.20150993

    Article  PubMed  Google Scholar 

  25. Andersson KM, Nowik P, Persliden J, et al (2015) Metal artefact reduction in CT imaging of hip prostheses-an evaluation of commercial techniques provided by four vendors. Br J Radiol. https://doi.org/10.1259/bjr.20140473

  26. Wellenberg RHH, Boomsma MF, Van Osch JAC, et al (2016) Computed tomography imaging of a hip prosthesis using iterativemodel-based reconstruction and orthopaedicmetal artefact reduction: a quantitative analysis. J Comput Assist Tomogr. https://doi.org/10.1097/RCT.0000000000000449

  27. Laukamp KR, Lennartz S, Neuhaus V-F et al (2018) CT metal artifacts in patients with total hip replacements: for artifact reduction monoenergetic reconstructions and post-processing algorithms are both efficient but not similar. Eur Radiolo 28:4524–4533. https://doi.org/10.1007/S00330-018-5414-2

    Article  Google Scholar 

  28. Park J, Kim SH, Han JK (2019) Combined application of virtual monoenergetic high keV images and the orthopedic metal artifact reduction algorithm (O-MAR): effect on image quality. Abdom Radiol (NY) 44:756–765. https://doi.org/10.1007/S00261-018-1748-0/METRICS

    Article  Google Scholar 

  29. Yoo HJ, Hong SH, Choi JY, Chae HD (2022) Comparison of metal artifact reduction algorithms in patients with hip prostheses: virtual monoenergetic images vs. orthopedic metal artifact reduction. J Korean Soc Radiol 83. https://doi.org/10.3348/JKSR.2021.0130

  30. Neuhaus V, Grosse Hokamp N, Zopfs D et al (2019) Reducing artifacts from total hip replacements in dual layer detector CT: combination of virtual monoenergetic images and orthopedic metal artifact reduction. Eur J Radiol. https://doi.org/10.1016/j.ejrad.2018.12.008

    Article  PubMed  Google Scholar 

  31. Yu L, Zhang Z, Li X et al (2021) Metal artifact reduction in 2D CT images with self-supervised cross-domain learning. Phys Med Biol 66:175003. https://doi.org/10.1088/1361-6560/AC195C

    Article  Google Scholar 

  32. Busi M, Kehl C, Frisvad JR, Olsen UL (2022) Metal artifact reduction in spectral X-ray CT using spectral deep learning. J Imaging 8. https://doi.org/10.3390/JIMAGING8030077

  33. Liao H, Lin WA, Zhou SK, Luo J (2020) ADN: artifact disentanglement network for unsupervised metal artifact reduction. IEEE Trans Med Imaging 39:634–643. https://doi.org/10.1109/TMI.2019.2933425

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

No large language models were used to create this article.

Funding

The Department of Radiology, Isala, Zwolle, the Netherlands, has established a research collaboration with Philips Healthcare regarding metal artifact reduction. This work was supported by a research exhibit with Philips Healthcare (Exhibit B-12: Photon Counting in Metal Artifact Reduction). However, the subsiding party had no decisive role in deep learning model development, data collection, data analysis, nor data interpretation.

Author information

Authors and Affiliations

Authors

Contributions

All authors made substantial contributions to the conception of this work. MS, RHHW, IMN, KFvD, MM, and MFB designed this work. Data acquisition was done by MS, KFvD, and MM, while data interpretation was performed by MS, IMN, RHHW, and MFB. MS, DJS, and JACvO created the DL-MAR that was used. Finally, MS drafted the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Mark Selles.

Ethics declarations

Ethics approval and consent to participate

Ethical approval was obtained from our institutional review board (ref. number: 20220412). The need for informed consent was waived because all data was handled anonymously.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplemental materials 1.

Three examples (a-c) of placement of regions of interest (ROI) in the bladder (ROI 1), muscle (ROI 2) and fat (ROI 3). Supplemental materials 2A. P-values for pairwise comparisons of the reconstructed images for image quality. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 2B. P-values for pairwise comparisons of the reconstructed images for diagnostic confidence for bone structures.*statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 2C. P-values for pairwise comparisons of the reconstructed images for diagnostic confidence for pelvic organs.*statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 2D. P-values for pairwise comparisons of the reconstructed images for diagnostic confidence for soft tissue adjacent to the prosthesis. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 2E. P-values for pairwise comparisons of the reconstructed images for metal artifacts. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 3A. P-values for pairwise comparisons of Hounsfield units in bladder. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 3B. P-values for pairwise comparisons of Hounsfield units in muscle. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 3C. P-values for pairwise comparisons of Hounsfield units in fat. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 3D. P-values for pairwise comparisons of noise as the standard deviation in Hounsfield units in bladder. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 3E. P-values for pairwise comparisons of noise as the standard deviation in Hounsfield units in muscle. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction. Supplemental materials 3F. P-values for pairwise comparisons of noise as the standard deviation in Hounsfield units in fat. *statistically significant after Holm-Bonferroni correction. Abbreviations: keV: kiloelectron volt, O-MAR: orthopedic metal artifact reduction, DL-MAR: deep learning based metal artifact reduction.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Selles, M., Wellenberg, R.H.H., Slotman, D.J. et al. Image quality and metal artifact reduction in total hip arthroplasty CT: deep learning-based algorithm versus virtual monoenergetic imaging and orthopedic metal artifact reduction. Eur Radiol Exp 8, 31 (2024). https://doi.org/10.1186/s41747-024-00427-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s41747-024-00427-3

Keywords