Improving image quality of sparse-view lung tumor CT images with U-Net

Ries, Annika; Dorosti, Tina; Thalhammer, Johannes; Sasse, Daniel; Sauter, Andreas; Meurer, Felix; Benne, Ashley; Lasser, Tobias; Pfeiffer, Franz; Schaff, Florian; Pfeiffer, Daniela

doi:10.1186/s41747-024-00450-4

Original article
Open access
Published: 03 May 2024

Improving image quality of sparse-view lung tumor CT images with U-Net

Annika Ries^1,2^na1,
Tina Dorosti ORCID: orcid.org/0000-0002-5747-3250^1,2,3^na1,
Johannes Thalhammer^1,2,3,4,
Daniel Sasse³,
Andreas Sauter³,
Felix Meurer³,
Ashley Benne^3,4,
Tobias Lasser^2,5,
Franz Pfeiffer^1,2,3,4,
Florian Schaff^1,2 &
…
Daniela Pfeiffer^3,4

European Radiology Experimental volume 8, Article number: 54 (2024) Cite this article

488 Accesses
1 Altmetric
Metrics details

Abstract

Background

We aimed to improve the image quality (IQ) of sparse-view computed tomography (CT) images using a U-Net for lung metastasis detection and determine the best tradeoff between number of views, IQ, and diagnostic confidence.

Methods

CT images from 41 subjects aged 62.8 ± 10.6 years (mean ± standard deviation, 23 men), 34 with lung metastasis, 7 healthy, were retrospectively selected (2016–2018) and forward projected onto 2,048-view sinograms. Six corresponding sparse-view CT data subsets at varying levels of undersampling were reconstructed from sinograms using filtered backprojection with 16, 32, 64, 128, 256, and 512 views. A dual-frame U-Net was trained and evaluated for each subsampling level on 8,658 images from 22 diseased subjects. A representative image per scan was selected from 19 subjects (12 diseased, 7 healthy) for a single-blinded multireader study. These slices, for all levels of subsampling, with and without U-Net postprocessing, were presented to three readers. IQ and diagnostic confidence were ranked using predefined scales. Subjective nodule segmentation was evaluated using sensitivity and Dice similarity coefficient (DSC); clustered Wilcoxon signed-rank test was used.

Results

The 64-projection sparse-view images resulted in 0.89 sensitivity and 0.81 DSC, while their counterparts, postprocessed with the U-Net, had improved metrics (0.94 sensitivity and 0.85 DSC) (p = 0.400). Fewer views led to insufficient IQ for diagnosis. For increased views, no substantial discrepancies were noted between sparse-view and postprocessed images.

Conclusions

Projection views can be reduced from 2,048 to 64 while maintaining IQ and the confidence of the radiologists on a satisfactory level.

Relevance statement

Our reader study demonstrates the benefit of U-Net postprocessing for regular CT screenings of patients with lung metastasis to increase the IQ and diagnostic confidence while reducing the dose.

Key points

• Sparse-projection-view streak artifacts reduce the quality and usability of sparse-view CT images.

• U-Net-based postprocessing removes sparse-view artifacts while maintaining diagnostically accurate IQ.

• Postprocessed sparse-view CTs drastically increase radiologists’ confidence in diagnosing lung metastasis.

Graphical Abstract

Background

Lung cancer maintains the highest mortality rate for malignancies around the globe, with more than 2.2 million new cases recorded worldwide in 2020 [1, 2]. More than half of all cancerous lung tumor diagnoses have reached a progressive stage by the time patients present with symptoms [3]. Regular screenings enable early detection and thereby increase survival rates [3, 4].

Computed tomography (CT) is considered standard practice in present-day medicine for diagnosing lung nodules [4,5,6], but this comes with the cost of radiation exposure [7, 8]. To make regular screenings possible, a tradeoff between radiation dose and image quality (IQ) must be found [4]. Sparse-view CT is a technique for dose reduction. However, this technique leads to a degradation of image quality due to distinct streak artifacts caused by a limited number of projection views in the reconstruction process [9, 10].

Machine learning approaches have shown promising results for sparse-view artifact correction [9,10,11,12,13,14]. Specifically, residual learning has delivered superior results compared to the direct approach [11, 12]. The goal of the network in residual learning is to estimate the difference between sparse-view and full-view images. In a direct approach, the network aims to predict the artifact-free image. The simpler topological structure of residual images allows for more efficient learning [12]. A popular network architecture for such artifact-correction tasks is the U-Net [15]. With a large receptive field, the model is capable of handling global artifacts such as the sparse-view streak artifacts [11, 12]. The dual-frame U-Net was proposed as a more robust variant of the standard U-Net for the task at hand [13].

In this work, we assess the performance of the dual-frame U-Net in correcting for streak artifacts present in sparse-view CT scans of the lung with metastasis [13]. An image reconstructed from 2,048 views, later referred to as a full-view image, was taken to calculate the residual image. Six levels of subsampled input images were reconstructed from 16, 32, 64, 128, 256, and 512 views, respectively. By conducting a reader study with the unprocessed sparse-view images and their U-Net postprocessed counterpart images, we aim to find the best tradeoff between the number of views, IQ, and confidence of the participating radiologists on their diagnosis.

Methods

The code is available at https://github.com/tidorosti/Reader-Study_UNet-Processed-Lung-Cancer-CT.

Datasets

We received approval from the institutional review board, and the requirement for written informed consent was waived, as all original data (acquired between January 2016 and December 2018) was analyzed anonymously and retrospectively. Seven CT images from seven subjects without lung metastasis, additional pleural effusion, atelectasis, or other lung diseases were selected as the healthy controls. A total of 16,023 CT images from 42 subjects were considered for the diseased group such that all images presented exactly one lung metastatic nodule of size roughly from 1 to 2 cm in diameter. As we aimed to focus solely on subjects with lung metastasis and without other lung diseases, the following exclusion process was applied: after the elimination of cases with perihilar localization of metastases, 14,578 images from 38 subjects remained. Next, images with pleural effusion, atelectasis, or other lung diseases were removed. Finally, 8,670 images from 34 subjects with metastatic lung nodules were selected as the diseased group. The complete dataset consisted of 8,677 images from 41 subjects (34 diseased and 7 healthy subjects). From this, independent datasets were utilized for model assessment (8,658 images from 22 diseased subjects) and the reader study (19 images corresponding to one image per subject; 12 diseased and 7 healthy subjects). Additional 9,481 images from the Luna16 external dataset [16, 17] were utilized for testing the model's robustness. Table 1 shows the subject demographics for the internal datasets.

Table 1 Subject demographics for internal datasets (n = 41)

Full size table

Data preparation

The CT images were forward projected onto 2,048-view sinograms. Sparse-view CT data subsets at varying levels of undersampling were generated using the filtered back projection algorithm with 16, 32, 64, 128, 256, and 512 views, respectively. The full-view data was generated using 2,048 views. All operations were performed using the Astra toolbox (version 2.1.1) [18,19,20]. Images were of size 512 × 512 pixels. The intensity values of all images were clipped to the lung CT window (width 1,700, level -600 HU) and normalized to a range between zero and one.

Twenty-two of the diseased subjects were split on CT scan level into train (n = 12, images = 4,723), validation (n = 2, images = 787), and test sets (n = 8, images = 3,148). The residual ground truth label images were calculated as the difference between the full-view and the sparse-view images for each projection view. The final postprocessed image was the pure-artifact U-Net prediction subtracted from the sparse-view input.

Network architecture

The dual-frame U-Net was utilized, as depicted in Fig. 1. The contracting path consists of four subsequently applied encoder blocks, each with two convolution layers (3 × 3 kernels, followed by batch normalization and a rectified linear unit activation). A 2 × 2 max pooling layer is applied after each encoder block. Following the two convolution layers in the bottleneck, the features are upsampled with four subsequently applied decoder blocks mirroring the contracting path via a 2 × 2 upsampling with nearest neighbor interpolative resizing before each decoder block. The dual-frame U-Net introduces additional skip connections, bridging the output of each encoder block after pooling to the input of the associated decoder block before upsampling. These additional connections ensure the frame condition is met, thereby reducing blurring and image artifacts. The final image is obtained with a 1 × 1 convolution [13].

A train-validation-test split method was chosen instead of a cross-validation method due to time and computation constraints. The training data was randomly selected from all the available internal data on a patient level using Python’s built-in random function. The proposed model was additionally tested on an external test set to ensure the robustness of the final model, and it was concluded that the train-validation-test split method did not hinder model performance.

An NVIDIA RTX A4000 graphics card with 16 GB of VRAM was utilized to train this dual-frame U-Net with 21,971,584 parameters. The network was implemented by the Keras interface of the TensorFlow library (version 2.4.0), randomly initialized, and trained individually for each number of projection views [21, 22]. The sparse-view images were taken as input, and the residual images were taken as labels. No data augmentation was applied as the model achieved comparable results for the training and validation set without overfitting. Mean squared error (MSE) loss with an adaptive moment estimation optimizer was utilized. Early stopping was implemented if validation loss did not improve. Training took place for a maximum of \(n=30\) epochs and a batch size of six. The initial learning rate \(lr\) was set to \(lr=0.001\) and decayed exponentially per epoch following \({lr}_{n}={lr}_{n-1}\cdot {e}^{-0.1}.\) The model with the smallest validation loss among all epochs was chosen for inference on the test sets and the reader study. The quality of postprocessed images was evaluated with the MSE and the structural similarity index measure (SSIM) metrics [23].

The dual-frame U-Net was chosen as it generated robust outputs and had a comparable computational effort as the standard U-Net. More specifically, The test data was analyzed with both the dual-frame and the standard U-Net, and there were no major differences in the MSE and SSIM values between the two models. Furthermore, the number of model parameters and the computation time were also comparable. Lastly, our expert radiologist (D.P.) examined the data and concluded that images postprocessed with the dual-frame U-Net more accurately display medically relevant structures, such as small vessels.

Multireader study and statistical analysis

CT scans from 19 subjects (12 diseased, 7 healthy) were considered for this single-blinded study. Three board-certified radiologists and an in-training radiologist, respectively with 15 (D.P.), 11 (A.S.), 10 (F.M.), and 5 (D.S.) years of experience in chest radiology, participated in the study. Using the full-view images, D.P. selected a representative slice per subject and marked the ground truth lung nodule segmentation (\(1.11 \left[0.91, 1.31\right] {\text{cm}}\) diameter given as mean with 95% confidence interval) for the diseased subjects. All nodules were confirmed metastases by biopsy, patient history, and follow-up procedures. The sparse-view images reconstructed from 16, 32, 64, 128, and 256 views and postprocessed by the U-Net were presented to the other three radiologists, resulting in a total of 190 evaluated images per reader.

Full-view and all sparse-view images of an exemplary slice are shown in Fig. 2. Slices reconstructed and postprocessed using 512 views were excluded from the study as D.P. determined that even without any postprocessing, they are of comparable quality to the full-view images.

Readers were asked to independently annotate each slice using our in-house tool by rating every image on quality, the confidence of diagnosis, and the severity of artifacts present in the image according to pre-defined labels in Tables 2 and 3. Furthermore, the radiologists were asked to independently segment perceived suspect pulmonary nodules. Sensitivity, specificity, F₁ score, and the negative predictive value, were considered to compare the diagnostic reliability of images for different views [24]. For all true positive cases, the segmentation overlaps were calculated with the Dice similarity coefficient (DSC) [25, 26]. In case of no overlap, or if one of the segmentations was empty, the resulting DSC was zero.

Table 2 Score system for image quality and diagnostic confidence

Full size table

Table 3 Score system for image artifacts

Full size table

The superiority of the postprocessed labeled data over the sparse-view labeled data for each view was assessed: p-values were calculated with the clustered Wilcoxon signed-rank test utilizing Python’s SciPy library (version 1.4.1), and a significance threshold of 0.05 was set [27, 28]. The sample size for the reader study was n = 57 after pooling the results from the three readers, with each having annotated 19 CT images.

Results

The following results show the model’s performance on 3,148 images from eight diseased subjects and 9,481 images from the Luna16 dataset. Furthermore, results of the reader study on 19 CT-wise images from 12 diseased and seven healthy subjects are described.

Network performance

Figure 2 shows an example slice with varying levels of subsampling alongside the corresponding U-Net postprocessed results. It can be observed that fewer projection views result in more artifacts. The sparse-view images from extremely limited views also lead to a loss of structural integrity in their postprocessed counterparts. This was especially prominent for 16 views, as metastatic lung nodule distortion and microvascular structures generate diminished performance capabilities. Metastatic nodule composition and primary anatomical characteristics can better be amassed once reconstruction views have increased to 32. For 64 views, streak artifacts did not impact the nodule’s visibility due to tissue density, but minimalistic structural identification, such as small vessels, are not clearly portrayed. Minor features were displayed for 128 and 256 views; however, for 128 views, some streak artifacts remained present. For the postprocessed image of 32 views, the nodule shape was mostly correct, and the display of the vascular structures was improved. For 64 or more views, the nodule appearance in the postprocessed image was similar to the full-view image. Furthermore, vascular distinction on imaging can be detected with the postprocessed 128-view image. The postprocessed image from 256 views is very close in quality to the full-view image. For 512 views, no qualitative differences can be detected.

A directly proportional relationship is observed between improved IQ and higher views. As shown in Table 4, calculated mean MSE values decrease and mean SSIM values increase with more projection views for the internal test set and the external Luna16 dataset. Although mean MSE and SSIM values are marginally better for the internal test set, the model achieves comparable results on the external Luna16 dataset.

Table 4 Mean MSE and SSIM

Full size table

Multireader study

The resulting mean values for quality, confidence, and artifacts reported by the readers are shown in Fig. 3a–c. The labeled mean quality for sparse-view images decreases linearly from roughly “sufficient” to approximately “not diagnostic” for decreasing number of projection views, as seen in Fig. 3a. Figure 3b shows that the tendency for the mean confidence is similar for both sparse-view and postprocessed images. For the sparse-view images, the confidence again decreases linearly with decreasing number of views, ranging from “fairly confident” or “very confident” to “not confident at all” or “slightly confident.” The subjective quality (p = 0.002) and confidence (p = 0.020) of postprocessed images are significantly higher than their unprocessed pairs for 64 and fewer views. The presence of artifacts increases for the sparse-view images with fewer views, as observed in Fig. 3c. Postprocessed images have significantly fewer subjective artifacts than their unprocessed pairs for 128 and fewer projection views (p < 0.001).

Confusion matrices are shown in Fig. 4. The corresponding sensitivity, specificity, F₁ score, and negative predictive values are shown in Table 5. In some images, incorrect subjective segmentation by the readers resulted in falsely marked pixels in an alternate location. Such cases are counted as false negatives and mostly appeared for the sparse-view images reconstructed with 16 views. An example of such an inaccurately marked image, as well as a correctly marked image, and an image with an extra perceived nodule, are shown in Fig. 5.

Table 5 Sensitivity, specificity, F₁ score, and negative predictive value (NPV) for sparse-view CT images and their postprocessed counterpart images for all projection views calculated over 19 subject-wise images presented to three readers (n = 57)

Full size table

The confusion matrices in Fig. 4 show increasing false negative cases with a decreasing number of views for the sparse-view images and their postprocessed counterparts. This leads to a decreased sensitivity, as seen in Table 5. The symmetric representation of true positive rate and sensitivity is understood with the F₁ score: For 256 and 64 views, the F₁ score remains unchanged among the sparse-view and the postprocessed pairs. For all other projection views, the F₁ score is higher for the sparse-view images. Furthermore, the number of false positive cases is mostly independent of the number of views, which leads to specificity values between 0.86 and 1.00. The negative predictive value decreases with decreasing projection views for both sparse-view and postprocessed images. However, only for 64 views do the postprocessed images achieve a higher negative predictive value compared to their sparse-view counterparts.

Figure 3d shows the mean DSC for sparse-view images with and without postprocessing by the model. The mean DSC shows only slight differences between sparse-view images with and without postprocessing for 32 or more views. For instance, in the case of 64 views, sparse-view images without postprocessing resulted in DSC = 0.81, while images postprocessed by the model had reached DSC = 0.85 (p = 0.400). It must be noted that although no statistically significant discrepancy in segmentation overlap is observed, subjective quality (p = 0.002) and confidence (p = 0.020) assessment was markedly higher in the postprocessed images of 64 views and fewer.

Discussion

We implemented a postprocessing correction with a dual-frame U-Net based on a residual approach to improve the IQ of sparse-view CT images with lung metastasis. External evaluation with a public dataset demonstrated the model's robustness. Furthermore, a single-blinded reader study determined a tradeoff between the number of projection views, IQ, and diagnostic confidence. The results suggest that postprocessing by the U-Net can reduce the number of views from 2,048 to only 64 while maintaining diagnostically accurate IQ for nodule detection (sensitivity = 0.94). Although the DSC for the lung nodule segmentations by the readers did not significantly improve for the postprocessed images, the sparse-view artifact-corrected images drastically increased the readers’ confidence in detecting lung nodules.

It must be noted that every image labeled as “not diagnostic” in terms of IQ or “not confident at all” in terms of confidence of diagnosis would not be considered in a clinical workflow. This is especially the case for sparse-view images reconstructed from 16 views but also for some sparse-view images reconstructed from 32 views. Thus, these instances will not be considered for further discussion.

All images postprocessed by the model are labeled with better IQ and diagnostic confidence. More precisely, the difference between sparse-view images with and without postprocessing is the most prominent result for all assigned labels. It indicates that the radiologists prefer working with the postprocessed images over the unprocessed sparse-view ones: rating the quality higher, seeing fewer artifacts in the images, and most importantly, being more confident in their diagnosis. Especially the higher quality and the increased confidence could be accompanied by a shorter processing time and, in the long run, lead to fewer signs of fatigue compared to working with unprocessed sparse-view images. Since 256, 128, and 64 views lead to very similar results regarding the quality and confidence labels and worse results are achieved with 32 views, 64-view images appear to be the best choice.

To define a threshold providing a reasonable tradeoff between a reduced number of projection views and diagnostic value, sensitivity and specificity values should be maximized. Accordingly, false positive and false negative values should be minimized–false positive cases should be avoided as these cause unnecessary follow-up procedures, potentially exposing the patient to more radiation if a full-view scan is required. However, it is of utmost importance to avoid false negative cases since these would lead to afflicted patients not getting diagnosed. Low false positive cases are correlated with high specificity, and low false negative values are associated with high sensitivity.

We must consider other existing work in the literature to establish concrete baseline threshold values for sensitivity and specificity. However, finding fitting pre-defined thresholds for sensitivity and specificity values proves difficult in the extant literature. This is mainly due to the challenges of establishing a truth value from which the performance of radiologists in lung nodule detection should be assessed [29]. Furthermore, the variability in study design and data are limiting factors [29, 30]. Nonetheless, we take the values presented in the National Lung Screening Trial by Aberle et al. [31] as the closest established baselines to which we can compare the values obtained in our study: these are a sensitivity threshold of 0.94 and a specificity threshold of 0.73. According to these thresholds, the lowest possible number of projection views allowing reliable diagnosis would be achieved for postprocessed images of 64 views, leading to 0.94 sensitivity and 0.90 specificity.

The mean DSC values did not consistently show a trend of improvement between the postprocessed and the unprocessed sparse-view images. Yet, these findings support the choice of the tradeoff threshold at 64 views; the mean DSC values for the postprocessed images of 64 views resulted in the greatest improvement over the mean DSC values of their unprocessed counterparts in comparison to the other projection views.

Some study limitations must be considered. In clinical practice, radiologists often search the entire stack of images for malignancies. The present reader study could have modeled the clinical workflow more precisely as it only considered single CT images. Including neighboring slices would come closer to clinical diagnosis based on CT scans and most likely reduce the amount of falsely classified patients. Furthermore, the sparse-view data generated for this study was obtained using simplified conditions not reflective of the complex reconstruction processes in clinical settings. Therefore, only the reduced number of projection views compared to the full-view images can be reported, and an exact measure of dose reduction is hence unachievable. Our relatively small sample size was also a limiting factor, which can be addressed in future works. Additionally, testing for noninferiority or equivalence of U-Net-based postprocessing with the existing methods needs further exploration before integration of such technologies in the medical workflow.

Overall, the amount of projection views can be reduced by a factor of 32 compared to the full-view image with postprocessing by a dual-frame U-Net while keeping the diagnostic value and the confidence of the radiologists at a satisfactory level. Regarding the radiologists’ confidence, the images postprocessed with the model lead to drastically better results than the unprocessed sparse-view images. These findings suggest that postprocessed sparse-view CT images by the dual-frame U-Net could help enable dose-efficient screening for lung metastasis detection.

Availability of data and materials

The CT data and the model for inference are available upon reasonable request from the corresponding author.

Abbreviations

CT:: Computed tomography
DSC:: Dice similarity coefficient
IQ:: Image quality
MSE:: Mean squared error
SSIM:: Structural similarity index measure

References

World Health Organization (2022) Cancer. Available at: https://www.who.int/news-room/fact-sheets/detail/cancer. Accessed 10 Feb 2023
World Cancer Research Fund International (2022) Lung cancer. Available at: https://www.wcrf.org/cancer-trends/lung-cancer-statistics/. Accessed 20 Mar 2023
Gesellschaft der epidemiologischen Krebsregister e.V. und Zentrum für Krebsregisterdaten im Robert Koch-Institut (2018) Krebs in Deutschland. Available at: https://www.krebsdaten.de/Krebs/DE/Content/Publikationen/Krebs_in_Deutschland/kid_2021/kid_2021_c33_c34_lunge.pdf?__blob=publicationFile. Accessed 10 Feb 2023
American Cancer Society (2023) Lung cancer. Available at: https://www.cancer.org/cancer/lung-cancer.html. Accessed 10 Feb 2023
Deutsche Krebsgesellschaft (2013) Lungenkrebs / Lungenkarzinom. Available at: https://www.krebsgesellschaft.de/onko-internetportal/basis-informationen-krebs/krebsarten/lungenkrebs.html. Accessed 10 Feb 2023
National Health Service (2022) Overview - Lung cancer. Available at: https://www.nhs.uk/conditions/lung-cancer/. Accessed 10 Feb 2023
Hamada N, Fujimichi Y (2014) Classification of radiation effects for dose limitation purposes: history, current situation and future prospects. J Radiat Res 55:629–640. https://doi.org/10.1093/jrr/rru019
Article PubMed PubMed Central Google Scholar
US Food and Drug Administration, Center for Devices and Radiological Health (2018) What are the radiation risks from CT? Available at: https://www.fda.gov/radiation-emitting-products/medical-x-ray-imaging/what-are-radiation-risks-ct. Accessed 10 Feb 2023
Kudo H, Suzuki T, Rashed EA (2013) Image reconstruction for sparse-view CT and interior CT-introduction to compressed sensing and differentiated backprojection. Quant Imaging Med Surg 3:147–161. https://doi.org/10.3978/j.issn.2223-4292.2013.06.01
Article PubMed PubMed Central Google Scholar
Zhang Z, Liang X, Dong X, Xie Y, Cao G (2018) A sparse-view CT reconstruction method based on combination of denseNet and deconvolution. IEEE Trans Med Imaging 37:1407–1417. https://doi.org/10.1109/TMI.2018.2823338
Article PubMed Google Scholar
Jin KH, McCann MT, Froustey E, Unser M (2016) Deep convolutional neural network for inverse problems in imaging. IEEE Trans Image Process 26:4509–5422. https://doi.org/10.1109/TIP.2017.2713099
Article Google Scholar
Han Y, Yoo J, Ye JC (2016) Deep residual learning for compressed sensing CT reconstruction via persistent homology analysis. arXiv. Available at: https://arxiv.org/abs/1611.06391. Accessed 10 Feb 2023
Han Y, Ye JC (2018) Framing U-Net via deep convolutional framelets: application to sparse-view CT. IEEE Trans Med Imaging 37:1418–1429. https://doi.org/10.1109/TMI.2018.2823768
Article PubMed Google Scholar
Koetzier LR, Mastrodicasa D, Szczykutowicz TP et al (2023) Deep learning image reconstruction for CT: technical principles and clinical prospects. Radiology 306:e221257. https://doi.org/10.1148/radiol.221257
Article PubMed Google Scholar
Ronneberger O, Fischer P, Brox T (2015) U-Net: convolutional networks for biomedical image segmentation. In: Navab N, Hornegger J, Wells W, Frangi A (eds) MICCAI. 18^th International conference on medical image computing and computer-assisted intervention, Munich, October 2015. Lecture notes in computer science(), vol 9351. Springer, Cham, p 234–241. https://doi.org/10.1007/978-3-319-24574-4_28
van Ginneken B, Setio AAA, Jacobs C (2016) Lung nodule analysis. Available at: https://luna16.grand-challenge.org. Accessed 16 Jan 2020
Armato SG 3rd, McLennan G, Bidaut L et al (2011) The lung image database consortium (LIDC) and image database resource initiative (IDRI): a completed reference database of lung nodules on CT scans. Med Phys 38:915–931. https://doi.org/10.1118/1.3528204
Article PubMed PubMed Central Google Scholar
van Aarle W, Palenstijn WJ, De Beenhouwer J et al (2015) The ASTRA toolbox: a platform for advanced algorithm development in electron tomography. Ultramicroscopy 157:35–47. https://doi.org/10.1016/j.ultramic.2015.05.002
Article CAS PubMed Google Scholar
van Aarle W, Palenstijn WJ, Cant J et al (2016) Fast and flexible X-ray tomography using the ASTRA toolbox. Opt Express 24:25129–25147. https://doi.org/10.1364/OE.24.025129
Article PubMed Google Scholar
Palenstijn WJ, Batenburg KJ, Sijbers J (2011) Performance improvements for iterative electron tomography reconstruction using graphics processing units (GPUs). J Struct Biol 176:250–253. https://doi.org/10.1016/j.jsb.2011.07.017
Article CAS PubMed Google Scholar
Chollet F et al (2015) Keras. Available at: https://keras.io. Accessed 10 Feb 2023
Abadi M, Agarwal A, Barham P et al (2015) TensorFlow: large-scale machine learning on heterogeneous systems. Available at: https://www.tensorflow.org/. Accessed 10 Feb 2023
Wang Z, Bovik AC, Sheikh HR, Simoncelli EP (2004) Image quality assessment: from error visibility to structural similarity. IEEE Trans Image Process 13:600–612. https://doi.org/10.1109/TIP.2003.819861
Article PubMed Google Scholar
Parikh R, Mathai A, Parikh S, Sekhar CG, Thomas R (2008) Understanding and using sensitivity, specificity and predictive values. Indian J Ophthalmol 56:45–50. https://doi.org/10.4103/0301-4738.37595
Article PubMed PubMed Central Google Scholar
Dice LR (1945) Measures of the amount of ecologic association between species. Ecology 26:297–302. https://doi.org/10.2307/1932409
Article Google Scholar
Fleiss JL, Levin B, Paik MC (2003) The measurement of interrater agreement. In: Shewart WA, Wilks SS (eds). Statistical methods for rates and proportions 2003. John Wiley & Sons, p 598–626. https://doi.org/10.1002/0471445428.ch18
Wilcoxon F (1945) Individual comparisons by ranking methods. Biometrics Bull 1:80–83. https://doi.org/10.2307/3001968
Article Google Scholar
Virtanen P, Gommers R, Oliphant TE et al (2020) SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat Methods 17:261–272. https://doi.org/10.1038/s41592-019-0686-2
Article CAS PubMed PubMed Central Google Scholar
Armato SG 3rd, Roberts RY, Kocherginsky M et al (2009) Assessment of radiologist performance in the detection of lung nodules: dependence on the definition of “truth.” Acad Radiol 16:28–38. https://doi.org/10.1016/j.acra.2008.05.022
Article PubMed PubMed Central Google Scholar
Rubin GD (2015) Lung nodule and cancer detection in computed tomography screening. J Thorac Imaging 30:130–138. https://doi.org/10.1097/RTI.0000000000000140
Article PubMed PubMed Central Google Scholar
Aberle DR, DeMello S, Berg CD et al (2013) Results of the two incidence screenings in the national lung screening trial. N Engl J Med 369:920–931. https://doi.org/10.1056/NEJMoa1208962
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank Dr. Bernhard Haller for his valuable advice on the appropriate statistical methods for this work.

We declare no use of large language models in our manuscript.

Funding

Open Access funding enabled and organized by Projekt DEAL. Funded by the Federal Ministry of Education and Research (BMBF) and the Free State of Bavaria under the Excellence Strategy of the Federal Government and the Länder, the German Research Foundation (GRK2274), as well as by the Technical University of Munich–Institute for Advanced Study.

Author information

Annika Ries and Tina Dorosti shared the first authorship.

Authors and Affiliations

Chair of Biomedical Physics, Department of Physics, School of Natural Sciences, Technical University of Munich, Garching, 85748, Germany
Annika Ries, Tina Dorosti, Johannes Thalhammer, Franz Pfeiffer & Florian Schaff
Munich Institute of Biomedical Engineering, Technical University of Munich, 85748, Garching, Germany
Annika Ries, Tina Dorosti, Johannes Thalhammer, Tobias Lasser, Franz Pfeiffer & Florian Schaff
Department of Diagnostic and Interventional Radiology, School of Medicine, Klinikum Rechts Der Isar, Technical University of Munich, 81675, Munich, Germany
Tina Dorosti, Johannes Thalhammer, Daniel Sasse, Andreas Sauter, Felix Meurer, Ashley Benne, Franz Pfeiffer & Daniela Pfeiffer
Institute for Advanced Study, Technical University of Munich, 85748, Garching, Germany
Johannes Thalhammer, Ashley Benne, Franz Pfeiffer & Daniela Pfeiffer
Computational Imaging and Inverse Problems, Department of Computer Science, School of Computation, Information, and Technology, Technical University of Munich, 85748, Garching, Germany
Tobias Lasser

Authors

Annika Ries
View author publications
You can also search for this author in PubMed Google Scholar
Tina Dorosti
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Thalhammer
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Sasse
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Sauter
View author publications
You can also search for this author in PubMed Google Scholar
Felix Meurer
View author publications
You can also search for this author in PubMed Google Scholar
Ashley Benne
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Lasser
View author publications
You can also search for this author in PubMed Google Scholar
Franz Pfeiffer
View author publications
You can also search for this author in PubMed Google Scholar
Florian Schaff
View author publications
You can also search for this author in PubMed Google Scholar
Daniela Pfeiffer
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All the authors contributed at the different stages of the study; AR and TD shared first authorship as they contributed equally to this work, with AR implementing the deep learning model and TD conducting the reader study at our clinic. The literature review was carried out by AR, and the statistical analysis by TD. AR, TD, JT, and DP curated and analyzed the data. DS, AS, FM, AB, and DP contributed to the clinical methodology and the reader study. AR, TD, JT, AB, TL, FP, FS, and DP investigated and interpreted the results. FP and DP conceptualized, administered, and acquired funding for this study. All authors agreed to guarantee that any questions related to this work are appropriately investigated. All authors reviewed the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Tina Dorosti.

Ethics declarations

Ethics approval and consent to participate

This retrospective study obtained approval from our institutional review board and was conducted in accordance with the regulations of our institution (approval code: 87/18 S, Institutional Review Board of the Faculty of Medicine, Technical University of Munich, Germany).

Consent for publication

Not applicable.

Competing interests

We report no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ries, A., Dorosti, T., Thalhammer, J. et al. Improving image quality of sparse-view lung tumor CT images with U-Net. Eur Radiol Exp 8, 54 (2024). https://doi.org/10.1186/s41747-024-00450-4

Download citation

Received: 20 November 2023
Accepted: 09 February 2024
Published: 03 May 2024
DOI: https://doi.org/10.1186/s41747-024-00450-4

Improving image quality of sparse-view lung tumor CT images with U-Net

Abstract

Background

Methods

Results

Conclusions

Relevance statement

Key points

Graphical Abstract

Background

Methods

Datasets

Data preparation

Network architecture

Multireader study and statistical analysis

Results

Network performance

Multireader study

Discussion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords