seg2med: a segmentation-based medical image generation framework using denoising diffusion probabilistic models

Yang, Zeyu; Chen, Zhilin; Sun, Yipeng; Strittmatter, Anika; Raj, Anish; Allababidi, Ahmad; Rink, Johann S.; Zöllner, Frank G.

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2504.09182v1 (eess)

[Submitted on 12 Apr 2025 (this version), latest version 12 Jun 2025 (v2)]

Title:seg2med: a segmentation-based medical image generation framework using denoising diffusion probabilistic models

Authors:Zeyu Yang, Zhilin Chen, Yipeng Sun, Anika Strittmatter, Anish Raj, Ahmad Allababidi, Johann S. Rink, Frank G. Zöllner

View PDF HTML (experimental)

Abstract:In this study, we present seg2med, an advanced medical image synthesis framework that uses Denoising Diffusion Probabilistic Models (DDPM) to generate high-quality synthetic medical images conditioned on anatomical masks from TotalSegmentator. The framework synthesizes CT and MR images from segmentation masks derived from real patient data and XCAT digital phantoms, achieving a Structural Similarity Index Measure (SSIM) of 0.94 +/- 0.02 for CT and 0.89 +/- 0.04 for MR images compared to ground-truth images of real patients. It also achieves a Feature Similarity Index Measure (FSIM) of 0.78 +/- 0.04 for CT images from XCAT. The generative quality is further supported by a Fréchet Inception Distance (FID) of 3.62 for CT image generation.
Additionally, seg2med can generate paired CT and MR images with consistent anatomical structures and convert images between CT and MR modalities, achieving SSIM values of 0.91 +/- 0.03 for MR-to-CT and 0.77 +/- 0.04 for CT-to-MR conversion. Despite the limitations of incomplete anatomical details in segmentation masks, the framework shows strong performance in cross-modality synthesis and multimodal imaging.
seg2med also demonstrates high anatomical fidelity in CT synthesis, achieving a mean Dice coefficient greater than 0.90 for 11 abdominal organs and greater than 0.80 for 34 organs out of 59 in 58 test cases. The highest Dice of 0.96 +/- 0.01 was recorded for the right scapula. Leveraging the TotalSegmentator toolkit, seg2med enables segmentation mask generation across diverse datasets, supporting applications in clinical imaging, data augmentation, multimodal synthesis, and diagnostic algorithm development.

Comments:	17 pages, 10 figures
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.09182 [eess.IV]
	(or arXiv:2504.09182v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2504.09182

Submission history

From: Zeyu Yang [view email]
[v1] Sat, 12 Apr 2025 11:32:32 UTC (10,487 KB)
[v2] Thu, 12 Jun 2025 23:39:43 UTC (10,380 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:seg2med: a segmentation-based medical image generation framework using denoising diffusion probabilistic models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:seg2med: a segmentation-based medical image generation framework using denoising diffusion probabilistic models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators