Rapid Fire Abstracts
Mehri Mehrnia, MSc
PhD Student
Northwestern University
Mehri Mehrnia, MSc
PhD Student
Northwestern University
Mohamed Elbayumi, MSc
Research Assistant
Northwestern University
Mohammed Elbaz, PhD
Assistant Professor
Northwestern University
The cohort included 44 pre-ablation AF patients (age 67±9 years, 34% female) from the public CARMA 3D LGE CMR database with a total of 1227 2D slices 5, which includes ground truth segmentations of both the LA wall and endocardium (blood pool). We tested MedSAM1 and MedSAM2 foundational models for semi -automated, prompt-based LA segmentation following the analysis pipeline in Fig. 1. MedSAM1 performs 2D segmentation, requiring a user-defined box prompt (ROI) for each LA slice (Fig.1a). MedSAM2, on the other hand, is capable of 3D segmentation using only a single freehand (scribble) prompt drawn on one slice6, followed by automated tracking across the subsequent slices (Fig.1b). The accuracy of LA segmentation was assessed using image-based Dice scores (median [IQR]), with the ground truth defined as a single combined ROI that includes both the LA wall and endocardium (blood pool).
Results: Both 2D-based MedSAM1 (per slice prompt) and 3D-based MedSAM2 (single prompt per scan) demonstrated good comparable performance for LA segmentation, with MedSAM1 Dice = 0.84 [0.73, 0.89] and MedSAM2 Dice = 0.84 [0.64,0.91] (average of 30 LA slices per scan ) (Fig. 2). However, both models exhibited limitations, in areas with multiple enhanced objects. MedSAM2 frequently produced erroneous segmentation of the aorta in the final slices (Fig. 3a) and sometimes skips the first few LA slices where LA size is small with poor contrast. Yet, MedSAM2 excelled in mid-slice LA segmentation, delivering near-perfect results (Fig. 3b). In contrast, MedSAM1 struggles when pulmonary veins are visible, as its single box prompt per slice approach seem to cause confusion. Additionally, in MedSAM1, the user manually creates a box prompt for each slice, taking approximately 10 seconds per slice (5 minutes for a case with 30 slices), while MedSAM2 requires only a single scribble, taking 20 seconds.
Conclusion: This study demonstrates the potential of foundational models like MedSAM1 and MedSAM2 for automating LA segmentation in 3D LGE-CMR scans of AF patients. MedSAM2 offers similar accuracy with greater efficiency, needing only a single prompt per scan, unlike MedSAM1's per-slice prompts. While both struggled in complex regions, their key strength lies in their adaptability without pre-scan tuning. This generalizability could reduce manual effort, enhance consistency, and streamline cardiac imaging workflows.