Keypoint-Guided Medical Video Segmentation Model With Spatiotemporal Feature Fusion

Mar 16, 2026·
Minghao Wang
,
Shaoyi Du*
Corresponding author
,
Huanhuan Huo
,
Juejiang
,
Dong Zhang
HAN Hongcheng
HAN Hongcheng
,
Shengdi Hou
,
Juan Wang*
Corresponding author
· 0 min read
Abstract
Atrial fibrillation, characterized by high prevalence and poor prognosis, presents a significant global health burden. Accurate segmentation and measurement of left ventricular and left atrial appendage morphology and function are essential for reliable risk assessment. However, these tasks are hindered by ambiguous boundaries, complex cardiac motion, and sparse annotations. To address these challenges, we propose a Keypoint-Guided Medical Video Segmentation Model with Spatiotemporal Feature Fusion (KG-STS). First, we propose a shape-constrained point encoder that explicitly encodes boundary points to improve the representation of ambiguous boundaries. Next, we introduce a motion-aware alignment module that models cardiac motion by forming coherent motion information across frames. Building on these two modules, we develop a keypoint-guided spatiotemporal feature fusion module that integrates spatial boundary representations with temporal motion cues to enhance decoding features under sparse annotations, enabling temporally consistent segmentation and supporting morphological measurement. We evaluate the segmentation and measurement performance of our method on a self-constructed multi-view transesophageal echocardiography dataset and two publicly available transthoracic echocardiography datasets. The results demonstrate that KG-STS achieves superior temporal consistency in segmentation and higher accuracy in morphological measurements compared to competing methods.
Type
Publication
IEEE Transactions on Medical Imaging, 45(6)
License
CC-BY-4.0
publications
HAN Hongcheng
Authors
PhD Candidate in Control Science and Engineering
Han Hongcheng (韩泓丞) received the degree of B.Eng. in School of Energy and Power Engineering, Xi’an Jiaotong University, Xi’an, China in 2020. Since then, he is studying for Ph.D. degree in Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University. His interests focus on intelligent transportation and medical image analysis, specializing in Multimodal data fusion and Image Synthesis.