-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:01:58
Accurate bone segmentation and anatomical landmark detection are vital tasks for the clinical evaluation and treatment planning for patients with lower limbs X-ray films. To leverage the information between the two tasks and deal with the large-scale images, we propose an efficient end-to-end deep network, i.e., multi-task and multiscale attention network (MMAN), to concurently segment lower limb bones and localize landmarks from large-scale X-ray films in one stage. The results demonstratethat our MMAN outperforms the other state-of-the-art methods for multi-task learning or single-task landmark detection using two separate stages. Our MMAN method has two main technical contributions. First, the local and global encoders are designed to capture multi-scale inputs and provide shared representations including local image details and global contexts, respectively. Second, a global-local attention module is designed to efficiently leverage the global context and learn task-specific information from shared representations under limited computational costs.