Vision Transformers for Medical Image Segmentation: A Comprehensive Survey

Chen Wei*, Xia Liu
* Corresponding author

Book ISBN: 978-1-83568-134-3
Book DOI: 10.12345/csit.mlai.2023
Published: September 2023
Book: Machine Learning and Artificial Int...
Series: CSA

Abstract

This survey provides a thorough examination of Vision Transformer (ViT) architectures applied to medical image segmentation tasks. We review over 120 recent publications covering fundus photography, CT scan analysis, MRI brain segmentation, and histopathology slide processing. We identify key architectural innovations, benchmark datasets, and open research challenges.

Keywords

vision transformer, medical image segmentation, deep learning, ViT, attention mechanism

Authors

Chen Wei (corresponding author)
Tsinghua University
chen.wei@tsinghua.edu.cn
Xia Liu
Tsinghua University
xia.liu@tsinghua.edu.cn