Transformers in Medical Imaging: A Survey
- URL: http://arxiv.org/abs/2201.09873v1
- Date: Mon, 24 Jan 2022 18:50:18 GMT
- Title: Transformers in Medical Imaging: A Survey
- Authors: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan,
Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu
- Abstract summary: Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results.
Medical imaging has also witnessed growing interest for Transformers that can capture global context compared to CNNs with local receptive fields.
We provide a review of the applications of Transformers in medical imaging covering various aspects, ranging from recently proposed architectural designs to unsolved issues.
- Score: 88.03790310594533
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Following unprecedented success on the natural language tasks, Transformers
have been successfully applied to several computer vision problems, achieving
state-of-the-art results and prompting researchers to reconsider the supremacy
of convolutional neural networks (CNNs) as {de facto} operators. Capitalizing
on these advances in computer vision, the medical imaging field has also
witnessed growing interest for Transformers that can capture global context
compared to CNNs with local receptive fields. Inspired from this transition, in
this survey, we attempt to provide a comprehensive review of the applications
of Transformers in medical imaging covering various aspects, ranging from
recently proposed architectural designs to unsolved issues. Specifically, we
survey the use of Transformers in medical image segmentation, detection,
classification, reconstruction, synthesis, registration, clinical report
generation, and other tasks. In particular, for each of these applications, we
develop taxonomy, identify application-specific challenges as well as provide
insights to solve them, and highlight recent trends. Further, we provide a
critical discussion of the field's current state as a whole, including the
identification of key challenges, open problems, and outlining promising future
directions. We hope this survey will ignite further interest in the community
and provide researchers with an up-to-date reference regarding applications of
Transformer models in medical imaging. Finally, to cope with the rapid
development in this field, we intend to regularly update the relevant latest
papers and their open-source implementations at
\url{https://github.com/fahadshamshad/awesome-transformers-in-medical-imaging}.
Related papers
- Transformer for Object Re-Identification: A Survey [69.61542572894263]
Vision Transformers have spurred a growing number of studies delving deeper into Transformer-based Re-ID.
This paper provides a comprehensive review and in-depth analysis of the Transformer-based Re-ID.
Considering the trending unsupervised Re-ID, we propose a new Transformer baseline, UntransReID, achieving state-of-the-art performance.
arXiv Detail & Related papers (2024-01-13T03:17:57Z) - A Comprehensive Survey on Applications of Transformers for Deep Learning
Tasks [60.38369406877899]
Transformer is a deep neural network that employs a self-attention mechanism to comprehend the contextual relationships within sequential data.
transformer models excel in handling long dependencies between input sequence elements and enable parallel processing.
Our survey encompasses the identification of the top five application domains for transformer-based models.
arXiv Detail & Related papers (2023-06-11T23:13:51Z) - Advances in Medical Image Analysis with Vision Transformers: A
Comprehensive Review [6.953789750981636]
We provide an encyclopedic review of the applications of Transformers in medical imaging.
Specifically, we present a systematic and thorough review of relevant recent Transformer literature for different medical image analysis tasks.
arXiv Detail & Related papers (2023-01-09T16:56:23Z) - Vision Transformers in Medical Imaging: A Review [0.0]
Transformer, a model comprising attention-based encoder-decoder architecture, have gained prevalence in the field of natural language processing (NLP)
In this paper, we attempt to provide a comprehensive and recent review on the application of transformers in medical imaging by; describing the transformer model comparing it with a diversity of convolutional neural networks (CNNs)
arXiv Detail & Related papers (2022-11-18T05:52:37Z) - Transforming medical imaging with Transformers? A comparative review of
key properties, current progresses, and future perspectives [21.164122592628388]
Transformer, the latest technological advance of deep learning, has gained prevalence in natural language processing or computer vision.
We offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging.
arXiv Detail & Related papers (2022-06-02T16:38:31Z) - Vision Transformers in Medical Computer Vision -- A Contemplative
Retrospection [0.9677949377607575]
Vision Transformers are evolved as one of the most contemporary and dominant architectures that are being used in the field of computer vision.
We surveyed the application of Vision transformers in different areas of medical computer vision such as image-based disease classification, anatomical structure segmentation, registration, region-based lesion Detection, captioning, report generation.
We also put some light on available data sets, adopted methodology, their performance measures, challenges and their solutions in form of discussion.
arXiv Detail & Related papers (2022-03-29T06:32:43Z) - Transformers in Medical Image Analysis: A Review [46.71636151229035]
Our paper presents both a position paper and a primer, promoting awareness and application of Transformers in the field of medical image analysis.
Specifically, we first overview the core concepts of the attention mechanism built into Transformers and other basic components.
Second, we give a new taxonomy of various Transformer architectures tailored for medical image applications and discuss their limitations.
arXiv Detail & Related papers (2022-02-24T16:04:03Z) - Medical Transformer: Gated Axial-Attention for Medical Image
Segmentation [73.98974074534497]
We study the feasibility of using Transformer-based network architectures for medical image segmentation tasks.
We propose a Gated Axial-Attention model which extends the existing architectures by introducing an additional control mechanism in the self-attention module.
To train the model effectively on medical images, we propose a Local-Global training strategy (LoGo) which further improves the performance.
arXiv Detail & Related papers (2021-02-21T18:35:14Z) - TransUNet: Transformers Make Strong Encoders for Medical Image
Segmentation [78.01570371790669]
Medical image segmentation is an essential prerequisite for developing healthcare systems.
On various medical image segmentation tasks, the u-shaped architecture, also known as U-Net, has become the de-facto standard.
We propose TransUNet, which merits both Transformers and U-Net, as a strong alternative for medical image segmentation.
arXiv Detail & Related papers (2021-02-08T16:10:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.