Cross-Scale Fusion Transformer for Histopathological Image Classification

Sheng Kai Huang, Yu Ting Yu, Chun Rong Huang*, Hsiu Chi Cheng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Histopathological images provide the medical evidences to help the disease diagnosis. However, pathologists are not always available or are overloaded by work. Moreover, the variations of pathological images with respect to different organs, cell sizes and magnification factors lead to the difficulty of developing a general method to solve the histopathological image classification problems. To address these issues, we propose a novel cross-scale fusion (CSF) transformer which consists of the multiple field-of-view patch embedding module, the transformer encoders and the cross-fusion modules. Based on the proposed modules, the CSF transformer can effectively integrate patch embeddings of different field-of-views to learn cross-scale contextual correlations, which represent tissues and cells of different sizes and magnification factors, with less memory usage and computation compared with the state-of-the-art transformers. To verify the generalization ability of the CSF transformer, experiments are performed on four public datasets of different organs and magnification factors. The CSF transformer outperforms the state-of-the-art task specific methods, convolutional neural network-based methods and transformer-based methods.

Original languageEnglish
Pages (from-to)297-308
Number of pages12
JournalIEEE Journal of Biomedical and Health Informatics
Volume28
Issue number1
DOIs
StatePublished - 1 Jan 2024

Keywords

  • Deep learning
  • histopathological image classification
  • transformer

Fingerprint

Dive into the research topics of 'Cross-Scale Fusion Transformer for Histopathological Image Classification'. Together they form a unique fingerprint.

Cite this