Paper Detail

Multimodal Brain Tumour Classification Using Feature Fusion

Wajih ul Islam, Muhammad Yaqoob, Javed Ali Khan, Volker Steuber

arxiv Score 10.3

Published 2026-06-09 · First seen 2026-06-10

General AI

Abstract

Clinicians diagnose brain tumors by synthesizing patient symptoms, medical history, and quantitative imaging data from modalities such as MRI and CT scans into a unified clinical judgement. However, most deep learning models rely on MRI/CT images alone, failing to replicate the clinician's multimodal reasoning. We explore a two-branch multimodal network that combines raw MRI scans with 91 extracted radiomic features (intensity, texture, shape, and boundary descriptors) to classify scans into four classes: glioma, meningioma, pituitary tumor, and no tumor. A pre-trained CNN backbone encodes the image stream, whereas a dedicated MLP encodes the radiomic stream. The two streams are fused by one of three strategies: concatenation, gating, or bidirectional cross-modal attention. Across nine experimental runs on a balanced 7,200-image dataset, all multimodal configurations outperform the unimodal baselines, with gated fusion achieving the best accuracy of 96.13%.
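
To make the fusion step concrete, here is a minimal NumPy sketch of two of the strategies the abstract names, concatenation and gated fusion. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the gating formula, so a common form is assumed (a sigmoid gate computed from both embeddings, used to blend them element-wise). The function names, the embedding size of 8, and the random stand-in embeddings are all hypothetical; in the paper the embeddings would come from the CNN backbone and the radiomic MLP.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def concat_fusion(img_emb, rad_emb):
    """Concatenation fusion: stack the two embeddings along the feature axis."""
    return np.concatenate([img_emb, rad_emb], axis=-1)


def gated_fusion(img_emb, rad_emb, W, b):
    """Gated fusion (assumed form): a learned gate in (0, 1) decides, per
    feature, how much of the image embedding versus the radiomic embedding
    passes through. Both embeddings must share the same dimension."""
    joint = np.concatenate([img_emb, rad_emb], axis=-1)
    gate = sigmoid(joint @ W + b)
    return gate * img_emb + (1.0 - gate) * rad_emb


# Stand-in embeddings (hypothetical sizes): a batch of 4 samples, 8 features each.
rng = np.random.default_rng(0)
batch, dim = 4, 8
img_emb = rng.normal(size=(batch, dim))   # would come from the CNN backbone
rad_emb = rng.normal(size=(batch, dim))   # would come from the radiomic MLP

# Gate parameters would be learned in training; random values are used here.
W = rng.normal(scale=0.1, size=(2 * dim, dim))
b = np.zeros(dim)

fused_cat = concat_fusion(img_emb, rad_emb)
fused_gate = gated_fusion(img_emb, rad_emb, W, b)

print(fused_cat.shape)   # (4, 16)
print(fused_gate.shape)  # (4, 8)
```

In this form, concatenation doubles the feature width and leaves the weighting of the two modalities to the classifier head, while the gate keeps the width fixed and produces a per-feature convex combination of the two streams. A classifier over the four classes would then operate on the fused vector.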

BibTeX

@article{islam2026multimodal,
  title = {Multimodal Brain Tumour Classification Using Feature Fusion},
  author = {Wajih ul Islam and Muhammad Yaqoob and Javed Ali Khan and Volker Steuber},
  year = {2026},
  abstract = {Clinicians diagnose brain tumors by synthesizing patient symptoms, medical history, and quantitative imaging data from modalities such as MRI and CT scans into a unified clinical judgement. However, most deep learning models rely on MRI/CT images alone, failing to replicate the clinician's multimodal reasoning. We explore a two-branch multimodal network combining raw MRI scans with 91 extracted radiomic features (intensity, texture, shape, and boundary descriptors) to classify brain tumors into g},
  url = {https://arxiv.org/abs/2606.11107},
  keywords = {eess.IV, cs.CV, cs.LG},
  eprint = {2606.11107},
  archiveprefix = {arXiv},
}
