XAI Vision Transfer | RICHARD CHEAM

TL;DR

Problem: Compare how CNNs, transformers, and hybrid transformers behave under limited data, domain shift, and explainability constraints.
Approach: Controlled CIFAR-10 source study with robustness and data-efficiency analysis, followed by transfer to EuroSAT and Brain Tumor MRI with scratch, linear-probe, and full-fine-tune settings.
Outcome: DHVT became the strongest clean CIFAR-10 model and the best Brain MRI transfer model, while ViT stayed the most texture-robust and linear probing was consistently weaker than full fine-tuning.

What I Built

Unified training pipeline: one configurable framework for CNN, ViT, and DHVT across source and downstream datasets.

Frugal-learning protocol: CIFAR-10 data-efficiency runs plus downstream comparison between scratch, frozen-backbone linear probing, and full fine-tuning.

Checkpoint-first evaluation: saved model weights, per-run histories, plot regeneration, and canonical result export through a master results table.

Explainability workflow: Grad-CAM for CNN, attention rollout for ViT, head-token influence for DHVT, confusion matrices, class diagnostics, and misclassification interpretability.

Results

Source stage

88.88%

DHVT clean CIFAR-10 accuracy, the strongest source-stage model in the study.

EuroSAT

97.52%

Best downstream EuroSAT result with DHVT trained from scratch.

Brain MRI

94.00%

Best downstream Brain Tumor MRI result with pretrained DHVT and full fine-tuning.

Architecture comparison: DHVT was strongest on clean CIFAR-10, CNN remained competitive in the low-data regime, and vanilla ViT was the most robust to texture corruption.

Budget-aware transfer: linear probing reduced cost for transformer-style models, but the performance drop was too large to make it the preferred transfer strategy.

XAI insight: the explanation maps often showed that failures came from semantically plausible confusion rather than attention drifting to unrelated background.

Visuals

Selected report-ready panels from the experiment repository.

Source-stage overview panel

Downstream learning dynamics panel

Downstream interpretability panel

Links

GitHub Report