To Örebro University

oru.seÖrebro University Publications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Advanced GradCAM++: Improved Visual Explanations of CNN Decisions in Diabetic Retinopathy
Computer Science and Engineering Discipline, Khulna University, Khulna, Bangladesh.
Computer Science and Engineering Discipline, Khulna University, Khulna, Bangladesh.
Computer Science and Engineering Discipline, Khulna University, Khulna, Bangladesh.ORCID iD: 0000-0001-7387-6650
Australian e-Health Research Centre, CSIRO, Floreat, Australia.
2023 (English)In: Computer Vision and Image Analysis for Industry 4.0 / [ed] Nazmul Siddique; Mohammad Shamsul Arefin; Md Atiqur Rahman Ahad; M. Ali Akber Dewan, New York: Taylor & Francis Group, 2023, p. 64-75Chapter in book (Refereed)
Abstract [en]

Convolutional neural network (CNN)-based methods have achieved state-of-the-art performance in solving several complex computer vision problems including assessment of diabetic retinopathy (DR). Despite this, CNN-based methods are often criticized as “black box” methods for providing limited to no understanding about their internal functioning. In recent years there has been an increased interest to develop explainable deep learning models, and this paper is an effort in that direction in the context of DR. Based on one of the best performing method called Grad-CAM++, we propose Advanced Grad-CAM++ to provide further improvement in visual explanations of CNN model predictions (when compared to Grad-CAM++), in terms of better localization of DR pathology as well as explaining occurrences of multiple DR pathology types in a fundus image. By keeping all the layers and operations as is, the proposed method adds an additional non-learnable bilateral convolutional layer between the input image and the very first learnable convolutional layer of Grad-CAM++. Experiments were conducted on fundus images collected from publicly available sources namely EyePACS and DIARETDB1. Intersection over Union (IoU) score between the ground truth and heatmap produced by the methods were used to quantitatively compare the performance.The overall IoU score for Advanced Grad-CAM++ is 0.179, whereas for Grad-CAM++ it is score 0.161. Thus an 11.18% improvement in agreement with the ground truths by the proposed method is inferable.

Place, publisher, year, edition, pages
New York: Taylor & Francis Group, 2023. p. 64-75
Keywords [en]
Deep Learning, Interpretable ML, CNN, Diabetic Retinopathy, Optic Disc
National Category
Computer Vision and Robotics (Autonomous Systems)
Research subject
Signal Processing; Computerized Image Analysis
Identifiers
URN: urn:nbn:se:oru:diva-106530Scopus ID: 2-s2.0-85161149678ISBN: 9781003256106 (electronic)ISBN: 9781032164168 (print)ISBN: 9781032187624 (print)OAI: oai:DiVA.org:oru-106530DiVA, id: diva2:1773121
Available from: 2023-06-22 Created: 2023-06-22 Last updated: 2023-12-20Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

ScopusComputer Vision and Image Analysis for Industry 4.0

Authority records

Rahaman, G. M. Atiqur

Search in DiVA

By author/editor
Rahaman, G. M. Atiqur
Computer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 18 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf