際際滷

際際滷Share a Scribd company logo
Trustworthy AI
LFI-CAM: Learning Feature Importance
for Better Visual Explanation
Kwang Hee Lee, Chaewon Park, Junghyun Oh, Nojun Kwak
Trustworthy AI
Overview
? We propose a novel architecture, LFI-CAM, which is trainable for image classification
and visual explanation in an end-to-end manner.
? LFI-CAM¨s attention map generated during forward propagation, leverages to improve
the classification performance through the attention mechanism.
? Feature Importance Network (FIN) focuses on learning the feature importance
instead of directly learning the attention map to obtain a more reliable and consistent
attention map
Trustworthy AI
Related Works
Perturbation-based Method: LIME, RISE
Response-based Method: CAM
Gradient-based Method: Grad-CAM, Grad-CAM++ Hybrid-based Method: Score-CAM, ABN
Trustworthy AI
Related Works
? Class Activation Mapping
Trustworthy AI
Related Works
? Grad-CAM
Trustworthy AI
Related Works
? LIME (Local Interpretable Model-agnostic Explanations)
Ribeiro MT, Singh S, Guestrin C. " Why should i trust you?" Explaining the predictions of any classifier. InProceedings of
the 22nd ACM SIGKDD international conference on knowledge discovery and data mining 2016 Aug 13 (pp. 1135-1144).
Trustworthy AI
Related Works
? RISE (Randomized input sampling for explanation of black-box models)
Petsiuk V, Das A, Saenko K. Rise: Randomized input sampling for explanation of black-box models.
arXiv preprint arXiv:1806.07421. 2018 Jun 19.
Trustworthy AI
Related Works
? Score-CAM
Trustworthy AI
Related Works
? Attention Branch Network (ABN)
Trustworthy AI
Motivation
? Conventional CAM variants still have the limitations such as
classification performance degradation, multiple forward
computing, and additive backpropagation.
? In particular, ABN outputs unreliable and inconsistent
attention maps through several experiments with various
hyper-parameters.
? Although Score-CAM achieves better visual performance
with less noise and better stability than the gradient-based
approaches, multiple forward computing makes the
generation of visual explanation very slow.
? Inspired by ABN and Score-CAM, we proposed a new
architecture for image classification and visual explanation in
an end-to-end manner and outputs a more reliable and
consistent attention map.
ABN
ABN
LFI-CAM
LFI-CAM
Trustworthy AI
Proposed Method
Trustworthy AI
Feature Importance Network (Attention Branch)
? While ABN directly learns the class activation map
in the attention branch, we replaced the ABN
model¨s attention branch with a new network
architecture, ^Feature Importance Network (FIN) ̄.
? FIN helps the LFI-CAM model learn the feature
importance to generate better class activation map
than ABN.
? The attention map is generated by the weighted
sum of the feature maps from the last
convolutional layer and the learned feature
importance vector.
? To learn the feature importance, in a similar way to
the Score-CAM, we convert the original input into
a gray image, which is down-sampled to the size
of the feature map.
Trustworthy AI
Perception Branch
? The perception branch takes the original input
image as an input and outputs the final probability
of each class.
? The attention map is applied to the feature maps
by the attention mechanism.
? The attention mechanism helps the attention
map improve the classification performance.
Trustworthy AI
Training
? LFI-CAM model is trained in an end-to-end manner using training loss
calculated as the combination of the Softmax function and cross-
entropy at the perception branch in image classification task.
? The FIN is optimized by the attention mechanism of the perception
branch to improve the classification accuracy without any additional
loss function.
Trustworthy AI
Experiments
? Experimental Settings on Image Classification
? Datasets: CIFAR10, CIFAR100, STL10, Cat&Dog and ImageNet
? Baseline Models: CIFAR ResNet backbone (ResNet 20, 32, 44, 56, 110)
ImageNet ResNet backbone (ResNet 18, 34, 50, 101, 152)
? Optimizer and Hyper-parameters
? SGD with momentum
? 300 epochs for CIFAR10,100, STL10 and Cat&Dog
? 90 epochs for ImageNet
? Learning rate is initialized with 0.1 and later on divided by 10 at 50% and 75% of the total number of
training epochs
? Batch size of 128 for CIFAR ResNet backbone
? Batch size of 256 for ImageNet ResNet backbone
Trustworthy AI
Experiments- Visual Explanation Evaluation
Trustworthy AI
Experiments- Visual Explanation Evaluation
Trustworthy AI
Experiments- Effectiveness of Feature Importance Network
Trustworthy AI
Experiments- Accuracy on Image Classification
Trustworthy AI
Experiments- Stability Evaluation of Visual Explanation
Trustworthy AI
Summary
? We proposed the LFI-CAM model, which is trainable for
image classification and produces better visual explanation in
an end-to-end manner.
? Feature Importance Network (FIN) helps our model focus on
learning the feature importance to generate a more stable
and reliable attention map.
? LFI-CAM is on par with ABN in terms of classification
accuracy and outmatches ABN in terms of attention map
quality.

More Related Content

What's hot (18)

侮業僥楼壓粥或鴛議哘喘
侮業僥楼壓粥或鴛議哘喘侮業僥楼壓粥或鴛議哘喘
侮業僥楼壓粥或鴛議哘喘
CHENHuiMei
?
[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques
JaeJun Yoo
?
Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...
Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...
Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...
Dongmin Choi
?
CBIR in the Era of Deep Learning
CBIR in the Era of Deep LearningCBIR in the Era of Deep Learning
CBIR in the Era of Deep Learning
Xiaohu ZHU
?
Step zhedong
Step zhedongStep zhedong
Step zhedong
學叫 孱
?
Pratik ibm-open power-ppt
Pratik ibm-open power-pptPratik ibm-open power-ppt
Pratik ibm-open power-ppt
Vaibhav R
?
Modeling perceptual similarity and shift invariance in deep networks
Modeling perceptual similarity and shift invariance in deep networksModeling perceptual similarity and shift invariance in deep networks
Modeling perceptual similarity and shift invariance in deep networks
NAVER Engineering
?
Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)
Farzaneh Rezaei
?
Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in Images
Anil Kumar Gupta
?
Image analytics - A Primer
Image analytics - A PrimerImage analytics - A Primer
Image analytics - A Primer
Gopi Krishna Nuti
?
Binary code-based Human Detection
Binary code-based Human DetectionBinary code-based Human Detection
Binary code-based Human Detection
MPRG_Chubu_University
?
BOIL: Towards Representation Change for Few-shot Learning
BOIL: Towards Representation Change for Few-shot LearningBOIL: Towards Representation Change for Few-shot Learning
BOIL: Towards Representation Change for Few-shot Learning
Hyungjun Yoo
?
Deep learning on mobile
Deep learning on mobileDeep learning on mobile
Deep learning on mobile
Anirudh Koul
?
Deep learning for person re-identification
Deep learning for person re-identificationDeep learning for person re-identification
Deep learning for person re-identification
學叫 孱
?
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present..."Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
Edge AI and Vision Alliance
?
IRJET- Concepts, Methods and Applications of Neural Style Transfer: A Rev...
IRJET-  	  Concepts, Methods and Applications of Neural Style Transfer: A Rev...IRJET-  	  Concepts, Methods and Applications of Neural Style Transfer: A Rev...
IRJET- Concepts, Methods and Applications of Neural Style Transfer: A Rev...
IRJET Journal
?
Hao_Guan_2011
Hao_Guan_2011Hao_Guan_2011
Hao_Guan_2011
Hao Guan
?
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
Hirantha Pradeep
?
侮業僥楼壓粥或鴛議哘喘
侮業僥楼壓粥或鴛議哘喘侮業僥楼壓粥或鴛議哘喘
侮業僥楼壓粥或鴛議哘喘
CHENHuiMei
?
[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques[CVPR2020] Simple but effective image enhancement techniques
[CVPR2020] Simple but effective image enhancement techniques
JaeJun Yoo
?
Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...
Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...
Review : Structure Boundary Preserving Segmentation?for Medical Image with Am...
Dongmin Choi
?
CBIR in the Era of Deep Learning
CBIR in the Era of Deep LearningCBIR in the Era of Deep Learning
CBIR in the Era of Deep Learning
Xiaohu ZHU
?
Pratik ibm-open power-ppt
Pratik ibm-open power-pptPratik ibm-open power-ppt
Pratik ibm-open power-ppt
Vaibhav R
?
Modeling perceptual similarity and shift invariance in deep networks
Modeling perceptual similarity and shift invariance in deep networksModeling perceptual similarity and shift invariance in deep networks
Modeling perceptual similarity and shift invariance in deep networks
NAVER Engineering
?
Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)
Farzaneh Rezaei
?
Obscenity Detection in Images
Obscenity Detection in ImagesObscenity Detection in Images
Obscenity Detection in Images
Anil Kumar Gupta
?
BOIL: Towards Representation Change for Few-shot Learning
BOIL: Towards Representation Change for Few-shot LearningBOIL: Towards Representation Change for Few-shot Learning
BOIL: Towards Representation Change for Few-shot Learning
Hyungjun Yoo
?
Deep learning on mobile
Deep learning on mobileDeep learning on mobile
Deep learning on mobile
Anirudh Koul
?
Deep learning for person re-identification
Deep learning for person re-identificationDeep learning for person re-identification
Deep learning for person re-identification
學叫 孱
?
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present..."Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
"Machine Learning- based Image Compression: Ready for Prime Time?," a Present...
Edge AI and Vision Alliance
?
IRJET- Concepts, Methods and Applications of Neural Style Transfer: A Rev...
IRJET-  	  Concepts, Methods and Applications of Neural Style Transfer: A Rev...IRJET-  	  Concepts, Methods and Applications of Neural Style Transfer: A Rev...
IRJET- Concepts, Methods and Applications of Neural Style Transfer: A Rev...
IRJET Journal
?
Hao_Guan_2011
Hao_Guan_2011Hao_Guan_2011
Hao_Guan_2011
Hao Guan
?
Rapid object detection using boosted cascade of simple features
Rapid object detection using boosted  cascade of simple featuresRapid object detection using boosted  cascade of simple features
Rapid object detection using boosted cascade of simple features
Hirantha Pradeep
?

Recently uploaded (20)

A Comprehensive Investigation into the Accuracy of Soft Computing Tools for D...
A Comprehensive Investigation into the Accuracy of Soft Computing Tools for D...A Comprehensive Investigation into the Accuracy of Soft Computing Tools for D...
A Comprehensive Investigation into the Accuracy of Soft Computing Tools for D...
Journal of Soft Computing in Civil Engineering
?
Axial Capacity Estimation of FRP-strengthened Corroded Concrete Columns
Axial Capacity Estimation of FRP-strengthened Corroded Concrete ColumnsAxial Capacity Estimation of FRP-strengthened Corroded Concrete Columns
Axial Capacity Estimation of FRP-strengthened Corroded Concrete Columns
Journal of Soft Computing in Civil Engineering
?
Software Engineering Project Presentation Tanisha Tasnuva
Software Engineering Project Presentation Tanisha TasnuvaSoftware Engineering Project Presentation Tanisha Tasnuva
Software Engineering Project Presentation Tanisha Tasnuva
tanishatasnuva76
?
Research_Sensitization_&_Innovative_Project_Development.pptx
Research_Sensitization_&_Innovative_Project_Development.pptxResearch_Sensitization_&_Innovative_Project_Development.pptx
Research_Sensitization_&_Innovative_Project_Development.pptx
niranjancse
?
芙坪茶氏Y創_Chain of Thought .
芙坪茶氏Y創_Chain of Thought                           .芙坪茶氏Y創_Chain of Thought                           .
芙坪茶氏Y創_Chain of Thought .
鰻粥京晦粥皆幄塀氏芙
?
Pruebas y Solucion de problemas empresariales en redes de Fibra Optica
Pruebas y Solucion de problemas empresariales en redes de Fibra OpticaPruebas y Solucion de problemas empresariales en redes de Fibra Optica
Pruebas y Solucion de problemas empresariales en redes de Fibra Optica
OmarAlfredoDelCastil
?
The first edition of the AIAG-VDA FMEA.pptx
The first edition of the AIAG-VDA FMEA.pptxThe first edition of the AIAG-VDA FMEA.pptx
The first edition of the AIAG-VDA FMEA.pptx
Mayank Mathur
?
fHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghj
fHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghjfHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghj
fHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghj
yadavshivank2006
?
MODULE 6 - 1 VAE - Variational Autoencoder
MODULE 6 - 1 VAE - Variational AutoencoderMODULE 6 - 1 VAE - Variational Autoencoder
MODULE 6 - 1 VAE - Variational Autoencoder
DivyaMeenaS
?
Artificial Power 2025 raport krajobrazowy
Artificial Power 2025 raport krajobrazowyArtificial Power 2025 raport krajobrazowy
Artificial Power 2025 raport krajobrazowy
dominikamizerska1
?
International Journal of Advance Robotics & Expert Systems (JARES)
International Journal of Advance Robotics & Expert Systems (JARES)International Journal of Advance Robotics & Expert Systems (JARES)
International Journal of Advance Robotics & Expert Systems (JARES)
jaresjournal868
?
Electrical and Electronics Engineering: An International Journal (ELELIJ)
Electrical and Electronics Engineering: An International Journal (ELELIJ)Electrical and Electronics Engineering: An International Journal (ELELIJ)
Electrical and Electronics Engineering: An International Journal (ELELIJ)
elelijjournal653
?
Tree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbb
Tree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbbTree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbb
Tree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbb
RATNANITINPATIL
?
ANFIS Models with Subtractive Clustering and Fuzzy C-Mean Clustering Techniqu...
ANFIS Models with Subtractive Clustering and Fuzzy C-Mean Clustering Techniqu...ANFIS Models with Subtractive Clustering and Fuzzy C-Mean Clustering Techniqu...
ANFIS Models with Subtractive Clustering and Fuzzy C-Mean Clustering Techniqu...
Journal of Soft Computing in Civil Engineering
?
IOt Based Research on Challenges and Future
IOt Based Research on Challenges and FutureIOt Based Research on Challenges and Future
IOt Based Research on Challenges and Future
SACHINSAHU821405
?
Third Review PPT that consists of the project d etails like abstract.
Third Review PPT that consists of the project d etails like abstract.Third Review PPT that consists of the project d etails like abstract.
Third Review PPT that consists of the project d etails like abstract.
Sowndarya6
?
May 2025: Top 10 Read Articles Advanced Information Technology
May 2025: Top 10 Read Articles Advanced Information TechnologyMay 2025: Top 10 Read Articles Advanced Information Technology
May 2025: Top 10 Read Articles Advanced Information Technology
ijait
?
Structural Design for Residential-to-Restaurant Conversion
Structural Design for Residential-to-Restaurant ConversionStructural Design for Residential-to-Restaurant Conversion
Structural Design for Residential-to-Restaurant Conversion
DanielRoman285499
?
Presentacio?n Tomograf┴a Axial Computarizada
Presentacio?n Tomograf┴a Axial ComputarizadaPresentacio?n Tomograf┴a Axial Computarizada
Presentacio?n Tomograf┴a Axial Computarizada
Juliana Ovalle Jim└nez
?
New Microsoft Office Word Documentfrf.docx
New Microsoft Office Word Documentfrf.docxNew Microsoft Office Word Documentfrf.docx
New Microsoft Office Word Documentfrf.docx
misheetasah
?
Software Engineering Project Presentation Tanisha Tasnuva
Software Engineering Project Presentation Tanisha TasnuvaSoftware Engineering Project Presentation Tanisha Tasnuva
Software Engineering Project Presentation Tanisha Tasnuva
tanishatasnuva76
?
Research_Sensitization_&_Innovative_Project_Development.pptx
Research_Sensitization_&_Innovative_Project_Development.pptxResearch_Sensitization_&_Innovative_Project_Development.pptx
Research_Sensitization_&_Innovative_Project_Development.pptx
niranjancse
?
Pruebas y Solucion de problemas empresariales en redes de Fibra Optica
Pruebas y Solucion de problemas empresariales en redes de Fibra OpticaPruebas y Solucion de problemas empresariales en redes de Fibra Optica
Pruebas y Solucion de problemas empresariales en redes de Fibra Optica
OmarAlfredoDelCastil
?
The first edition of the AIAG-VDA FMEA.pptx
The first edition of the AIAG-VDA FMEA.pptxThe first edition of the AIAG-VDA FMEA.pptx
The first edition of the AIAG-VDA FMEA.pptx
Mayank Mathur
?
fHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghj
fHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghjfHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghj
fHUINhKG5lM1WBBk608.pptxfhjjhhjffhiuhhghj
yadavshivank2006
?
MODULE 6 - 1 VAE - Variational Autoencoder
MODULE 6 - 1 VAE - Variational AutoencoderMODULE 6 - 1 VAE - Variational Autoencoder
MODULE 6 - 1 VAE - Variational Autoencoder
DivyaMeenaS
?
Artificial Power 2025 raport krajobrazowy
Artificial Power 2025 raport krajobrazowyArtificial Power 2025 raport krajobrazowy
Artificial Power 2025 raport krajobrazowy
dominikamizerska1
?
International Journal of Advance Robotics & Expert Systems (JARES)
International Journal of Advance Robotics & Expert Systems (JARES)International Journal of Advance Robotics & Expert Systems (JARES)
International Journal of Advance Robotics & Expert Systems (JARES)
jaresjournal868
?
Electrical and Electronics Engineering: An International Journal (ELELIJ)
Electrical and Electronics Engineering: An International Journal (ELELIJ)Electrical and Electronics Engineering: An International Journal (ELELIJ)
Electrical and Electronics Engineering: An International Journal (ELELIJ)
elelijjournal653
?
Tree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbb
Tree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbbTree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbb
Tree_Traversals.pptbbbbbbbbbbbbbbbbbbbbbbbbb
RATNANITINPATIL
?
IOt Based Research on Challenges and Future
IOt Based Research on Challenges and FutureIOt Based Research on Challenges and Future
IOt Based Research on Challenges and Future
SACHINSAHU821405
?
Third Review PPT that consists of the project d etails like abstract.
Third Review PPT that consists of the project d etails like abstract.Third Review PPT that consists of the project d etails like abstract.
Third Review PPT that consists of the project d etails like abstract.
Sowndarya6
?
May 2025: Top 10 Read Articles Advanced Information Technology
May 2025: Top 10 Read Articles Advanced Information TechnologyMay 2025: Top 10 Read Articles Advanced Information Technology
May 2025: Top 10 Read Articles Advanced Information Technology
ijait
?
Structural Design for Residential-to-Restaurant Conversion
Structural Design for Residential-to-Restaurant ConversionStructural Design for Residential-to-Restaurant Conversion
Structural Design for Residential-to-Restaurant Conversion
DanielRoman285499
?
Presentacio?n Tomograf┴a Axial Computarizada
Presentacio?n Tomograf┴a Axial ComputarizadaPresentacio?n Tomograf┴a Axial Computarizada
Presentacio?n Tomograf┴a Axial Computarizada
Juliana Ovalle Jim└nez
?
New Microsoft Office Word Documentfrf.docx
New Microsoft Office Word Documentfrf.docxNew Microsoft Office Word Documentfrf.docx
New Microsoft Office Word Documentfrf.docx
misheetasah
?
Ad

LFI-CAM: Learning Feature Importance for Better Visual Explanation

  • 1. Trustworthy AI LFI-CAM: Learning Feature Importance for Better Visual Explanation Kwang Hee Lee, Chaewon Park, Junghyun Oh, Nojun Kwak
  • 2. Trustworthy AI Overview ? We propose a novel architecture, LFI-CAM, which is trainable for image classification and visual explanation in an end-to-end manner. ? LFI-CAM¨s attention map generated during forward propagation, leverages to improve the classification performance through the attention mechanism. ? Feature Importance Network (FIN) focuses on learning the feature importance instead of directly learning the attention map to obtain a more reliable and consistent attention map
  • 3. Trustworthy AI Related Works Perturbation-based Method: LIME, RISE Response-based Method: CAM Gradient-based Method: Grad-CAM, Grad-CAM++ Hybrid-based Method: Score-CAM, ABN
  • 4. Trustworthy AI Related Works ? Class Activation Mapping
  • 6. Trustworthy AI Related Works ? LIME (Local Interpretable Model-agnostic Explanations) Ribeiro MT, Singh S, Guestrin C. " Why should i trust you?" Explaining the predictions of any classifier. InProceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining 2016 Aug 13 (pp. 1135-1144).
  • 7. Trustworthy AI Related Works ? RISE (Randomized input sampling for explanation of black-box models) Petsiuk V, Das A, Saenko K. Rise: Randomized input sampling for explanation of black-box models. arXiv preprint arXiv:1806.07421. 2018 Jun 19.
  • 9. Trustworthy AI Related Works ? Attention Branch Network (ABN)
  • 10. Trustworthy AI Motivation ? Conventional CAM variants still have the limitations such as classification performance degradation, multiple forward computing, and additive backpropagation. ? In particular, ABN outputs unreliable and inconsistent attention maps through several experiments with various hyper-parameters. ? Although Score-CAM achieves better visual performance with less noise and better stability than the gradient-based approaches, multiple forward computing makes the generation of visual explanation very slow. ? Inspired by ABN and Score-CAM, we proposed a new architecture for image classification and visual explanation in an end-to-end manner and outputs a more reliable and consistent attention map. ABN ABN LFI-CAM LFI-CAM
  • 12. Trustworthy AI Feature Importance Network (Attention Branch) ? While ABN directly learns the class activation map in the attention branch, we replaced the ABN model¨s attention branch with a new network architecture, ^Feature Importance Network (FIN) ̄. ? FIN helps the LFI-CAM model learn the feature importance to generate better class activation map than ABN. ? The attention map is generated by the weighted sum of the feature maps from the last convolutional layer and the learned feature importance vector. ? To learn the feature importance, in a similar way to the Score-CAM, we convert the original input into a gray image, which is down-sampled to the size of the feature map.
  • 13. Trustworthy AI Perception Branch ? The perception branch takes the original input image as an input and outputs the final probability of each class. ? The attention map is applied to the feature maps by the attention mechanism. ? The attention mechanism helps the attention map improve the classification performance.
  • 14. Trustworthy AI Training ? LFI-CAM model is trained in an end-to-end manner using training loss calculated as the combination of the Softmax function and cross- entropy at the perception branch in image classification task. ? The FIN is optimized by the attention mechanism of the perception branch to improve the classification accuracy without any additional loss function.
  • 15. Trustworthy AI Experiments ? Experimental Settings on Image Classification ? Datasets: CIFAR10, CIFAR100, STL10, Cat&Dog and ImageNet ? Baseline Models: CIFAR ResNet backbone (ResNet 20, 32, 44, 56, 110) ImageNet ResNet backbone (ResNet 18, 34, 50, 101, 152) ? Optimizer and Hyper-parameters ? SGD with momentum ? 300 epochs for CIFAR10,100, STL10 and Cat&Dog ? 90 epochs for ImageNet ? Learning rate is initialized with 0.1 and later on divided by 10 at 50% and 75% of the total number of training epochs ? Batch size of 128 for CIFAR ResNet backbone ? Batch size of 256 for ImageNet ResNet backbone
  • 16. Trustworthy AI Experiments- Visual Explanation Evaluation
  • 17. Trustworthy AI Experiments- Visual Explanation Evaluation
  • 18. Trustworthy AI Experiments- Effectiveness of Feature Importance Network
  • 19. Trustworthy AI Experiments- Accuracy on Image Classification
  • 20. Trustworthy AI Experiments- Stability Evaluation of Visual Explanation
  • 21. Trustworthy AI Summary ? We proposed the LFI-CAM model, which is trainable for image classification and produces better visual explanation in an end-to-end manner. ? Feature Importance Network (FIN) helps our model focus on learning the feature importance to generate a more stable and reliable attention map. ? LFI-CAM is on par with ABN in terms of classification accuracy and outmatches ABN in terms of attention map quality.