際際滷

際際滷Share a Scribd company logo
Finding Bugs in Deep Learning
Programs
SoftWare Analytics
and Technologies Lab
S.W.A.T
Foutse Khomh, PhD, Ing.
foutse.khomh@polymtl.ca
@SWATLab
Canada CIFAR AI Chair on Trustworthy Machine Learning Systems
Trustworthy Engineering of AI Software
1
Quality Assurance of ML-enabled systems
System evolution & continuous delivery
2
Reliable system
Data Health Model Health
Code Health
Time
Distribution shift
Data imbalance
Coding errors
Noisy labels
Underspecification
Bias
Vulnerabilities
Deep Learning is being rapidly adopted in industry
3
Preliminary
preparation
Data
Collection
Data Preprocessing
Model
Implementation
Model
Training
Model
Evaluation
Model
Tuning
Data postprocessing
Model
Prediction
DL development phases produce a lot of code!
Han et al., What do Programmers Discuss about Deep Learning Frameworks
DL programs can be faulty!
5
6
Multi-dimensional space of DL bugs
Model
Issues
not enough learning
capacity
non-optimal
regularization
correct
incorrect feature
extraction
incorrect gradient
computation

Implementation Issues
correct
7
TOSEM21
NeuraLint : A linter for DL programs
 Capture defects early, so saves rework cost.
 Less expensive, because it doesnt require
execution.
 Find defects in seconds.
 
NeuraLint is fast and effective!
 It achieves an accuracy of 91.7 % .
 It correctly reported 18 additional bugs that were
not found by developers.
 The average execution time of NeuraLint for the
studied TensorFlow and Keras based programs are
2.892 and 3.197 seconds respectively.
Try it out!
NeuraLint has two pillars
8
A meta-model of DL programs Taxonomy of common DL faults
Gunel Jahangirova, Nargiz Humbatova, Gabriele Bavota, Vincenzo Riccio, Andrea Stocco, and Paolo Tonella. 2019. Taxonomy of Real Faults in Deep
Learning Systems. arXiv preprint arXiv:1910.11015
NeuraLint: Execution Flow
9
Graph transformation
Rules
Potential issues
Program
Original program
Model Extraction
Model
Run
List of detected Issues
TheDeepChecker outperforms AWS SMD
 DL coding bugs and misconfigurations are detected
with (precision, recall), respectively, equal to (90%,
96.4%) and (77%, 83.3%).
 Finds 30% more defects than AWS SageMaker.
10
TOSEM22
TheDeepChecker : Dynamic testing of DL programs
 Capture defects during the training process.
 Less expensive than testing the resulting model.
 Some overhead on the training process.

Try it out!
11
TheDeepChecker verification rules
Parameters-related Issues Untrained Parameters
Poor Weight Initialization
Parameters Values Divergence
Parameters Unstable Learning
Activation-related Issues Activations out of Range
Neuron Saturation
Dead ReLU
Optimization-related Issues Unable to fit a small sample
Zero Loss
Diverging Loss
Slow or Non decreasing Loss
Loss Fluctuations
Unstable Gradient: Exploding
Unstable Gradient: Vanishing
Parameters-related Issues Untrained Parameters
Given a layer  and  iterations

0
= 
1
,
0
= 
1

1
= 
2
,
1
= 
2


1
= 

,
1
= 

Issue
Poor Weight Initialization
Parameters Values Divergence
Given a layer  and an iteration 



 

+1


 
+1
   [0,   1]
Verification
Routine
Parameters Unstable Learning
TheDeepChecker verification rules
13
Activation-related Issues Activations out of Range
Given a layer 
基基  [, ]
Issue
Neuron Saturation
Given a layer 
  基基  
Verification
Routine
Dead ReLU
Specification of verification rules
14
Optimization-related Issues Unable to fit a small sample
The DNN could not properly
minimize the loss.
Issue
Zero Loss
Diverging Loss
Slow or Non decreasing Loss
The DNN (with regularization off)
should overfit a tiny sample of data.
Given N iterations
 = 0
Verification
Routine
Loss Fluctuations
Unstable Gradient: Exploding
Unstable Gradient: Vanishing
TheDeepChecker verification rules
15
Program
Pre-processing
Post-processing
Verification
Routines
Potential issues
Program
Original program
Monitoring
Monitored Program
Run
Sanity Check of Program
TheDeepChecker: Execution Flow
TheDeepChecker outperforms AWS SMD
 DL coding bugs and misconfigurations are detected
with (precision, recall), respectively, equal to (90%,
96.4%) and (77%, 83.3%).
 Finds 30% more defects than AWS SageMaker.
16
TOSEM22
TheDeepChecker : Dynamic testing of DL programs
 Capture defects during the training process.
 Less expensive than testing the resulting model.
 Some overhead on the training process.

Try it out!
17
Foutse Khomh, PhD, Ing.
foutse.khomh@polymtl.ca
@SWATLab
Canada CIFAR AI Chair
Emmanuel Thepie Fapi
18
Try it out!

More Related Content

Similar to Talk-Foutse-SrangeLoop.pdf (20)

Soft quality & standards
Soft quality & standardsSoft quality & standards
Soft quality & standards
Prince Bhanwra
Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...
Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...
Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...
TEST Huddle
Black box-software-testing-douglas-hoffman2483
Black box-software-testing-douglas-hoffman2483Black box-software-testing-douglas-hoffman2483
Black box-software-testing-douglas-hoffman2483
Chaitanya Kn
Webinar: Question Answering and Virtual Assistants with Deep Learning
Webinar: Question Answering and Virtual Assistants with Deep LearningWebinar: Question Answering and Virtual Assistants with Deep Learning
Webinar: Question Answering and Virtual Assistants with Deep Learning
Lucidworks
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
Roger Barga
Testing and Mocking Object - The Art of Mocking.
Testing and Mocking Object - The Art of Mocking.Testing and Mocking Object - The Art of Mocking.
Testing and Mocking Object - The Art of Mocking.
Deepak Singhvi
Using the Machine to predict Testability
Using the Machine to predict TestabilityUsing the Machine to predict Testability
Using the Machine to predict Testability
Miguel Lopez
Mlcc #4
Mlcc #4Mlcc #4
Mlcc #4
Chung-Hsiang Ofa Hsueh
How to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated TestingHow to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated Testing
TechWell
Testing concepts [3] - Software Testing Techniques (CIS640)
Testing concepts [3] - Software Testing Techniques (CIS640)Testing concepts [3] - Software Testing Techniques (CIS640)
Testing concepts [3] - Software Testing Techniques (CIS640)
Venkatesh Prasad Ranganath
Making the Unstable Stable - An Intro To Testing
Making the Unstable Stable - An Intro To TestingMaking the Unstable Stable - An Intro To Testing
Making the Unstable Stable - An Intro To Testing
Cameron Presley
midterm_fa08.pdf
midterm_fa08.pdfmidterm_fa08.pdf
midterm_fa08.pdf
RavinderKSingla
test
testtest
test
gikrauss
test
testtest
test
gikrauss
Building functional Quality Gates with ReportPortal
Building functional Quality Gates with ReportPortalBuilding functional Quality Gates with ReportPortal
Building functional Quality Gates with ReportPortal
Dmitriy Gumeniuk
Defect free development - QS Tag2019
Defect free development - QS Tag2019Defect free development - QS Tag2019
Defect free development - QS Tag2019
Arnon Axelrod
Designing Self-maintaining UI Tests for Web Applications
Designing Self-maintaining UI Tests for Web ApplicationsDesigning Self-maintaining UI Tests for Web Applications
Designing Self-maintaining UI Tests for Web Applications
TechWell
Testing & implementation system 1-wm
Testing & implementation system 1-wmTesting & implementation system 1-wm
Testing & implementation system 1-wm
Wiwik Muslehatin
Metric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleMetric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in Oracle
Steve Karam
[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...
[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...
[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...
DataScienceConferenc1
Soft quality & standards
Soft quality & standardsSoft quality & standards
Soft quality & standards
Prince Bhanwra
Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...
Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...
Testing As A Bottleneck - How Testing Slows Down Modern Development Processes...
TEST Huddle
Black box-software-testing-douglas-hoffman2483
Black box-software-testing-douglas-hoffman2483Black box-software-testing-douglas-hoffman2483
Black box-software-testing-douglas-hoffman2483
Chaitanya Kn
Webinar: Question Answering and Virtual Assistants with Deep Learning
Webinar: Question Answering and Virtual Assistants with Deep LearningWebinar: Question Answering and Virtual Assistants with Deep Learning
Webinar: Question Answering and Virtual Assistants with Deep Learning
Lucidworks
Barga Data Science lecture 10
Barga Data Science lecture 10Barga Data Science lecture 10
Barga Data Science lecture 10
Roger Barga
Testing and Mocking Object - The Art of Mocking.
Testing and Mocking Object - The Art of Mocking.Testing and Mocking Object - The Art of Mocking.
Testing and Mocking Object - The Art of Mocking.
Deepak Singhvi
Using the Machine to predict Testability
Using the Machine to predict TestabilityUsing the Machine to predict Testability
Using the Machine to predict Testability
Miguel Lopez
How to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated TestingHow to Actually DO High-volume Automated Testing
How to Actually DO High-volume Automated Testing
TechWell
Testing concepts [3] - Software Testing Techniques (CIS640)
Testing concepts [3] - Software Testing Techniques (CIS640)Testing concepts [3] - Software Testing Techniques (CIS640)
Testing concepts [3] - Software Testing Techniques (CIS640)
Venkatesh Prasad Ranganath
Making the Unstable Stable - An Intro To Testing
Making the Unstable Stable - An Intro To TestingMaking the Unstable Stable - An Intro To Testing
Making the Unstable Stable - An Intro To Testing
Cameron Presley
Building functional Quality Gates with ReportPortal
Building functional Quality Gates with ReportPortalBuilding functional Quality Gates with ReportPortal
Building functional Quality Gates with ReportPortal
Dmitriy Gumeniuk
Defect free development - QS Tag2019
Defect free development - QS Tag2019Defect free development - QS Tag2019
Defect free development - QS Tag2019
Arnon Axelrod
Designing Self-maintaining UI Tests for Web Applications
Designing Self-maintaining UI Tests for Web ApplicationsDesigning Self-maintaining UI Tests for Web Applications
Designing Self-maintaining UI Tests for Web Applications
TechWell
Testing & implementation system 1-wm
Testing & implementation system 1-wmTesting & implementation system 1-wm
Testing & implementation system 1-wm
Wiwik Muslehatin
Metric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in OracleMetric Abuse: Frequently Misused Metrics in Oracle
Metric Abuse: Frequently Misused Metrics in Oracle
Steve Karam
[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...
[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...
[DSC Europe 22] Testing Machine Learning Systems: What it is and why you shou...
DataScienceConferenc1

More from Foutse Khomh (11)

Foutse_MSR Vision keynote.pptx
Foutse_MSR Vision keynote.pptxFoutse_MSR Vision keynote.pptx
Foutse_MSR Vision keynote.pptx
Foutse Khomh
Stack overflow code_laundering
Stack overflow code_launderingStack overflow code_laundering
Stack overflow code_laundering
Foutse Khomh
Mining the Relationship between Anti-patterns Dependencies and Fault-Proneness
Mining the Relationship between Anti-patterns Dependencies and Fault-PronenessMining the Relationship between Anti-patterns Dependencies and Fault-Proneness
Mining the Relationship between Anti-patterns Dependencies and Fault-Proneness
Foutse Khomh
Predicting bugs using antipatterns
Predicting bugs using antipatternsPredicting bugs using antipatterns
Predicting bugs using antipatterns
Foutse Khomh
How does Context affect the Distribution of Software Maintainability Metrics?
How does Context affect the Distribution of Software Maintainability Metrics?How does Context affect the Distribution of Software Maintainability Metrics?
How does Context affect the Distribution of Software Maintainability Metrics?
Foutse Khomh
On Rapid Releases and Software Testing
On Rapid Releases and Software TestingOn Rapid Releases and Software Testing
On Rapid Releases and Software Testing
Foutse Khomh
Adapting Linux for Mobile Platforms: An Empirical Study of Android
Adapting Linux for Mobile Platforms: An Empirical Study of AndroidAdapting Linux for Mobile Platforms: An Empirical Study of Android
Adapting Linux for Mobile Platforms: An Empirical Study of Android
Foutse Khomh
Recovering Commit Dependencies for Selective Code Integration in Software Pro...
Recovering Commit Dependencies for Selective Code Integration in Software Pro...Recovering Commit Dependencies for Selective Code Integration in Software Pro...
Recovering Commit Dependencies for Selective Code Integration in Software Pro...
Foutse Khomh
Late Propagation in Software Clones
Late Propagation in Software ClonesLate Propagation in Software Clones
Late Propagation in Software Clones
Foutse Khomh
An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...
An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...
An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...
Foutse Khomh
Do Faster Releases Improve Software Quality?
Do Faster Releases Improve Software Quality? Do Faster Releases Improve Software Quality?
Do Faster Releases Improve Software Quality?
Foutse Khomh
Foutse_MSR Vision keynote.pptx
Foutse_MSR Vision keynote.pptxFoutse_MSR Vision keynote.pptx
Foutse_MSR Vision keynote.pptx
Foutse Khomh
Stack overflow code_laundering
Stack overflow code_launderingStack overflow code_laundering
Stack overflow code_laundering
Foutse Khomh
Mining the Relationship between Anti-patterns Dependencies and Fault-Proneness
Mining the Relationship between Anti-patterns Dependencies and Fault-PronenessMining the Relationship between Anti-patterns Dependencies and Fault-Proneness
Mining the Relationship between Anti-patterns Dependencies and Fault-Proneness
Foutse Khomh
Predicting bugs using antipatterns
Predicting bugs using antipatternsPredicting bugs using antipatterns
Predicting bugs using antipatterns
Foutse Khomh
How does Context affect the Distribution of Software Maintainability Metrics?
How does Context affect the Distribution of Software Maintainability Metrics?How does Context affect the Distribution of Software Maintainability Metrics?
How does Context affect the Distribution of Software Maintainability Metrics?
Foutse Khomh
On Rapid Releases and Software Testing
On Rapid Releases and Software TestingOn Rapid Releases and Software Testing
On Rapid Releases and Software Testing
Foutse Khomh
Adapting Linux for Mobile Platforms: An Empirical Study of Android
Adapting Linux for Mobile Platforms: An Empirical Study of AndroidAdapting Linux for Mobile Platforms: An Empirical Study of Android
Adapting Linux for Mobile Platforms: An Empirical Study of Android
Foutse Khomh
Recovering Commit Dependencies for Selective Code Integration in Software Pro...
Recovering Commit Dependencies for Selective Code Integration in Software Pro...Recovering Commit Dependencies for Selective Code Integration in Software Pro...
Recovering Commit Dependencies for Selective Code Integration in Software Pro...
Foutse Khomh
Late Propagation in Software Clones
Late Propagation in Software ClonesLate Propagation in Software Clones
Late Propagation in Software Clones
Foutse Khomh
An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...
An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...
An Entropy Evaluation Approach for Triaging Field Crashes: A Case Study of Mo...
Foutse Khomh
Do Faster Releases Improve Software Quality?
Do Faster Releases Improve Software Quality? Do Faster Releases Improve Software Quality?
Do Faster Releases Improve Software Quality?
Foutse Khomh

Recently uploaded (20)

Software+Bill+of+Materials+Starter+Guide (1).pdf
Software+Bill+of+Materials+Starter+Guide (1).pdfSoftware+Bill+of+Materials+Starter+Guide (1).pdf
Software+Bill+of+Materials+Starter+Guide (1).pdf
kedofef453
e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025
madni07kp
Radio BOSS Advanced 7.0.5 Crack Free Download 2025
Radio BOSS Advanced 7.0.5 Crack Free Download 2025Radio BOSS Advanced 7.0.5 Crack Free Download 2025
Radio BOSS Advanced 7.0.5 Crack Free Download 2025
f17489615
Siemens Solid Edge crack + Activation Key Download 2025
Siemens Solid Edge crack + Activation Key Download 2025Siemens Solid Edge crack + Activation Key Download 2025
Siemens Solid Edge crack + Activation Key Download 2025
shahg123tt
Custom AI Chatbot Development Services.pdf
Custom AI Chatbot Development Services.pdfCustom AI Chatbot Development Services.pdf
Custom AI Chatbot Development Services.pdf
Belitsoft
Marketo Engage North America Virtual User Group: Adobe Summit 2025 recap
Marketo Engage North America Virtual User Group: Adobe Summit 2025 recapMarketo Engage North America Virtual User Group: Adobe Summit 2025 recap
Marketo Engage North America Virtual User Group: Adobe Summit 2025 recap
BradBedford3
Adobe Substance 3D Designer 14.0.0.8074 Crack 2025
Adobe Substance 3D Designer 14.0.0.8074 Crack 2025Adobe Substance 3D Designer 14.0.0.8074 Crack 2025
Adobe Substance 3D Designer 14.0.0.8074 Crack 2025
abdulbasitayoubi25
GraphPad Prism 10.3.0.507 Crack Free Download
GraphPad Prism 10.3.0.507 Crack Free DownloadGraphPad Prism 10.3.0.507 Crack Free Download
GraphPad Prism 10.3.0.507 Crack Free Download
madni07kp
Top Online Food Ordering Script Company - Become Vendor
Top Online Food Ordering Script Company - Become VendorTop Online Food Ordering Script Company - Become Vendor
Top Online Food Ordering Script Company - Become Vendor
Kevin Miller
Real Converter Pro 7.819 Crack Free Download
Real Converter Pro 7.819 Crack Free DownloadReal Converter Pro 7.819 Crack Free Download
Real Converter Pro 7.819 Crack Free Download
alihamzakpa03
e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025
basitayoubi104
TubeMate Downloader 5.18. Crack + License Key 2025
TubeMate Downloader 5.18. Crack + License Key 2025TubeMate Downloader 5.18. Crack + License Key 2025
TubeMate Downloader 5.18. Crack + License Key 2025
shanbhai08kp
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Eric D. Schabell
E-Ptw Area Map - TECH EHS Solution
E-Ptw Area Map - TECH EHS Solution E-Ptw Area Map - TECH EHS Solution
E-Ptw Area Map - TECH EHS Solution
TECH EHS Solution
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio, Inc.
Coreldraw 2021 Crack Latest Version 2025
Coreldraw 2021 Crack Latest Version 2025Coreldraw 2021 Crack Latest Version 2025
Coreldraw 2021 Crack Latest Version 2025
blouch31kp
View Companion Premium 15.40 Crack Free Download 2025
View Companion Premium 15.40 Crack Free Download 2025View Companion Premium 15.40 Crack Free Download 2025
View Companion Premium 15.40 Crack Free Download 2025
asghar045kp
Software Ideas Modeler Ultimate 14.62 Crack Free Download
Software Ideas Modeler Ultimate 14.62 Crack Free DownloadSoftware Ideas Modeler Ultimate 14.62 Crack Free Download
Software Ideas Modeler Ultimate 14.62 Crack Free Download
alihamzakpa01
Evolution and Examples of Java Features, from Java 1.7 to Java 24
Evolution and Examples of Java Features, from Java 1.7 to Java 24Evolution and Examples of Java Features, from Java 1.7 to Java 24
Evolution and Examples of Java Features, from Java 1.7 to Java 24
Yann-Ga谷l Gu辿h辿neuc
Internet Download Manager (IDM) Crack + Lisence key Latest version 2025
Internet Download Manager (IDM) Crack + Lisence key Latest version 2025Internet Download Manager (IDM) Crack + Lisence key Latest version 2025
Internet Download Manager (IDM) Crack + Lisence key Latest version 2025
shahzad011kp
Software+Bill+of+Materials+Starter+Guide (1).pdf
Software+Bill+of+Materials+Starter+Guide (1).pdfSoftware+Bill+of+Materials+Starter+Guide (1).pdf
Software+Bill+of+Materials+Starter+Guide (1).pdf
kedofef453
e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025
madni07kp
Radio BOSS Advanced 7.0.5 Crack Free Download 2025
Radio BOSS Advanced 7.0.5 Crack Free Download 2025Radio BOSS Advanced 7.0.5 Crack Free Download 2025
Radio BOSS Advanced 7.0.5 Crack Free Download 2025
f17489615
Siemens Solid Edge crack + Activation Key Download 2025
Siemens Solid Edge crack + Activation Key Download 2025Siemens Solid Edge crack + Activation Key Download 2025
Siemens Solid Edge crack + Activation Key Download 2025
shahg123tt
Custom AI Chatbot Development Services.pdf
Custom AI Chatbot Development Services.pdfCustom AI Chatbot Development Services.pdf
Custom AI Chatbot Development Services.pdf
Belitsoft
Marketo Engage North America Virtual User Group: Adobe Summit 2025 recap
Marketo Engage North America Virtual User Group: Adobe Summit 2025 recapMarketo Engage North America Virtual User Group: Adobe Summit 2025 recap
Marketo Engage North America Virtual User Group: Adobe Summit 2025 recap
BradBedford3
Adobe Substance 3D Designer 14.0.0.8074 Crack 2025
Adobe Substance 3D Designer 14.0.0.8074 Crack 2025Adobe Substance 3D Designer 14.0.0.8074 Crack 2025
Adobe Substance 3D Designer 14.0.0.8074 Crack 2025
abdulbasitayoubi25
GraphPad Prism 10.3.0.507 Crack Free Download
GraphPad Prism 10.3.0.507 Crack Free DownloadGraphPad Prism 10.3.0.507 Crack Free Download
GraphPad Prism 10.3.0.507 Crack Free Download
madni07kp
Top Online Food Ordering Script Company - Become Vendor
Top Online Food Ordering Script Company - Become VendorTop Online Food Ordering Script Company - Become Vendor
Top Online Food Ordering Script Company - Become Vendor
Kevin Miller
Real Converter Pro 7.819 Crack Free Download
Real Converter Pro 7.819 Crack Free DownloadReal Converter Pro 7.819 Crack Free Download
Real Converter Pro 7.819 Crack Free Download
alihamzakpa03
e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025e-World Tech PHPMaker Crack 2024.14 Download 2025
e-World Tech PHPMaker Crack 2024.14 Download 2025
basitayoubi104
TubeMate Downloader 5.18. Crack + License Key 2025
TubeMate Downloader 5.18. Crack + License Key 2025TubeMate Downloader 5.18. Crack + License Key 2025
TubeMate Downloader 5.18. Crack + License Key 2025
shanbhai08kp
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Mastering Fluent Bit: Ultimate Guide to Integrating Telemetry Pipelines with ...
Eric D. Schabell
E-Ptw Area Map - TECH EHS Solution
E-Ptw Area Map - TECH EHS Solution E-Ptw Area Map - TECH EHS Solution
E-Ptw Area Map - TECH EHS Solution
TECH EHS Solution
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio Webinar | Inside Deepseek 3FS: A Deep Dive into AI-Optimized Distribu...
Alluxio, Inc.
Coreldraw 2021 Crack Latest Version 2025
Coreldraw 2021 Crack Latest Version 2025Coreldraw 2021 Crack Latest Version 2025
Coreldraw 2021 Crack Latest Version 2025
blouch31kp
View Companion Premium 15.40 Crack Free Download 2025
View Companion Premium 15.40 Crack Free Download 2025View Companion Premium 15.40 Crack Free Download 2025
View Companion Premium 15.40 Crack Free Download 2025
asghar045kp
Software Ideas Modeler Ultimate 14.62 Crack Free Download
Software Ideas Modeler Ultimate 14.62 Crack Free DownloadSoftware Ideas Modeler Ultimate 14.62 Crack Free Download
Software Ideas Modeler Ultimate 14.62 Crack Free Download
alihamzakpa01
Evolution and Examples of Java Features, from Java 1.7 to Java 24
Evolution and Examples of Java Features, from Java 1.7 to Java 24Evolution and Examples of Java Features, from Java 1.7 to Java 24
Evolution and Examples of Java Features, from Java 1.7 to Java 24
Yann-Ga谷l Gu辿h辿neuc
Internet Download Manager (IDM) Crack + Lisence key Latest version 2025
Internet Download Manager (IDM) Crack + Lisence key Latest version 2025Internet Download Manager (IDM) Crack + Lisence key Latest version 2025
Internet Download Manager (IDM) Crack + Lisence key Latest version 2025
shahzad011kp

Talk-Foutse-SrangeLoop.pdf

  • 1. Finding Bugs in Deep Learning Programs SoftWare Analytics and Technologies Lab S.W.A.T Foutse Khomh, PhD, Ing. foutse.khomh@polymtl.ca @SWATLab Canada CIFAR AI Chair on Trustworthy Machine Learning Systems Trustworthy Engineering of AI Software 1
  • 2. Quality Assurance of ML-enabled systems System evolution & continuous delivery 2 Reliable system Data Health Model Health Code Health Time Distribution shift Data imbalance Coding errors Noisy labels Underspecification Bias Vulnerabilities
  • 3. Deep Learning is being rapidly adopted in industry 3
  • 4. Preliminary preparation Data Collection Data Preprocessing Model Implementation Model Training Model Evaluation Model Tuning Data postprocessing Model Prediction DL development phases produce a lot of code! Han et al., What do Programmers Discuss about Deep Learning Frameworks
  • 5. DL programs can be faulty! 5
  • 6. 6 Multi-dimensional space of DL bugs Model Issues not enough learning capacity non-optimal regularization correct incorrect feature extraction incorrect gradient computation Implementation Issues correct
  • 7. 7 TOSEM21 NeuraLint : A linter for DL programs Capture defects early, so saves rework cost. Less expensive, because it doesnt require execution. Find defects in seconds. NeuraLint is fast and effective! It achieves an accuracy of 91.7 % . It correctly reported 18 additional bugs that were not found by developers. The average execution time of NeuraLint for the studied TensorFlow and Keras based programs are 2.892 and 3.197 seconds respectively. Try it out!
  • 8. NeuraLint has two pillars 8 A meta-model of DL programs Taxonomy of common DL faults Gunel Jahangirova, Nargiz Humbatova, Gabriele Bavota, Vincenzo Riccio, Andrea Stocco, and Paolo Tonella. 2019. Taxonomy of Real Faults in Deep Learning Systems. arXiv preprint arXiv:1910.11015
  • 9. NeuraLint: Execution Flow 9 Graph transformation Rules Potential issues Program Original program Model Extraction Model Run List of detected Issues
  • 10. TheDeepChecker outperforms AWS SMD DL coding bugs and misconfigurations are detected with (precision, recall), respectively, equal to (90%, 96.4%) and (77%, 83.3%). Finds 30% more defects than AWS SageMaker. 10 TOSEM22 TheDeepChecker : Dynamic testing of DL programs Capture defects during the training process. Less expensive than testing the resulting model. Some overhead on the training process. Try it out!
  • 11. 11 TheDeepChecker verification rules Parameters-related Issues Untrained Parameters Poor Weight Initialization Parameters Values Divergence Parameters Unstable Learning Activation-related Issues Activations out of Range Neuron Saturation Dead ReLU Optimization-related Issues Unable to fit a small sample Zero Loss Diverging Loss Slow or Non decreasing Loss Loss Fluctuations Unstable Gradient: Exploding Unstable Gradient: Vanishing
  • 12. Parameters-related Issues Untrained Parameters Given a layer and iterations 0 = 1 , 0 = 1 1 = 2 , 1 = 2 1 = , 1 = Issue Poor Weight Initialization Parameters Values Divergence Given a layer and an iteration +1 +1 [0, 1] Verification Routine Parameters Unstable Learning TheDeepChecker verification rules
  • 13. 13 Activation-related Issues Activations out of Range Given a layer 基基 [, ] Issue Neuron Saturation Given a layer 基基 Verification Routine Dead ReLU Specification of verification rules
  • 14. 14 Optimization-related Issues Unable to fit a small sample The DNN could not properly minimize the loss. Issue Zero Loss Diverging Loss Slow or Non decreasing Loss The DNN (with regularization off) should overfit a tiny sample of data. Given N iterations = 0 Verification Routine Loss Fluctuations Unstable Gradient: Exploding Unstable Gradient: Vanishing TheDeepChecker verification rules
  • 16. TheDeepChecker outperforms AWS SMD DL coding bugs and misconfigurations are detected with (precision, recall), respectively, equal to (90%, 96.4%) and (77%, 83.3%). Finds 30% more defects than AWS SageMaker. 16 TOSEM22 TheDeepChecker : Dynamic testing of DL programs Capture defects during the training process. Less expensive than testing the resulting model. Some overhead on the training process. Try it out!
  • 17. 17 Foutse Khomh, PhD, Ing. foutse.khomh@polymtl.ca @SWATLab Canada CIFAR AI Chair Emmanuel Thepie Fapi