狠狠撸

狠狠撸Share a Scribd company logo
Darlene Goldstein 29 January 2003
Receiver Operating Characteristic
Methodology
Outline
? Introduction
? Hypothesis testing
? ROC curve
? Area under the ROC curve (AUC)
? Examples using ROC
? Concluding remarks
Introduction to ROC curves
? ROC = Receiver Operating Characteristic
? Started in electronic signal detection
theory (1940s - 1950s)
? Has become very popular in biomedical
applications, particularly radiology and
imaging
? Also used in machine learning applications to
assess classifiers
? Can be used to compare tests/procedures
ROC curves: simplest case
? Consider diagnostic test for a disease
? Test has 2 possible outcomes:
– ‘postive’ = suggesting presence of disease
– ‘negative’
? An individual can test either positive or
negative for the disease
? Prof. Mean...
Hypothesis testing refresher
? 2 ‘competing theories’ regarding a population
parameter:
– NULL hypothesis H (‘straw man’)
– ALTERNATIVE hypothesis A (‘claim’, or
theory you wish to test)
? H: NO DIFFERENCE
– any observed deviation from what we
expect to see is due to chance variability
? A: THE DIFFERENCE IS REAL
Test statistic
? Measure how far the observed data are from
what is expected assuming the NULL H by
computing the value of a test statistic (TS)
from the data
? The particular TS computed depends on the
parameter
? For example, to test the population mean ?,
the TS is the sample mean (or standardized
sample mean)
? The NULL is rejected fi the TS falls in a
user-specified ‘rejection region’
True disease state vs. Test result
not rejected rejected
No disease
(D = 0)
?
specificity
X
Type I error
(False +) ?
Disease
(D = 1)
X
Type II error
(False -) ?
?
Power 1 - ?;
sensitivity
Disease
Test
Specific Example
Test Result
Pts with
Pts with
disease
disease
Pts without
Pts without
the disease
the disease
Test Result
Call these patients “negative” Call these patients “positive”
Threshold
Test Result
Call these patients “negative” Call these patients “positive”
without the disease
with the disease
True Positives
Some definitions ...
Test Result
Call these patients “negative” Call these patients “positive”
without the disease
with the disease
False
Positives
Test Result
Call these patients “negative” Call these patients “positive”
without the disease
with the disease
True
negatives
Test Result
Call these patients “negative” Call these patients “positive”
without the disease
with the disease
False
negatives
Test Result
without the disease
with the disease
‘‘
‘‘-’’
-’’ ‘‘
‘‘+’’
+’’
Moving the Threshold: right
Test Result
without the disease
with the disease
‘‘
‘‘-’’
-’’ ‘‘
‘‘+’’
+’’
Moving the Threshold: left
True
Positive
Rate
(sensitivity)
0%
100%
False Positive Rate
(1-specificity)
0% 100%
ROC curve
True
Positive
Rate
0
%
100%
False Positive Rate
0
%
100%
True
Positive
Rate
0
%
100%
False Positive Rate
0
%
100%
A good test: A poor test:
ROC curve comparison
Best Test: Worst test:
True
Positive
Rate
0
%
100%
False Positive
Rate
0
%
100
%
True
Positive
Rate
0
%
100%
False Positive
Rate
0
%
100
%
The distributions
don’t overlap at all
The distributions
overlap completely
ROC curve extremes
‘Classical’ estimation
? Binormal model:
– X ~ N(0,1) in nondiseased population
– X ~ N(a, 1/b) in diseased population
? Then
ROC(t) = ?(a + b?-1
(t)) for 0 < t < 1
? Estimate a, b by ML using readings from
sets of diseased and nondiseased patients
ROC curve estimation with
continuous data
? Many biochemical measurements are in fact
continuous, e.g. blood glucose vs. diabetes
? Can also do ROC analysis for continuous (rather
than binary or ordinal) data
? Estimate ROC curve (and smooth) based on
empirical ‘survivor’ function (1 – cdf) in
diseased and nondiseased groups
? Can also do regression modeling of the test
result
? Another approach is to model the ROC curve
directlyas a function of covariates
Area under ROC curve (AUC)
? Overall measure of test performance
? Comparisons between two tests based on
differences between (estimated) AUC
? For continuous data, AUC equivalent to Mann-
Whitney U-statistic (nonparametric test of
difference in location between two
populations)
True
Positive
Rate
0
%
100%
False Positive
Rate
0
%
100
%
True
Positive
Rate
0
%
100%
False Positive
Rate
0
%
100
%
True
Positive
Rate
0
%
100%
False Positive
Rate
0
%
100
%
AUC = 50%
AUC = 90%
AUC = 65%
AUC = 100%
True
Positive
Rate
0
%
100%
False Positive
Rate
0
%
100
%
AUC for ROC curves
Interpretation of AUC
? AUC can be interpreted as the probability
that the test result from a randomly chosen
diseased individual is more indicative of
disease than that from a randomly chosen
nondiseased individual: P(Xi ? Xj | Di = 1, Dj = 0)
? So can think of this as a nonparametric
distance between disease/nondisease test
results
Problems with AUC
? No clinically relevant meaning
? A lot of the area is coming from the range of
large false positive values, no one cares what’s
going on in that region (need to examine
restricted regions)
? The curves might cross, so that there might
be a meaningful difference in performance
that is not picked up by AUC
Examples using ROC analysis
? Threshold selection for ‘tuning’ an already
trained classifier (e.g. neural nets)
? Defining signal thresholds in DNA microarrays
(Bilban et al.)
? Comparing test statistics for identifying
differentially expressed genes in replicated
microarray data (L?nnstedt and Speed)
? Assessing performance of different protein
prediction algorithms (Tang et al.)
? Inferring protein homology (Karwath and King)
Homology Induction ROC
Concluding remarks – remaining
challenges in ROC methodology
? Inference for ROC curve when no ‘gold standard’
? Role of ROC in combining information?
? Incorporating time into ROC analysis
? Alternatives to ROC for describing test
accuracy?
? Generalization of positive/negative predictive
value to continuous test?
(+/-) predictive value = proportion of patients with
(+/-) result who are correctly diagnosed
= True/(True + False)

More Related Content

Similar to Receiver Operating Characteristics SIB-ROC.ppt (20)

Evaluating diagnostic tests.pptx
Evaluating diagnostic tests.pptxEvaluating diagnostic tests.pptx
Evaluating diagnostic tests.pptx
angelabraver1
?
Validity and Screening Test
Validity and Screening TestValidity and Screening Test
Validity and Screening Test
Dr. Abraham Mallela
?
OAJ presentation final draft
OAJ presentation final draftOAJ presentation final draft
OAJ presentation final draft
Brian Eisen
?
Dr Amit Diagnostic Tests.pptx
Dr Amit Diagnostic Tests.pptxDr Amit Diagnostic Tests.pptx
Dr Amit Diagnostic Tests.pptx
AmitBhondve3
?
Tests of diagnostic accuracy
Tests of diagnostic accuracyTests of diagnostic accuracy
Tests of diagnostic accuracy
Simba Takuva
?
Validity and reliability of screening/ diagnostic tests
Validity and reliability of screening/ diagnostic testsValidity and reliability of screening/ diagnostic tests
Validity and reliability of screening/ diagnostic tests
werenjoka
?
screening-140217071714-phpapp02.pdf
screening-140217071714-phpapp02.pdfscreening-140217071714-phpapp02.pdf
screening-140217071714-phpapp02.pdf
SYEDZIYADFURQAN
?
(20180524) vuno seminar roc and extension
(20180524) vuno seminar   roc and extension(20180524) vuno seminar   roc and extension
(20180524) vuno seminar roc and extension
Kyuhwan Jung
?
Diagnotic and screening tests
Diagnotic and screening testsDiagnotic and screening tests
Diagnotic and screening tests
jfwilson2
?
Bio statistical analysis in clinical research
Bio statistical analysis  in clinical research  Bio statistical analysis  in clinical research
Bio statistical analysis in clinical research
Helwan University
?
Screening and diagnostic testing
Screening and diagnostic  testingScreening and diagnostic  testing
Screening and diagnostic testing
amitakashyap1
?
6 Screening in Public Health-converted.pdf
6 Screening in Public Health-converted.pdf6 Screening in Public Health-converted.pdf
6 Screening in Public Health-converted.pdf
KalJohn1
?
Epidemiological method to determine utility of a diagnostic test
Epidemiological method to determine utility of a diagnostic testEpidemiological method to determine utility of a diagnostic test
Epidemiological method to determine utility of a diagnostic test
Bhoj Raj Singh
?
Screening test (basic concepts)
Screening test (basic concepts)Screening test (basic concepts)
Screening test (basic concepts)
Tarek Tawfik Amin
?
Sensitivity, specificity and likelihood ratios
Sensitivity, specificity and likelihood ratiosSensitivity, specificity and likelihood ratios
Sensitivity, specificity and likelihood ratios
Chew Keng Sheng
?
05 diagnostic tests cwq
05 diagnostic tests cwq05 diagnostic tests cwq
05 diagnostic tests cwq
Sumit Prajapati
?
How to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curveHow to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curve
Samir Haffar
?
Evidence-based diagnosis
Evidence-based diagnosisEvidence-based diagnosis
Evidence-based diagnosis
Hesham Gaber
?
Testing and Screening Nov 2021-undergraduate.pptx
Testing and Screening Nov 2021-undergraduate.pptxTesting and Screening Nov 2021-undergraduate.pptx
Testing and Screening Nov 2021-undergraduate.pptx
msusaibrahim424
?
Evidence based diagnosis
Evidence based diagnosisEvidence based diagnosis
Evidence based diagnosis
Hesham Al-Inany
?
Evaluating diagnostic tests.pptx
Evaluating diagnostic tests.pptxEvaluating diagnostic tests.pptx
Evaluating diagnostic tests.pptx
angelabraver1
?
OAJ presentation final draft
OAJ presentation final draftOAJ presentation final draft
OAJ presentation final draft
Brian Eisen
?
Dr Amit Diagnostic Tests.pptx
Dr Amit Diagnostic Tests.pptxDr Amit Diagnostic Tests.pptx
Dr Amit Diagnostic Tests.pptx
AmitBhondve3
?
Tests of diagnostic accuracy
Tests of diagnostic accuracyTests of diagnostic accuracy
Tests of diagnostic accuracy
Simba Takuva
?
Validity and reliability of screening/ diagnostic tests
Validity and reliability of screening/ diagnostic testsValidity and reliability of screening/ diagnostic tests
Validity and reliability of screening/ diagnostic tests
werenjoka
?
screening-140217071714-phpapp02.pdf
screening-140217071714-phpapp02.pdfscreening-140217071714-phpapp02.pdf
screening-140217071714-phpapp02.pdf
SYEDZIYADFURQAN
?
(20180524) vuno seminar roc and extension
(20180524) vuno seminar   roc and extension(20180524) vuno seminar   roc and extension
(20180524) vuno seminar roc and extension
Kyuhwan Jung
?
Diagnotic and screening tests
Diagnotic and screening testsDiagnotic and screening tests
Diagnotic and screening tests
jfwilson2
?
Bio statistical analysis in clinical research
Bio statistical analysis  in clinical research  Bio statistical analysis  in clinical research
Bio statistical analysis in clinical research
Helwan University
?
Screening and diagnostic testing
Screening and diagnostic  testingScreening and diagnostic  testing
Screening and diagnostic testing
amitakashyap1
?
6 Screening in Public Health-converted.pdf
6 Screening in Public Health-converted.pdf6 Screening in Public Health-converted.pdf
6 Screening in Public Health-converted.pdf
KalJohn1
?
Epidemiological method to determine utility of a diagnostic test
Epidemiological method to determine utility of a diagnostic testEpidemiological method to determine utility of a diagnostic test
Epidemiological method to determine utility of a diagnostic test
Bhoj Raj Singh
?
Screening test (basic concepts)
Screening test (basic concepts)Screening test (basic concepts)
Screening test (basic concepts)
Tarek Tawfik Amin
?
Sensitivity, specificity and likelihood ratios
Sensitivity, specificity and likelihood ratiosSensitivity, specificity and likelihood ratios
Sensitivity, specificity and likelihood ratios
Chew Keng Sheng
?
How to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curveHow to read a receiver operating characteritic (ROC) curve
How to read a receiver operating characteritic (ROC) curve
Samir Haffar
?
Evidence-based diagnosis
Evidence-based diagnosisEvidence-based diagnosis
Evidence-based diagnosis
Hesham Gaber
?
Testing and Screening Nov 2021-undergraduate.pptx
Testing and Screening Nov 2021-undergraduate.pptxTesting and Screening Nov 2021-undergraduate.pptx
Testing and Screening Nov 2021-undergraduate.pptx
msusaibrahim424
?

More from ssuserb53446 (9)

Communications Systems Analogue Angle Modulation
Communications  Systems Analogue Angle ModulationCommunications  Systems Analogue Angle Modulation
Communications Systems Analogue Angle Modulation
ssuserb53446
?
Engineering Communication Systems Course
Engineering Communication Systems CourseEngineering Communication Systems Course
Engineering Communication Systems Course
ssuserb53446
?
Embedded Systems ARM Computer Architecture
Embedded Systems ARM Computer ArchitectureEmbedded Systems ARM Computer Architecture
Embedded Systems ARM Computer Architecture
ssuserb53446
?
introduction to database systems ch2.ppt
introduction to database systems ch2.pptintroduction to database systems ch2.ppt
introduction to database systems ch2.ppt
ssuserb53446
?
Asynchronous Transfer Mode Introductory Presentation
Asynchronous Transfer Mode Introductory PresentationAsynchronous Transfer Mode Introductory Presentation
Asynchronous Transfer Mode Introductory Presentation
ssuserb53446
?
Database System Concepts Chapter 1: Introduction to Databases
Database System Concepts Chapter 1: Introduction to DatabasesDatabase System Concepts Chapter 1: Introduction to Databases
Database System Concepts Chapter 1: Introduction to Databases
ssuserb53446
?
Entity relationship Models from Chapter 4
Entity relationship Models from Chapter  4Entity relationship Models from Chapter  4
Entity relationship Models from Chapter 4
ssuserb53446
?
This is an introduction to Database Relational Model
This is an introduction to Database Relational ModelThis is an introduction to Database Relational Model
This is an introduction to Database Relational Model
ssuserb53446
?
Database System Introductory Concepts and All
Database System Introductory Concepts and AllDatabase System Introductory Concepts and All
Database System Introductory Concepts and All
ssuserb53446
?
Communications Systems Analogue Angle Modulation
Communications  Systems Analogue Angle ModulationCommunications  Systems Analogue Angle Modulation
Communications Systems Analogue Angle Modulation
ssuserb53446
?
Engineering Communication Systems Course
Engineering Communication Systems CourseEngineering Communication Systems Course
Engineering Communication Systems Course
ssuserb53446
?
Embedded Systems ARM Computer Architecture
Embedded Systems ARM Computer ArchitectureEmbedded Systems ARM Computer Architecture
Embedded Systems ARM Computer Architecture
ssuserb53446
?
introduction to database systems ch2.ppt
introduction to database systems ch2.pptintroduction to database systems ch2.ppt
introduction to database systems ch2.ppt
ssuserb53446
?
Asynchronous Transfer Mode Introductory Presentation
Asynchronous Transfer Mode Introductory PresentationAsynchronous Transfer Mode Introductory Presentation
Asynchronous Transfer Mode Introductory Presentation
ssuserb53446
?
Database System Concepts Chapter 1: Introduction to Databases
Database System Concepts Chapter 1: Introduction to DatabasesDatabase System Concepts Chapter 1: Introduction to Databases
Database System Concepts Chapter 1: Introduction to Databases
ssuserb53446
?
Entity relationship Models from Chapter 4
Entity relationship Models from Chapter  4Entity relationship Models from Chapter  4
Entity relationship Models from Chapter 4
ssuserb53446
?
This is an introduction to Database Relational Model
This is an introduction to Database Relational ModelThis is an introduction to Database Relational Model
This is an introduction to Database Relational Model
ssuserb53446
?
Database System Introductory Concepts and All
Database System Introductory Concepts and AllDatabase System Introductory Concepts and All
Database System Introductory Concepts and All
ssuserb53446
?

Recently uploaded (20)

Cost sheet. with basics and formats of sheet
Cost sheet. with basics and formats of sheetCost sheet. with basics and formats of sheet
Cost sheet. with basics and formats of sheet
supreetk82004
?
Boosting MySQL with Vector Search Scale22X 2025.pdf
Boosting MySQL with Vector Search Scale22X 2025.pdfBoosting MySQL with Vector Search Scale22X 2025.pdf
Boosting MySQL with Vector Search Scale22X 2025.pdf
Alkin Tezuysal
?
Stasiun kernel pabrik kelapa sawit indonesia
Stasiun kernel pabrik kelapa sawit indonesiaStasiun kernel pabrik kelapa sawit indonesia
Stasiun kernel pabrik kelapa sawit indonesia
fikrimanurung1
?
Stasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesiaStasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesia
fikrimanurung1
?
iam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptxiam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptx
muhweziart
?
Introduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdfIntroduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdf
messagetome133
?
Introduction to database and analysis software’s suitable for.pptx
Introduction to database and analysis software’s suitable for.pptxIntroduction to database and analysis software’s suitable for.pptx
Introduction to database and analysis software’s suitable for.pptx
nabinparajuli9
?
MLecture 1 Introduction to AI . The basics.pptx
MLecture 1 Introduction to AI . The basics.pptxMLecture 1 Introduction to AI . The basics.pptx
MLecture 1 Introduction to AI . The basics.pptx
FaizaKhan720183
?
Analyzing Consumer Spending Trends and Purchasing Behavior
Analyzing Consumer Spending Trends and Purchasing BehaviorAnalyzing Consumer Spending Trends and Purchasing Behavior
Analyzing Consumer Spending Trends and Purchasing Behavior
omololaokeowo1
?
Class 3-Workforce profile updated P.pptx
Class 3-Workforce profile updated P.pptxClass 3-Workforce profile updated P.pptx
Class 3-Workforce profile updated P.pptx
angelananalucky
?
Presentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysisPresentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysis
vatsalsingla4
?
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
taqyed
?
The Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo Guru
The Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo GuruThe Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo Guru
The Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo Guru
kenyoncenteno12
?
stages-of-moral-development-lawrence-kohlberg-pdf-free.pdf
stages-of-moral-development-lawrence-kohlberg-pdf-free.pdfstages-of-moral-development-lawrence-kohlberg-pdf-free.pdf
stages-of-moral-development-lawrence-kohlberg-pdf-free.pdf
esguerramark1991
?
The truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptxThe truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptx
andyprosser3
?
643663189-Q4W3-Synthesize-Information-1-pptx.pptx
643663189-Q4W3-Synthesize-Information-1-pptx.pptx643663189-Q4W3-Synthesize-Information-1-pptx.pptx
643663189-Q4W3-Synthesize-Information-1-pptx.pptx
rossanthonytan130
?
Updated Willow 2025 Media Deck_Updated010325.pdf
Updated Willow 2025 Media Deck_Updated010325.pdfUpdated Willow 2025 Media Deck_Updated010325.pdf
Updated Willow 2025 Media Deck_Updated010325.pdf
tangramcommunication
?
Design Data Model Objects for Analytics, Activation, and AI
Design Data Model Objects for Analytics, Activation, and AIDesign Data Model Objects for Analytics, Activation, and AI
Design Data Model Objects for Analytics, Activation, and AI
aaronmwinters
?
Monitoring Imam Ririn di Pilkada Kota Depok 2024
Monitoring Imam Ririn di Pilkada Kota Depok 2024Monitoring Imam Ririn di Pilkada Kota Depok 2024
Monitoring Imam Ririn di Pilkada Kota Depok 2024
Deddy Rahman
?
Introduction to Java Programming for High School by 狠狠撸sgo.pptx
Introduction to Java Programming for High School by 狠狠撸sgo.pptxIntroduction to Java Programming for High School by 狠狠撸sgo.pptx
Introduction to Java Programming for High School by 狠狠撸sgo.pptx
mirhuzaifahali
?
Cost sheet. with basics and formats of sheet
Cost sheet. with basics and formats of sheetCost sheet. with basics and formats of sheet
Cost sheet. with basics and formats of sheet
supreetk82004
?
Boosting MySQL with Vector Search Scale22X 2025.pdf
Boosting MySQL with Vector Search Scale22X 2025.pdfBoosting MySQL with Vector Search Scale22X 2025.pdf
Boosting MySQL with Vector Search Scale22X 2025.pdf
Alkin Tezuysal
?
Stasiun kernel pabrik kelapa sawit indonesia
Stasiun kernel pabrik kelapa sawit indonesiaStasiun kernel pabrik kelapa sawit indonesia
Stasiun kernel pabrik kelapa sawit indonesia
fikrimanurung1
?
Stasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesiaStasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesia
fikrimanurung1
?
iam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptxiam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptx
muhweziart
?
Introduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdfIntroduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdf
messagetome133
?
Introduction to database and analysis software’s suitable for.pptx
Introduction to database and analysis software’s suitable for.pptxIntroduction to database and analysis software’s suitable for.pptx
Introduction to database and analysis software’s suitable for.pptx
nabinparajuli9
?
MLecture 1 Introduction to AI . The basics.pptx
MLecture 1 Introduction to AI . The basics.pptxMLecture 1 Introduction to AI . The basics.pptx
MLecture 1 Introduction to AI . The basics.pptx
FaizaKhan720183
?
Analyzing Consumer Spending Trends and Purchasing Behavior
Analyzing Consumer Spending Trends and Purchasing BehaviorAnalyzing Consumer Spending Trends and Purchasing Behavior
Analyzing Consumer Spending Trends and Purchasing Behavior
omololaokeowo1
?
Class 3-Workforce profile updated P.pptx
Class 3-Workforce profile updated P.pptxClass 3-Workforce profile updated P.pptx
Class 3-Workforce profile updated P.pptx
angelananalucky
?
Presentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysisPresentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysis
vatsalsingla4
?
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
taqyed
?
The Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo Guru
The Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo GuruThe Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo Guru
The Marketability of Rice Straw Yarn Among Selected Customers of Gantsilyo Guru
kenyoncenteno12
?
stages-of-moral-development-lawrence-kohlberg-pdf-free.pdf
stages-of-moral-development-lawrence-kohlberg-pdf-free.pdfstages-of-moral-development-lawrence-kohlberg-pdf-free.pdf
stages-of-moral-development-lawrence-kohlberg-pdf-free.pdf
esguerramark1991
?
The truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptxThe truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptx
andyprosser3
?
643663189-Q4W3-Synthesize-Information-1-pptx.pptx
643663189-Q4W3-Synthesize-Information-1-pptx.pptx643663189-Q4W3-Synthesize-Information-1-pptx.pptx
643663189-Q4W3-Synthesize-Information-1-pptx.pptx
rossanthonytan130
?
Updated Willow 2025 Media Deck_Updated010325.pdf
Updated Willow 2025 Media Deck_Updated010325.pdfUpdated Willow 2025 Media Deck_Updated010325.pdf
Updated Willow 2025 Media Deck_Updated010325.pdf
tangramcommunication
?
Design Data Model Objects for Analytics, Activation, and AI
Design Data Model Objects for Analytics, Activation, and AIDesign Data Model Objects for Analytics, Activation, and AI
Design Data Model Objects for Analytics, Activation, and AI
aaronmwinters
?
Monitoring Imam Ririn di Pilkada Kota Depok 2024
Monitoring Imam Ririn di Pilkada Kota Depok 2024Monitoring Imam Ririn di Pilkada Kota Depok 2024
Monitoring Imam Ririn di Pilkada Kota Depok 2024
Deddy Rahman
?
Introduction to Java Programming for High School by 狠狠撸sgo.pptx
Introduction to Java Programming for High School by 狠狠撸sgo.pptxIntroduction to Java Programming for High School by 狠狠撸sgo.pptx
Introduction to Java Programming for High School by 狠狠撸sgo.pptx
mirhuzaifahali
?

Receiver Operating Characteristics SIB-ROC.ppt

  • 1. Darlene Goldstein 29 January 2003 Receiver Operating Characteristic Methodology
  • 2. Outline ? Introduction ? Hypothesis testing ? ROC curve ? Area under the ROC curve (AUC) ? Examples using ROC ? Concluding remarks
  • 3. Introduction to ROC curves ? ROC = Receiver Operating Characteristic ? Started in electronic signal detection theory (1940s - 1950s) ? Has become very popular in biomedical applications, particularly radiology and imaging ? Also used in machine learning applications to assess classifiers ? Can be used to compare tests/procedures
  • 4. ROC curves: simplest case ? Consider diagnostic test for a disease ? Test has 2 possible outcomes: – ‘postive’ = suggesting presence of disease – ‘negative’ ? An individual can test either positive or negative for the disease ? Prof. Mean...
  • 5. Hypothesis testing refresher ? 2 ‘competing theories’ regarding a population parameter: – NULL hypothesis H (‘straw man’) – ALTERNATIVE hypothesis A (‘claim’, or theory you wish to test) ? H: NO DIFFERENCE – any observed deviation from what we expect to see is due to chance variability ? A: THE DIFFERENCE IS REAL
  • 6. Test statistic ? Measure how far the observed data are from what is expected assuming the NULL H by computing the value of a test statistic (TS) from the data ? The particular TS computed depends on the parameter ? For example, to test the population mean ?, the TS is the sample mean (or standardized sample mean) ? The NULL is rejected fi the TS falls in a user-specified ‘rejection region’
  • 7. True disease state vs. Test result not rejected rejected No disease (D = 0) ? specificity X Type I error (False +) ? Disease (D = 1) X Type II error (False -) ? ? Power 1 - ?; sensitivity Disease Test
  • 8. Specific Example Test Result Pts with Pts with disease disease Pts without Pts without the disease the disease
  • 9. Test Result Call these patients “negative” Call these patients “positive” Threshold
  • 10. Test Result Call these patients “negative” Call these patients “positive” without the disease with the disease True Positives Some definitions ...
  • 11. Test Result Call these patients “negative” Call these patients “positive” without the disease with the disease False Positives
  • 12. Test Result Call these patients “negative” Call these patients “positive” without the disease with the disease True negatives
  • 13. Test Result Call these patients “negative” Call these patients “positive” without the disease with the disease False negatives
  • 14. Test Result without the disease with the disease ‘‘ ‘‘-’’ -’’ ‘‘ ‘‘+’’ +’’ Moving the Threshold: right
  • 15. Test Result without the disease with the disease ‘‘ ‘‘-’’ -’’ ‘‘ ‘‘+’’ +’’ Moving the Threshold: left
  • 17. True Positive Rate 0 % 100% False Positive Rate 0 % 100% True Positive Rate 0 % 100% False Positive Rate 0 % 100% A good test: A poor test: ROC curve comparison
  • 18. Best Test: Worst test: True Positive Rate 0 % 100% False Positive Rate 0 % 100 % True Positive Rate 0 % 100% False Positive Rate 0 % 100 % The distributions don’t overlap at all The distributions overlap completely ROC curve extremes
  • 19. ‘Classical’ estimation ? Binormal model: – X ~ N(0,1) in nondiseased population – X ~ N(a, 1/b) in diseased population ? Then ROC(t) = ?(a + b?-1 (t)) for 0 < t < 1 ? Estimate a, b by ML using readings from sets of diseased and nondiseased patients
  • 20. ROC curve estimation with continuous data ? Many biochemical measurements are in fact continuous, e.g. blood glucose vs. diabetes ? Can also do ROC analysis for continuous (rather than binary or ordinal) data ? Estimate ROC curve (and smooth) based on empirical ‘survivor’ function (1 – cdf) in diseased and nondiseased groups ? Can also do regression modeling of the test result ? Another approach is to model the ROC curve directlyas a function of covariates
  • 21. Area under ROC curve (AUC) ? Overall measure of test performance ? Comparisons between two tests based on differences between (estimated) AUC ? For continuous data, AUC equivalent to Mann- Whitney U-statistic (nonparametric test of difference in location between two populations)
  • 22. True Positive Rate 0 % 100% False Positive Rate 0 % 100 % True Positive Rate 0 % 100% False Positive Rate 0 % 100 % True Positive Rate 0 % 100% False Positive Rate 0 % 100 % AUC = 50% AUC = 90% AUC = 65% AUC = 100% True Positive Rate 0 % 100% False Positive Rate 0 % 100 % AUC for ROC curves
  • 23. Interpretation of AUC ? AUC can be interpreted as the probability that the test result from a randomly chosen diseased individual is more indicative of disease than that from a randomly chosen nondiseased individual: P(Xi ? Xj | Di = 1, Dj = 0) ? So can think of this as a nonparametric distance between disease/nondisease test results
  • 24. Problems with AUC ? No clinically relevant meaning ? A lot of the area is coming from the range of large false positive values, no one cares what’s going on in that region (need to examine restricted regions) ? The curves might cross, so that there might be a meaningful difference in performance that is not picked up by AUC
  • 25. Examples using ROC analysis ? Threshold selection for ‘tuning’ an already trained classifier (e.g. neural nets) ? Defining signal thresholds in DNA microarrays (Bilban et al.) ? Comparing test statistics for identifying differentially expressed genes in replicated microarray data (L?nnstedt and Speed) ? Assessing performance of different protein prediction algorithms (Tang et al.) ? Inferring protein homology (Karwath and King)
  • 27. Concluding remarks – remaining challenges in ROC methodology ? Inference for ROC curve when no ‘gold standard’ ? Role of ROC in combining information? ? Incorporating time into ROC analysis ? Alternatives to ROC for describing test accuracy? ? Generalization of positive/negative predictive value to continuous test? (+/-) predictive value = proportion of patients with (+/-) result who are correctly diagnosed = True/(True + False)