Gain insight into some of the details of the OWASP Top 10 Call for Data and industry survey, and what we were attempting to learn. Hear what was learned from collecting and analyzing widely varying industry data and from attempting to build a dataset for comparison and analysis. This talk discusses tips and common pitfalls for structuring vulnerability data and the subsequent analysis. Learn what the data can tell us and what questions are still left unanswered. Uncover some of the differences in collecting metrics at different stages of the software lifecycle, along with recommendations for handling them.
Wrangling OWASP Top 10 Data at BSides Pittsburgh (PGH)
2. OWASP TOP 10 OVERVIEW
- First version was released in 2003
- Updated in 2004, 2007, 2010, 2013, 2017
- Started as an awareness document
- Now widely considered the global baseline
- Is a standard for vendors to measure against
3. OWASP TOP 10-2017 RC1
- April 2017
- Controversy over the first release candidate
- Two new categories in RC1:
  - A7 Insufficient Attack Protection
  - A10 Underprotected APIs
- Social media got ugly
4. BLOG POSTS
- Decided to do a little research and analysis
- Reviewed the history of Top 10 development
- Analyzed the public data
- Wrote two blog posts
5. DATA COLLECTION
- Original desire was for full public attribution
- This meant many contributors didn't participate
- Contributions ended up coming mostly from consultants and vendors
- Hope to figure out a better way for 2020
6. HUMAN-AUGMENTED TOOLS (HAT) VS. TOOL-AUGMENTED HUMANS (TAH)
- Frequency of findings
- Context (or lack thereof)
- Natural curiosity
- Scalability
- Consistency
13. OWASP SUMMIT JUNE 2017
- Original leadership resigned right before the Summit
- I was there for the SAMM working sessions
- The Top 10 had working sessions as well
- Asked to help with data analysis for the Top 10
14. OWASP TOP 10-2017
- New plan:
  - Expanded data call, one of the largest ever at ~114k applications
  - Industry survey to select 2 of the 10 categories
  - Fully open process on GitHub
  - Actively translating into multiple languages: en, es, fr, he, id, ja, ko
18. DATA CALL RESULTS
- A change from frequency to incidence rate
- Extended data call added more contributions: Veracode, Checkmarx, Micro Focus (Fortify), Synopsys, Bugcrowd
- Data for over 114,000 applications
22. DATA CALL RESULTS
- Percentage of submitting organizations that found at least one instance in that vulnerability category
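
That metric is easy to reproduce once submissions are in a common shape. Below is a minimal sketch in Python, assuming a hypothetical list of per-organization submissions with per-CWE finding counts; the organization names, field names, and numbers are illustrative, not the actual data call format.

```python
from collections import defaultdict

# Hypothetical submissions: one entry per contributing organization,
# with the number of applications in which each CWE category was found.
submissions = [
    {"org": "VendorA",      "findings": {"CWE-79": 120, "CWE-89": 45}},
    {"org": "ConsultancyB", "findings": {"CWE-79": 3,   "CWE-287": 7}},
    {"org": "VendorC",      "findings": {"CWE-89": 0,   "CWE-287": 12}},
]

def incidence_by_org(subs):
    """Percentage of submitting organizations that found at least one
    instance in each vulnerability category (the metric on this slide)."""
    orgs_with_hit = defaultdict(int)
    for sub in subs:
        for cwe, count in sub["findings"].items():
            if count > 0:
                orgs_with_hit[cwe] += 1
    return {cwe: 100.0 * hits / len(subs) for cwe, hits in orgs_with_hit.items()}

print(incidence_by_org(submissions))
# Roughly: {'CWE-79': 66.7, 'CWE-89': 33.3, 'CWE-287': 66.7}
```

Note how raw frequency would rank CWE-79 far ahead of CWE-287 in this toy data, while the incidence-style view treats them the same; that is the kind of shift the change from frequency to incidence rate produces.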
23. WHAT CAN THE DATA TELL US
- Humans still find more diverse vulnerabilities
- Tools only look for what they know about
- Tools can scale on a subset of tests
- You need both
- We aren't looking for everything
24. WHAT CAN THE DATA NOT TELL US
- Is a language or framework more susceptible?
- Are the problems systemic or one-off?
- Is developer training effective?
- Are IDE plug-ins effective?
- How unique are the findings?
- Consistent mapping?
- Still only seeing part of the picture
25. VULN DATA IN PROD VS TESTING
[Chart: Number of Vulnerabilities in Production]
26. VULN DATA IN PROD VS TESTING
[Chart: Security Defects in Testing]
27. VULN DATA STRUCTURES
- CWE reference
- Related app
- Date
- Language/framework
- Point in the process found
- Severity (CVSS/CWSS/something)
- Verified
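
One way to keep those fields consistent across contributors is to define a record type up front. Here is a minimal sketch using Python dataclasses; the class name, field names, and Phase values simply mirror the list above and are not an official submission schema.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum

class Phase(Enum):
    """Point in the process where the finding was identified."""
    DESIGN = "design"
    DEVELOPMENT = "development"
    TESTING = "testing"
    PRODUCTION = "production"

@dataclass
class VulnRecord:
    cwe_id: str              # CWE reference, e.g. "CWE-79"
    app_id: str              # related app (anonymized identifier)
    found_on: date           # date the finding was recorded
    language: str            # language/framework, e.g. "Java/Spring"
    phase: Phase             # point in the process found
    severity: str            # CVSS/CWSS vector or score, kept as a plain string here
    verified: bool = False   # has a human confirmed the finding?

# Example record (illustrative values only)
record = VulnRecord("CWE-89", "app-0042", date(2017, 6, 1),
                    "Java/Spring", Phase.TESTING, "CVSS:3.0 7.5", True)
```

Keeping severity as a raw string avoids forcing every contributor into one scoring system, at the cost of normalizing it later during analysis.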
29. WHAT ABOUT TRAINING DATA?
- How are you measuring training?
- Are you correlating data from training to testing automation? (see the sketch after this list)
- Can you track down to the dev?
- Do you know your Top 10?
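
If findings can be tracked down to the developer, correlating training with later testing results is a simple before/after comparison. A minimal sketch, assuming hypothetical training-completion dates and finding records keyed by developer ID; none of this reflects a real data set.

```python
from datetime import date

# Hypothetical records: training completion date and test findings per developer.
training_completed = {"alice": date(2017, 3, 1), "bob": date(2017, 4, 15)}
findings = [
    {"dev": "alice", "cwe": "CWE-89", "found": date(2017, 2, 10)},
    {"dev": "alice", "cwe": "CWE-89", "found": date(2017, 6, 20)},
    {"dev": "bob",   "cwe": "CWE-79", "found": date(2017, 5, 2)},
]

def before_after_counts(training_completed, findings):
    """Per developer, count findings logged before vs. after their training date."""
    counts = {dev: {"before": 0, "after": 0} for dev in training_completed}
    for f in findings:
        trained_on = training_completed.get(f["dev"])
        if trained_on is None:
            continue  # no training record for this developer; skip the comparison
        counts[f["dev"]]["before" if f["found"] < trained_on else "after"] += 1
    return counts

print(before_after_counts(training_completed, findings))
# {'alice': {'before': 1, 'after': 1}, 'bob': {'before': 0, 'after': 1}}
```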
30. WHAT CAN YOU DO?
- Think about what story to tell, then figure out what data is needed to tell that story
- Structure your data collection
- Keep your data as clean and accurate as possible (see the sketch after this list)
- Write stories
- Consider contributing to Top 10 2020
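
Structuring the collection and keeping it clean mostly comes down to rejecting malformed records early. Below is a minimal sketch of one possible validation pass over dict-shaped records; the required-field list and the CWE format check are illustrative assumptions, not a prescribed standard.

```python
import re

REQUIRED_FIELDS = {"cwe_id", "app_id", "found_on", "phase"}  # assumed minimum
CWE_PATTERN = re.compile(r"^CWE-\d+$")

def clean(records):
    """Split records into kept (required fields present, well-formed CWE id) and rejected."""
    kept, rejected = [], []
    for rec in records:
        ok = REQUIRED_FIELDS <= rec.keys() and CWE_PATTERN.match(str(rec.get("cwe_id", "")))
        (kept if ok else rejected).append(rec)
    return kept, rejected

kept, rejected = clean([
    {"cwe_id": "CWE-79", "app_id": "app-1", "found_on": "2017-06-01", "phase": "testing"},
    {"cwe_id": "xss", "app_id": "app-2"},  # bad CWE id and missing fields -> rejected
])
print(len(kept), len(rejected))  # 1 1
```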