Designing and building post compromise recoverable services

•Download as PPTX, PDF•

1 like•1,470 views

A look at how to design and build services, systems, networks, hosts and applications that are designed to be able to successfully deal with a security compromise. The deck also touches on the topics of self-healing systems and potential applications of machine learning to the problem space.

Designing and building post compromise
recoverable services
Ollie Whitehouse

Why?
"We may be at the point of diminishing returns by trying to buy
down vulnerability"
"maybe it’s time to place more emphasis on coping with the
consequences of a successful attack, and trying to develop
networks that can ‘self-heal’ or ‘self-limit’ the damages inflicted
upon them”
Gen. Michael Hayden (USAF-Ret.) ex NSA and CIA head
February, 2012

Agenda
• Stages of a compromise
• Impact limitation
• Healing
• Requirements for:
• design
• build
• operations
• Wrap-up and conclusions

Healing – old wisdom / not practical
rebuild & reinstall everything
down to bare metal
(to avoid whack-a-mole and persistence)

Healing – reality
remediate, re-establish trust & re-integrate
(whilst continuing to provide service,
avoiding whack-a-mole & persistence)

The requirements
design, development
and operations

Design
• Packaging, testing &
deployment
• Boundaries
• Authentication
• System wide monitoring
• Isolation
• Operation while isolated

Design
• Roll-ability (not a word)
• Query-ability (not a word)
• Variable protection
• Integrity verification
• Frequency of checks

Design
• Health / normal
• Response
• if this then that
• Consider
• Machine learning for behaviours
• Rate limiting
• Something else

Development
• Staff & vendor education
• 3rd party components
• Source integrity
• Build environment integrity
• Build artefact integrity
• Archive releases
• Compromise unit test cases
• Test compromise scenarios

Operations
• Able to define ‘security healthy’
• Worse case scenario planning
• Configuration management
• Configuration integrity
• Protective monitoring
• Time-line capability
• Fire drill - continually

The requirements of tomorrow
self healing

Self-heal - steps
• Detect
• Verify integrity
• Understand and remediate
• Alert
• Segregate
• Snapshot
• Revert / Rebuild / Restart
• Verify
• Reintegrate

Self-heal – what is healthy?
• Client’s user behaviour
• Client’s software behaviour
• Client’s system behaviour
• Clients behaviour

Self-heal – what is healthy?
• Service behaviour
• Software behaviour
• System behaviour
• Network behaviour
• Operations / staff (and their credentials)

Putting it into practice
two (simplistic) examples
and one point for consideration

Example #1 (semi-passive response)
• Client SQLi
• Database dump – sequential record read
• Response taken
• Alerts raised
• Snapshots taken
… facilitates full post indecent analysis

Example #2 (active response)
• Ops client side attack
• Credentials stolen
• Anomalous credential behaviour
• Alerts sent
• Credentials automatically disabled
… exposure window minutes

Point for consideration
• Red and Blue teams
• Red team could be a
Netflix-esq simian army
• Blue team could be your
self-healing systems

Conclusions
• Design and implement compromise readiness
• Self learning / healing the future
• Plan for worse case*
• Test scenarios continually

Europe
Manchester - Head Ofﬁce
Cheltenham
Edinburgh
Leatherhead
London
Milton Keynes
Amsterdam
Copenhagen
Munich
Zurich
North America
Atlanta
Austin
Chicago
Mountain View
New York
San Francisco
Seattle
Australia
Sydney
Thanks! Questions?
ollie.whitehouse@nccgroup.com

Designing and building post compromise recoverable services

1. Designing and building post compromise recoverable services Ollie Whitehouse

2. Why? "We may be at the point of diminishing returns by trying to buy down vulnerability" "maybe it’s time to place more emphasis on coping with the consequences of a successful attack, and trying to develop networks that can ‘self-heal’ or ‘self-limit’ the damages inflicted upon them” Gen. Michael Hayden (USAF-Ret.) ex NSA and CIA head February, 2012

3. Why?

4. Agenda • Stages of a compromise • Impact limitation • Healing • Requirements for: • design • build • operations • Wrap-up and conclusions

5. Stages of a compromise

6. Stages of a compromise

7. Stages of a compromise

8. What can we do? Deny

9. What can we do? Frustrate

10. What can we do? Misdirect

11. What can we do? Contain

12. Services are unique

13. Indicator collection

14. Detection

15. Impact limitation

16. Healing – old wisdom / not practical rebuild & reinstall everything down to bare metal (to avoid whack-a-mole and persistence)

17. Healing – reality remediate, re-establish trust & re-integrate (whilst continuing to provide service, avoiding whack-a-mole & persistence)

18. Healing

19. Healing - configuration

20. Healing a live service

21. Healing – real world

22. The requirements design, development and operations

23. Design • Packaging, testing & deployment • Boundaries • Authentication • System wide monitoring • Isolation • Operation while isolated

24. Design • Roll-ability (not a word) • Query-ability (not a word) • Variable protection • Integrity verification • Frequency of checks

25. Design • Health / normal • Response • if this then that • Consider • Machine learning for behaviours • Rate limiting • Something else

26. Development • Staff & vendor education • 3rd party components • Source integrity • Build environment integrity • Build artefact integrity • Archive releases • Compromise unit test cases • Test compromise scenarios

27. Operations • Able to define ‘security healthy’ • Worse case scenario planning • Configuration management • Configuration integrity • Protective monitoring • Time-line capability • Fire drill - continually

28. The requirements of tomorrow self healing

29. Self-heal – defining states

30. Self-heal - steps • Detect • Verify integrity • Understand and remediate • Alert • Segregate • Snapshot • Revert / Rebuild / Restart • Verify • Reintegrate

31. Self-heal – what is healthy? • Client’s user behaviour • Client’s software behaviour • Client’s system behaviour • Clients behaviour

32. Self-heal – what is healthy? • Service behaviour • Software behaviour • System behaviour • Network behaviour • Operations / staff (and their credentials)

33. Putting it into practice two (simplistic) examples and one point for consideration

34. Example #1 (semi-passive response) • Client SQLi • Database dump – sequential record read • Response taken • Alerts raised • Snapshots taken … facilitates full post indecent analysis

35. Example #2 (active response) • Ops client side attack • Credentials stolen • Anomalous credential behaviour • Alerts sent • Credentials automatically disabled … exposure window minutes

36. Point for consideration • Red and Blue teams • Red team could be a Netflix-esq simian army • Blue team could be your self-healing systems

37. Conclusions • Design and implement compromise readiness • Self learning / healing the future • Plan for worse case* • Test scenarios continually

38. Europe Manchester - Head Ofﬁce Cheltenham Edinburgh Leatherhead London Milton Keynes Amsterdam Copenhagen Munich Zurich North America Atlanta Austin Chicago Mountain View New York San Francisco Seattle Australia Sydney Thanks! Questions? ollie.whitehouse@nccgroup.com

Editor's Notes

#13: These aren’t the only attack paths. For example you could attack upstream i.e.: Third party software components source repos. Customer threat actors could go after the service’s corporate IT etc.
#24: Packaging, testing & deployment Careful trust and architecture boundary considerations Kill passwords forever (2FA/MFA) Ability to easily monitor to varying degrees (live, log or full packet capture) Ability to easily isolate aspects while maintaining service Ability to easily operate while isolated from known compromised / good
#25: Ability to roll credentials / secrets Ability to query service properties, behaviour, performance etc. Ability to increase protective monitoring / active response Ability to verify integrity* (configuration, software, package, system, host, network etc..) Ability to increase integrity verification frequency
#26: Ability to define, model or learn healthy / normal Ability to define and execute reactions to events / situations if this then that Consider (less tried and tested – or ‘it worked in PhD project’) Machine learning for behaviours at all layers (we’ve seen this productized in a focused manner) Ability to rate or access limit functionality automatically and/or manually in high alert situations Something we’ve not considered
#27: Educate in defensive coding and functional design Consider 3rd party component integrity verification Ability to verify source control integrity Ability to verify build server integrity Ability to verify development to live assets integrity Archive releases (artefacts, source, test output and logs) Develop compromise unit test cases for functionality in systems and software Test compromise scenarios in pre-production
#28: Able to define ‘security healthy’ Plan for highest level of access compromise Ensure configuration management Ensure configuration integrity monitoring Protective monitoring and anomaly detection Have the ability to time-line across many distinct sources of data Take inspiration* from Netflix’s Simian Army and fire drill investigating, segregating, operating, rebuilding, repairing, rolling and reintegrating
#30: You need to be able to define system, network, host, software and service
#31: Integrity verification or other high confidence indicator Ability to identify likely root cause and remediate* Alert (operations) Opt out of operation Snapshot (machines / configuration / logs) Revert (to known good) Restart Verify Reintegrate
#32: Client’s user behaviour – needs to be learnt Client’s software behaviour – do we care? Clients system behaviour – do we care? Client behaviour – needs to be learnt
#33: Service behaviour – needs to be defined / modelled / learnt Software behaviour – needs to be defined / modelled / learnt System behaviour – needs to be defined / modelled / learnt Network behaviour – needs to be defined / modelled / learnt Operations / staff (and their credentials) behaviour
#35: Client’s database queries usually*(1) non sequential across records and non complete result sets*(2) Query observed doing select * from what is usually a source(*3) of the same base 75 queries Results return speed is rate limited*(4) with marginal effect Alert is raised to client security point of contact query, source, destination (including db and table), time and date reaction by system Snapshot database logs and source machinetaken into security incident zone for client / your analysis … facilitates full post incident analysis
#36: An operations desktop gets rolled by client side Credentials stolen and used at a higher rate*(1) than normal during non incident window*(2) or against systems not part of incident group*(3) Credentials used from hosts other than expected*(4) Alert sent to operations shift manager and security operations centre sources, destination, times and dates reaction by system Credentials automatically disabled … exposure window minutes
#37: One large company has Red and Blue teams Red always attacking the services Blue always looking trying to detect and mitigate Idea: Your Red team could be a Netflix-esq simian army Your Blue team could be your self-healing systems Result = If stuff isn’t happening then it’s broken!
#38: Services, systems and software need to be compromise ready – old school: Secure engineering Intrusion prevention Principal of least privilege Segregation Intrusion detection Current approaches revolve around: Event correlation / confidence indicators Human analysis and intervention Machine learning Modelling … it’s the way of the future …

�ݺ�ߣ

Designing and building post compromise recoverable services

More Related Content

Designing and building post compromise recoverable services

Editor's Notes