This document discusses how Netflix shifts the availability curve by optimizing tools and culture for engineering velocity. It describes Netflix's use of tools like Asgard, Aminator, and Chaos Monkey to automate deployments, create machine images, and test resiliency. The document also emphasizes giving engineers freedom and responsibility, a blameless culture, and representing best practices in tooling to support innovation and rapid experimentation at Netflix.
10. Support Experimentation
!
How We Built a Predictive
Autoscaling Engine
http://techblog.netflix.com/2013/11/scryer-netflixs-predictive-auto-scaling.html
15. Netflix Build Language
? Based on Gradle
? Internal and Open Source
? Gradle Summit talk:
http://www.slideshare.net/quidryan/gradle-summit-2014-nebula
https://github.com/nebula-plugins
16. Jenkins Job DSL
Configuration as Code
Groovy Script
Scripts go in Version Control
http://www.slideshare.net/quidryan/configuration-as-code
17. Aminator
Create AMI from Base AMI
Image contains service and everything needed to run it
Builds Unit of Deployment for Test and Prod
Abstracts Cloud Details
http://techblog.netflix.com/2013/03/ami-creation-with-aminator.html
18. Asgard
Deploys Netflix to the Cloud
Red/Black push
Developed to address delays in rollback
http://www.infoq.com/presentations/asgard
19. Red/Black Push
? Scale up new instances while running the old version
? Cloud Native
? Turn on traffic to new ASG
? Canary Analysis
? Turn off traffic to old ASG
? Wait ¡ Analyze ¡ Roll Back?
20. Canary Analysis
!
? Production Deployment Pattern
? Compare Metrics vs. Baseline Version
? ¡°Canary Analyze All The Things: How we learned to Keep Calm and Release
Often¡±, Roy Rapoport
www.slideshare.net/royrapoport/20140612-q-con-canary-analysis
21. Continuous Delivery Workflow
Support the Journey
Judges between Stages
Represent Best Practices
http://techblog.netflix.com/2013/09/glisten-groovy-way-to-use-amazons.html
28. Conformity Monkey
Have Deployments Diverged?
? Balance Regional Consistency
with Regional Isolation
? Build Best Practices into Tooling
and Reporting
29. Janitor Monkey
Reduce Cognitive Load and Cost
? Remove unused instances
? Uniform way to clean up
30. Shifting the Curve with Tools at Netflix
? Value Self-Service
? Test Everywhere
? Awareness of Multiple Regions
? Best Practices Represented in Tooling
? Recover Quickly and Easily
? Be Cloud Native
? Respect the Journey
31. Shifting the Curve with Culture at
Netflix
? Free the People!
? Context not Control
? Freedom to Experiment
? Blameless Culture
32. ¡°As the number of applications and the scale of the campaign's
AWS infrastructure use climbed, the DevOps team shifted to
using Asgard¡ªan open-source tool developed by Netflix to
manage cloud deployments.¡±
ArsTechnica, November 2012