Quilt
distributed load test tooling and infrastructure
About me
- Engineering Manager (Performance and Resiliency Test) @ BlueJeans
  - Have seen some of the worst outages!
  - Have simulated some real failures!
  - Now we have 99.95% availability!
  - Have a team of 10 engineers
- Interests
  - Java, Ruby, ZK, Cassandra, Couchbase, RMQ, Jenkins, CI/CD
  - AWS, RS, Google Compute Engine
  - Design and develop tools
  - Simulate failures
  - Develop Android apps!
  - Startups
- ~15 years
  - Developer, QA, Solution Architect, DevOps
  - 3 years @ BlueJeans
  - 6.5 years @ Apigee (XML parsing engine to 4G gateway)
  - 2 years @ BEA Systems
  - 3 years as owner, dev, customer management, etc.
Outages
Why Outages
- Performance issues
  - Natural load increase - poor software design, poor sizing, mc@bjn
  - Load spikes - Xiaomi@fk, sip-dos-attack@bjn
- Cascading failures
- Hardware failure
- Network switch failure
- Software crash
- 3rd party app failure - CB, ZK, RMQ, C*
- Memory leak
- Network blips/outage
  - Between the datacenters
  - Inside the datacenter!
What can we do?
- Performance
  - Load simulation
  - Monitoring systems
  - Comparing against benchmarks
  - Size the environment
- Resilience
  - Simulate failures
  - Evaluate the impact
AUTOMATE, AUTOMATE, AUTOMATE!!!
Load simulation - challenge
- Large-scale heterogeneous distributed system
- Real-time video/audio mixing
- Heavy RTP data transfer
- Multiple protocols - SIP, H.323, WebRTC, HLS, HTTP, WebSocket
- Geo-located (5-6 regions) & partitions
- AWS + datacenter hosting - scaling needs advance notice for the DC!
- Inter-region ZooKeeper lookup
Environment
Say, 20,000 concurrent EPs at peak load
It's complex!
Say, 20,000 concurrent EPs at peak load
1 EP (with the least quality)
150 kbps for video
80 kbps for audio
20,000 EPs
Rx: 230 kbps * 20K ≈ 4.6 Gbps
Tx ≈ 4.6 Gbps
*Apigee office network is 40 Mbps
CPU
1 m3.xlarge supports 30 EP simulators
~700 m3.xlarge instances for 20K
We generate terabytes of logs for each run!
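
The arithmetic behind these figures can be checked with a few lines of Ruby. This is only a back-of-the-envelope sketch: the constants come from the slide, while the method and constant names are made up for illustration and are not part of Quilt.

```ruby
# Back-of-the-envelope sizing for a load test, using the slide's figures.
# All names here are illustrative, not part of Quilt.

VIDEO_KBPS = 150        # per endpoint (EP), lowest quality
AUDIO_KBPS = 80         # per endpoint
EPS_PER_INSTANCE = 30   # EP simulators one m3.xlarge can host

# Aggregate bandwidth in Gbps for one direction (Rx or Tx).
def aggregate_gbps(endpoints, kbps_per_endpoint = VIDEO_KBPS + AUDIO_KBPS)
  endpoints * kbps_per_endpoint / 1_000_000.0
end

# Number of load-generator instances needed, rounded up.
def instances_needed(endpoints, per_instance = EPS_PER_INSTANCE)
  (endpoints.to_f / per_instance).ceil
end

puts aggregate_gbps(20_000)    # prints 4.6
puts instances_needed(20_000)  # prints 667
```

The instance count of 667 is the bare minimum for 20,000 EPs at 30 per machine, which is consistent with the roughly 700 instances quoted on the slide.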
EP simulators
- SIPp - open source
- Callgen - in-house developed H.323 simulator
- WebRTC - runs on a headless browser with Selenium
- Proprietary (mobile and thick clients) - in-house developed simulator
The tool
- Controller
  - Distribution logic - EP types, meeting IDs, client instances
  - Ruby
- Clients
  - Simulate EPs
  - HTTP API calls - Ruby
  - WebSocket - SockJS on Node.js
- RPC using dRuby
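
The slide does not show how the controller's distribution logic works. As a rough illustration only, the sketch below spreads EP simulation jobs over client instances in round-robin order. `Job`, `plan_distribution`, the meeting IDs and the client names are all hypothetical and do not describe Quilt's actual implementation.

```ruby
# Hypothetical sketch of a controller's distribution step: spread endpoint
# (EP) simulation jobs across client instances in round-robin order.
# Job, plan_distribution and the sample names are illustrative only.

Job = Struct.new(:ep_type, :meeting_id)

# ep_counts: { ep_type => number of endpoints to simulate }
# meetings:  list of meeting ids to spread the endpoints over
# clients:   list of client instance identifiers
# Returns { client => [Job, ...] }.
def plan_distribution(ep_counts, meetings, clients)
  plan = clients.to_h { |client| [client, []] }
  index = 0
  ep_counts.each do |ep_type, count|
    count.times do
      job = Job.new(ep_type, meetings[index % meetings.size])
      plan[clients[index % clients.size]] << job
      index += 1
    end
  end
  plan
end

plan = plan_distribution({ sip: 3, webrtc: 2 }, %w[m1 m2 m3], %w[client-a client-b])
plan.each do |client, jobs|
  puts "#{client}: #{jobs.map { |j| "#{j.ep_type}@#{j.meeting_id}" }.join(', ')}"
end
```

In a setup like the one on the slide, each entry of the plan would then be sent to its client over RPC; that step is left out here.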
The tool
Cost!
- The test is inevitable: no major release goes out without it
- Solution - optimal usage of AWS instances
- All automated
  - Bring up instances when needed
  - Setup - check out the latest build of the tool and copy it to the instances "concurrently"
  - Run the tests with real-time monitoring
  - Copy the logs to S3 "concurrently"
  - Bring down the instances
  - Analyze, debug, etc. - offline activity
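
The "concurrently" steps above can be sketched with a small pool of worker threads. This is a minimal illustration under stated assumptions: `run_concurrently` is a made-up helper, not Quilt's API, and the block stands in for whatever actually copies a build or uploads logs for one instance.

```ruby
# Hypothetical sketch: run one step (for example "copy the build" or
# "upload logs") against many instances at once with a bounded pool of
# worker threads. run_concurrently is illustrative, not Quilt's API.

# Calls the block once per host using at most `workers` threads and
# returns { host => result }. An exception raised by the block is stored
# as that host's result, so one bad instance does not abort the step.
def run_concurrently(hosts, workers: 10, &step)
  queue = Queue.new
  hosts.each { |host| queue << host }
  results = {}
  lock = Mutex.new

  threads = Array.new([workers, hosts.size].min) do
    Thread.new do
      loop do
        host = begin
          queue.pop(true)   # non-blocking pop
        rescue ThreadError
          break             # queue is drained, this worker is done
        end
        outcome = begin
          step.call(host)
        rescue StandardError => e
          e
        end
        lock.synchronize { results[host] = outcome }
      end
    end
  end
  threads.each(&:join)
  results
end

hosts = %w[10.0.0.1 10.0.0.2 10.0.0.3]
results = run_concurrently(hosts, workers: 2) { |host| "copied build to #{host}" }
results.each { |host, outcome| puts "#{host}: #{outcome}" }
```

Bounding the pool keeps the controller from opening hundreds of connections at once when the fleet grows to several hundred instances.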
The setup
- All automated
  - Bring up instances when needed
  - Setup - check out the latest build of the tool to the instances
  - Run the tests with real-time monitoring
  - Bring down the instances
  - Analyze, debug, etc. - offline activity
Our tool stack
- Quilt - sets up the infrastructure and simulates distributed load (described in the preceding slides)
- Analyzer - post-test analysis that collects metrics from sources such as Sensu, atop and New Relic; graphs generated with Highcharts
- Scoreboard - real-time monitoring
- Catapult - UI around Quilt that lets devs run the test themselves
- Goblin (being open-sourced and presented at Rootconf) - resiliency testing framework and utils
- Scout - the agent that resides in the RMZ of the system under test
- Rain - the new load generation framework in Node.js for testing BlueJeans Primetime. Scales to 100s of thousands!
Starting AWS instances
Associate Elastic IP
Terminate clients
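
The contents of these three slides are not reproduced in this transcript. As a rough illustration of such a bring-up and tear-down cycle, the sketch below assumes an `ec2` object that behaves like `Aws::EC2::Client` from the `aws-sdk-ec2` gem (`run_instances`, `wait_until`, `associate_address`, `terminate_instances`). The helper name `with_load_generators`, the AMI ID and the `FakeEC2` stand-in are hypothetical; the stand-in exists only so the sketch runs without AWS credentials.

```ruby
# Hypothetical sketch of the bring-up / run / bring-down cycle.
# `ec2` is assumed to behave like Aws::EC2::Client (aws-sdk-ec2 gem).
# The AMI id, counts and allocation id are placeholders.

def with_load_generators(ec2, image_id:, count:, instance_type: 'm3.xlarge', allocation_id: nil)
  ids = []
  reservation = ec2.run_instances(
    image_id: image_id, instance_type: instance_type,
    min_count: count, max_count: count
  )
  ids = reservation.instances.map(&:instance_id)
  ec2.wait_until(:instance_running, instance_ids: ids)

  # Optionally give the first instance a fixed public address.
  ec2.associate_address(instance_id: ids.first, allocation_id: allocation_id) if allocation_id

  yield ids
ensure
  # Always terminate, even if the test run raised, so nothing keeps billing.
  ec2.terminate_instances(instance_ids: ids) unless ids.empty?
end

# Stand-in client for a dry run without AWS credentials (not part of any SDK).
class FakeEC2
  Instance = Struct.new(:instance_id)
  Reservation = Struct.new(:instances)
  attr_reader :calls

  def initialize
    @calls = []
  end

  def run_instances(min_count:, **)
    @calls << :run_instances
    Reservation.new(Array.new(min_count) { |i| Instance.new("i-fake#{i}") })
  end

  def wait_until(*, **)
    @calls << :wait_until
  end

  def associate_address(**)
    @calls << :associate_address
  end

  def terminate_instances(instance_ids:)
    @calls << :terminate_instances
    instance_ids
  end
end

fake = FakeEC2.new
with_load_generators(fake, image_id: 'ami-placeholder', count: 2) do |ids|
  puts "running test on #{ids.inspect}"
end
puts fake.calls.inspect
```

Putting the termination in `ensure` ties the cost of a run to its duration: instances exist only while the block executes, which matches the "bring up when needed, bring down afterwards" approach described earlier.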
Key takeaways
- Peaceful sleep - there is no way around load and resiliency testing!
- Automate - design the tools and infra properly
- Scale - generating more load is just a matter of adding more AWS instances
- Extensibility - adding a new endpoint type is quick
- Automated analysis and reporting - Sensu, atop, New Relic
- Vendor agnostic - AWS/Rackspace/GCE
- Cost optimization - use the SDK to launch instances dynamically, only when needed
Thank you
Stay connected:
Ajith Jose
BlueJeans Network
https://in.linkedin.com/in/ajithvj
Quilt - Distributed Load Simulation from AWS
