a
Amazon Elastic BeanStalk

AWS Well Architected Framework Metrics for Elastic Beanstalk – AllCode

AWS Well Architected Framework - Elastic Beanstalk Metrics

Which Metrics to Monitor for you Elastic Beanstalk Environment

We’ve seen a number of poorly configured Elastic Beanstalk environments. Most people believe that if your Elastic Beanstalk instances run into problems, then AWS will just spin up new instances. This is partially correct. Elastic Beanstalk enables you to configure scaling based upon a single metric, but what happens if the problem isn’t with that metric? Well, unfortunately, everything can go to hell.

Our recommendation is to add the following metrics to all of your AWS Elastic Beanstalk environments:

  1. EnvironmentHealth
  2. ApplicationRequestsTotal
  3. ApplicationRequests5xx
  4. ApplicationRequests4xx
  5. ApplicationLatencyP99

Yes, there’s cost associated with adding these 5 metrics to your AWS environment, but, in our opinion, the costs are well worth it.

ElasticBeanstalk_Metrics_Cost_Estimate.pdf

  1. EnvironmentHealth

The most basic Beanstalk metric is EnvironmentHealth, which is an enumeration containing the 7 Beanstalk health statuses as follows:

  • 0 – OK
  • 1 – Info
  • 5 – Unknown
  • 10 – No data
  • 15 – Warning
  • 20 – Degraded
  • 25 – Severe

Recommendation: Monitor and create an alarm for anything >= 15 (Warning)

The Cloud Watch Alarm will be specified in the CloudWatch > Alarms section. Typically, the Alarm will be configured on the Metric name: EnvironmentHealth for the appropriate environment. The statistic will be set to average over a 5 minute period. We’ll configure the conditions to be Static, Greater than 15.

Well Architected Framework Elastic Beanstalk Metric Alarm for EnvironmentHealth
Well Architected Framework Elastic Beanstalk Metric for EnvironmentHealth

Next, when the Alarm state trigger is in Alarm, we’ll select an existing SNS Topic to post to. We’ll also specify an Email endpoint to receive the notification.

Well Architected Framework Elastic Beanstalk Alarm for EnvironmentHealth

In most Elastic Beanstalk environments, we see see multiple degradations within a two week period as seen below

Well Architected Framework Elastic Beanstalk EnvironmentHealth Chart
Well Architected Framework Elastic Beanstalk EnvironmentHealth Chart
  1. ApplicationRequestsTotal
  2. ApplicationRequests5xx
  3. ApplicationRequests4xx

Beanstalk measures metrics about the number of requests your application is receiving, as well as the status codes of the responses. Monitoring the total number of requests can help you pinpoint surges in traffic, while monitoring 5xx and 4xx responses is good error detection.This metric is measured for both environments and instances, and we want to monitor the environment version.

Recommendation: Monitor and create an alarms on ApplicationRequestsTotal, ApplicationRequests5xx, and ApplicationRequests4xx on the Sum statistic for each metric.

These sums will be different per environment. For example, for some applications, 2,500 requests within a 5 minute period will be a lot of requests triggering an alarm

Well Architected Framework Elastic Beanstalk EnvironmentHealth Chart
  1. ApplicationLatencyP99

ApplicationLatencyP99 measures 99th percentile for application latency, and can be useful to detect when your application’s performance is suffering. When latency increases, it often means that other issues are imminent. This metric is measured for both environments and instances, and we want to monitor the environment version.

Recommendation: Monitor ApplicationLatencyP99, create a CloudWatch alarm on the Average statistic.

Well Architected Framework Elastic Beanstalk Application Latency Chart
Well Architected Framework Elastic Beanstalk Application Latency Chart

We typically will deploy these 5 metrics to each environment. When these metrics go into an alarm state, we’ll have the alarm invoke the appropriate SNS topic.

Instance Metrics

While monitoring instance metrics can yield some very helpful alerts, if there is any sort of autoscaling or instance rotation in the environment, creating alarms will be ineffective. Any instance you create an alarm on will eventually be removed and replaced — but your alarm won’t!

Joel Garcia
Joel Garcia

Joel Garcia has been building AllCode since 2015. He’s an innovative, hands-on executive with a proven record of designing, developing, and operating Software-as-a-Service (SaaS), mobile, and desktop solutions. Joel has expertise in HealthTech, VoIP, and cloud-based solutions. Joel has experience scaling multiple start-ups for successful exits to IMS Health and Golden Gate Capital, as well as working at mature, industry-leading software companies. He’s held executive engineering positions in San Francisco at TidalWave, LittleCast, Self Health Network, LiveVox acquired by Golden Gate Capital, and Med-Vantage acquired by IMS Health.

Related Articles

3 Ways Gen AI and AWS can Enhance Your Business

3 Ways Gen AI and AWS can Enhance Your Business

Amazon is on the cutting edge of new technologies. They have been increasingly experimenting with AI and learning algorithms, culminating in their most recent breakthroughs in Generative AI. Developers and technology enthusiasts have access to their innovations through the tools available on AWS.

Business Owner’s Guide to DevOps Essentials

Business Owner’s Guide to DevOps Essentials

As a business owner, it’s essential to maximize workplace efficiency. DevOps is a methodology that unites various departments to achieve business goals swiftly. Maintaining a DevOps loop is essential for the health and upkeep of deployed applications.

AWS Graviton and Arm-architecture Processors

AWS Graviton and Arm-architecture Processors

AWS launched its new batch of Arm-based processors in 2018 with AWS Graviton. It is a series of server processors designed for Amazon EC2 virtual machines. The EC2 AI instances support web servers, caching fleets, distributed data centers, and containerized microservices. Arm architecture is gradually being rolled out to handle enterprise-grade utilities at scale. Graviton instances are popular for handling intense workloads in the cloud.