Auto Fix Ec2 Instance Status checks failed using CloudFormation

3 min readSep 28, 2022

I recently got an email from one of my customers saying that their Ec2 instances status checks are failing and need an automated solution to fix the issue. Sometimes instance checks fail in the middle of the night, so he needs to wake up and reboot the instance manually to resolve the issue. Due to this issue, the customer is looking more for an automated solution instead of fixing the issue manually.

AWS monitors the health of the Ec2 instance using instance and system status checks. If a status check fails, the Ec2 instance becomes unreachable. Instance status checks failure may happen due to Network configurations issue, memory issue, and boot-up issue. Below CloudFormation stack helps to reboot the instance using CloudWatch Alarm when it fails the status checks.

What is in CloudFormation Stack:

Instance Auto recovery Alarm resource

AWS CloudWatch Alarm will be created with specific thresholds to reboot the instance when status checks fail.

2. Parameter section

The Parameter section auto-populates instance IDs in the region to create a recovery alarm in the resource section.

Ec2StatusChecksRecovery CloudFormation Template:

https://github.com/praveenv4/Task/blob/main/Cloudformation/Ec2StatusChecksRecovery.yaml

Create Ec2StatusChecksRecovery CloudFormation Stack:

Log in to the AWS account and go to Cloudformation service
Choose Create Stack and click on “with new resources (standard).”

3. In this example, our template is ready, so select “Upload a template file” and click on “Choose file” then select the file from your local machine.

4. After choosing the template click on Next, provide a name like “Ec2StatusChecksRecovery” and click on drop-down, then select the Ec2 instance ID that you are having the issue with.