
Prepare and protect your applications from disruptions
Vendor
Amazon Web Services (AWS)
Company Website
Prepare and protect your applications from disruptions
Benefits
Reduce outages
Continuously validate and track application resilience to reduce outages
Evaluate resilience targets
Evaluate resilience targets (Recovery Time Objective and Recovery Point Objective).
Proactive issue management
Identify and resolve issues before they occur in production.
Optimize continuity
Optimize business continuity while reducing recovery costs.
How it works
AWS Resilience Hub is a central location in the AWS Console for you to manage and improve the resilience posture of your applications on AWS. AWS Resilience Hub enables you to define your resilience goals, assess your resilience posture against those goals, and implement recommendations for improvement based on the AWS Well-Architected Framework. Within AWS Resilience Hub, you can also create and run AWS Fault Injection Service (AWS FIS) experiments, which mimic real-life disruptions to your application to help you better understand dependencies and uncover potential weaknesses. AWS Resilience Hub provides you with the services and tooling you need to continuously strengthen your resilience posture, all in a single place.
Features
Describe
Describe your applications as resource collections, such as CloudFormation stacks, Terraform state files, myApplications, or resource groups, or define applications for Kubernetes workloads that are managed on Amazon EKS. Applications can also be described using both resource collections and Amazon EKS clusters.
Define
Define the resilience policies for your applications. These policies include RTO and RPO targets for applications, infrastructure, Availability Zone, and Region disruptions.
Assess
AWS Resilience Hub’s assessment uses best practices from the AWS Well-Architected Framework to analyze the components of an application and uncover potential resilience weaknesses. These can be caused by incomplete infrastructure setup, misconfigurations, or situations where additional configuration improvements are needed.
Recommend
AWS Resilience Hub provides actionable recommendations to improve resilience. The resilience assessment also generates code snippets that help you create recovery procedures as AWS Systems Manager documents for your applications, referred to as Standard Operating Procedures (SOPs). AWS Resilience Hub generates a list of recommended Amazon CloudWatch monitors and alarms to help the operator quickly identify any change to the application's resilience posture once deployed.
Validate
After the application and SOPs have been updated to incorporate recommendations from the resilience assessment, you can use AWS Resilience Hub to test and verify that your application can meet its resilience targets before releasing it into production. AWS Resilience Hub is integrated with AWS Fault Injection Simulator (FIS), a chaos engineering service, to provide fault-injection simulations of real-world failures to validate that the application recovers within the defined resilience targets. This can include network errors or too many open connections to a database. AWS Resilience Hub also provides APIs so you can integrate its resilience assessment and testing into your CI/CD pipelines for ongoing resilience validation. Integrating resilience validation into CI/CD pipelines helps ensure that changes to the application's underlying infrastructure do not compromise resilience.
Use cases
Uncover potential weaknesses
Uses fault-injection simulations of real-world failures to help validate the effectiveness of recovery standard operating procedures (SOP) and alarms.
Protect mission-critical applications
Provides actionable recommendations to improve resilience and helps you create recovery procedures.
Help meet contractual and regulatory requirements
Keeps an audit trail of events during planned and unplanned outages, helping meet compliance and regulatory requirements.