AWS Security, Cost, & Well-Architected Best Practices

AWS Knowledge Base

Cloud Security, Cost Optimization, Well-Architected Framework and More

The ultimate guide to AWS best practices and remediations for all cloud issues in Cost Optimization, Security, Reliability, Performance, and Operations and Fault Tolerance. Get step-by-step instructions on proven remediations and frameworks to keep your cloud running smoothly.

Cost Optimization

Optimize Your Cost by Scheduling Idle Resources

To reduce AWS monthly bill, there is another practice that a cloud admin can do to stop or terminate idle instances from the AWS account. There is a default way to find out whether EC2 instances that declare the instance is inactive or not. The CPU average is less than 2%, and the average network I/O has been less than 5MB since last week.

Learn More about Optimize Your Cost by Scheduling Idle Resources

Monitoring Unattached EBS Volumes to follow best practices of FinOps

What is Elastic Block Storage (EBS)

AWS EBS stands for Elastic block storage. EBS lets you store huge amounts of data of any kind i.e. Files system data, transactional Data, relational databases, etc. An EBS volume is like a hard drive attached to an EC2 instance. EBS provides high availability and durability, and is ideal for intensive applications.

Learn More about Monitoring Unattached EBS Volumes to follow best practices of FinOps

Optimize your cost by Deleting AWS Elastic IP Address Why is it important to monitor your Elastic IP resources.

What is Elastic IP address

Elastic IP (EIP) is an IP address one can reserve for their AWS account. Static IP addresses by nature are associated to a particular machine. However, to keep-up with the dynamic needs of public cloud, users generally use Elastic IP addresses.These IP addresses are called ‘Elastic’ as they can be reassigned or remapped to another instanceas an organization keeps launching and terminating resources.

Learn More about Optimize your cost by Deleting AWS Elastic IP Address

Unused Elastic IP Addresses: Cost Optimization: Risk level: Low Rule ID: EC2-003

Check for any unattached Elastic IP (EIP) addresses in your AWS account and release (remove) them in order to lower the cost of your monthly AWS bill.

This rule can help you with the following compliance standards:

This rule can help you work with the AWS Well-Architected Framework

Amazon Web Services enforce a small hourly charge if an Elastic IP (EIP) address within your account is not associated with a running EC2 instance or an Elastic Network Interface (ENI). nOps recommends releasing any unassociated EIPs that are no longer needed to reduce your AWS monthly costs.

Learn More about Unused Elastic IP Addresses

Low Traffic AWS EC2 instances: Cost Optimization : Risk level: Rule ID:EC2-004

Identify any Amazon EC2 instances that appear to be idle and stop or terminate them to help lower the cost of your monthly AWS bill. By default, an EC2 instance is considered ‘idle’ when meets the following criteria (to declare the instance ‘idle’ both conditions must be true):

The average CPU Utilisation has been less than 2% for the last 7 days.
The average Network I/O has been less than 5 MB for the last 7 days.

It is important that your EC2 instances are tagged with correct tags which provide visibility into their usage profile and help you decide whether it’s safe or not to stop or terminate these resources. For Example, knowing the role and the owner of an EC2 instance before you take the decision to stop/terminate it is very important and can avoid unwanted termination of actually used workloads.

This rule can help you with the following compliance standards:

APRA
MAS

This rule can also help you work with the AWS Well-Architected Framework.

Idle instances represent a good candidate to reduce your monthly AWS costs and avoid accumulating unnecessary EC2 usage charges.

Learn More about Low Traffic AWS EC2 instances

Unused AWS EBS volumes Category: Cost Optimization Risk level: Medium Rule ID: EBS-001Collapse

Unused AWS EBS volumes Category: Cost Optimization Risk level: Medium Rule ID: EBS-001

EBS (Elastic Block Storage) volumes are attached to EC2 Instances as storage devices. Unused (Unattached) EBS Volumes can keep accruing costs even when their associated EC2 instances are no longer running.

This rule checks whether there are unused EBS Volumes in your AWS account. nOps recommends you consider deleting non-used EBS volumes to reduce your monthly AWS bills.

This rule can help you with the following:

Compliance frameworks report

SOC 2 Readiness Report

AWS Well-Architected Lens

AWS Well-Architected Framework Lens

Underutilized (< 30% capacity on avg for last week) EC2 instances Category: Cost Optimization Risk level: High Rule ID: EC2-003Collapse Identify any Amaz

on EC2 instances that appear to be under-utilised and downsize (resize) them to help lower the cost of your monthly AWS bill. By default, an EC2 instance is considered under-utilised when matches the following criteria (to declare the instance under-utilised both conditions must be met):

The average CPU utilisation has been less than 30% for the last 7 days.
The average memory utilisation has been less than 30% for the last 7 days.

By default, AWS CloudWatch doesn’t capture an EC2 instance memory utilisation because the necessary metric cannot be implemented at the hypervisor level. In order to report the memory utilisation using CloudWatch you need to install an agent (script) on the instance and create a custom metric (let’s name it EC2MemoryUtilization) on the AWS CloudWatch dashboard. The instructions required for installing the monitoring agent depends on the Operating System used by the instance. Please refer to this URL for more details.

This rule can help you with the following compliance standards:

This rule can help you work with the AWS Well-Architected Framework

Downsizing under-utilised EC2 instances to meet the capacity needs at the lowest cost represents an efficient strategy to reduce your monthly AWS costs. For example, resizing a c5.xlarge instance provisioned in the US-East (N. Virginia) region to a c5.large-type instance due to CPU and memory underuse, you can roughly reduce your AWS costs by half.

Learn More about Underutilized EC2 instances

Idle RDS Instance Category: Cost Optimization Risk level: High Rule ID: RDS-003Collapse

Any RDS Instance that appear to be idle must be identified and deleted to lower your AWS Monthly Bill. nOps recommends that RDS instance is considered ‘idle’ when meets the following criteria (to declare the instance ‘idle’ both conditions must be true):

The average number of database connections have been 0 for the last 7 days.

The AWS CloudWatch metrics used to detect idle RDS instances are:

DatabaseConnections – the number of RDS database connections in use (Units: Count).

This rule can help you work with the AWS Well-Architected Framework

You must check for idle instances regularly and terminate them in order to avoid unnecessary charges in your AWS Monthly bill.

However, it is important to consider the following things:

Backup your RDS databases before termination because once these instances are deleted, all their automated backups (snapshots) will be permanently lost.
It is important to know the role and the owner of an AWS RDS instance before you take the decision to remove it from your account. Hence , we assume that RDS instances are properly tagged to provide you this information.
Ensure that the RDS instance your are terminating is not used in an Application Stack

Learn More about RDS-003 RDS instance idle

Review RDS Instance Size Category: Cost Optimization Risk level: High Rule ID: RDS-001Collapse

You must periodically review amazon RDS database instances to assess their utilization. Any discovered underutilized RDS Instance should be downsized (resized) to avoid undesirable costs.

We consider an RDS database instance as “underutilized” when it meets the following criteria:

The average CPU utilization has been less than 30% for the last 7 days.
The total number of ReadIOPS and WriteIOPS recorded per day for the previous7 days has been less than 100 on average.

The following AWS CloudWatch metrics can be used to detect underutilized RDS instances:

CPUUtilization – This performance metric reports CPU utilization percentage (%).
ReadIOPS and WriteIOPS – These metrics also check for utilization, reporting the average number of disk read and write I/O (Input/Output) operations per second (/sec)

nOps uses this rule in the AWS Well-Architected Framework Lens. It can also help you when checking workloads’ compliance in preparing the SOC 2 Readiness Report.

Downsizing underused RDS database instances can have a tremendous positive impact on your monthly AWS cost. For example, downgrading a db.m5.2xlarge RDS PostgreSQL database instance to a db.m5.large instance due to CPU and IOPS underuse allows you to save roughly 25% (as of September 2021).

Learn More about RDS-001 – Review RDS instance size

Underutilized (<30% read/write) DynamoDB tables Category: Cost Optimization Risk level: High Rule ID: DDB-001Collapse

The rule identifies all underutilized AWS DynamoDB tables. For optimal capacity and cost optimization, nOps recommends you consider lowering the read and write capacity mode. You may also consider switching it from Provisioned Mode to On-Demand Mode.

This rule can help you with the following

AWS Well-Architected Framework Review.
SOC 2 Readiness Report

By default, nOps considers a DynamoDB table as “underutilized” when the number of read/write capacity units (RCUs and WCUs) consumed is 30% lower than the number of provisioned read/write capacity units set for a table over a specified time period.

The following AWS CloudWatch metrics can be helpful to detect such underused DynamoDB Tables:

ProvisionedReadCapacityUnits – the number of provisioned read capacity units for a DynamoDB table (Units: Count).
ConsumedReadCapacityUnits – the number of read capacity units consumed over the specified time period (Units: Count).
ProvisionedWriteCapacityUnits – the number of provisioned write capacity units for a DynamoDB table (Units: Count).
ConsumedWriteCapacityUnits – the number of write capacity units consumed over the specified time period (Units: Count).

When you create a DynamoDB Table in Provisioned Mode, you are charged for the Provisioned Read/Write Capacity regardless of whether you consume them or not. However, when you create a DynamoDB table in On-Demand mode, you pay only for the capacity you use.

In Provisioned mode, you can also make use of AutoScaling feature where you can specify a minimum capacity and a maximum capacity. DynamoDB can then scale your Provisioned Capacity Units based on scaling configuration. This feature is discussed in a separate rule page.

Learn More about Underutilized (<30% read/write) DynamoDB tables

Disabled autoscaling DynamoDB tables Category: Cost Optimization Risk level: Medium Rule ID: DDB-002Collapse

This rule checks whether auto-scaling is enabled for AWS DynamoDB tables in your cloud environment.

Auto-scaling is enabled by default for DynamoDB tables. With the help of AWS CloudWatch, it dynamically adjusts the throughput (read and write) capacity of your provisioned DB tables to meet traffic demands.

For performance and cost optimization, nOps recommends you consider enabling Autoscaling on the DB tables.

This rule can help you work with the following:

AWS Well-Architected Framework.
SOC 2 Readiness Report

Learn More about Disabled autoscaling DynamoDB tables

Security

Encrypt EBS Volumes Category: Security Risk level: High Rule ID: EBS-001Collapse

When dealing with production data that is crucial to your business, it is highly recommended to implement encryption in order to protect it from attackers or unauthorised personnel. With Elastic Block Store encryption enabled, the data stored on the volume, the disk I/O and the snapshots created from the volume are all encrypted. The EBS encryption keys use AES-256 algorithm and are entirely managed and protected by the AWS key management infrastructure, through AWS Key Management Service (AWS KMS).

This rule can help you with the following compliance standards:

This rule can help you work with the AWS Well-Architected Framework

Learn How to Encrypt EBS Volumes

MFA for IAM users with Console Sign-in Category: Security Risk level: High Rule ID: IAM-001Collapse

Ensure that all users with AWS Console access have Multi-Factor Authentication (MFA) enabled in order to secure your AWS environment and adhere to IAM security best practices.

This rule can help you with the following compliance standards:

This rule can also help you work with the AWS Well-Architected Framework

Having MFA-protected IAM users is the best way to protect your AWS resources and services against attackers. An MFA device signature adds an extra layer of protection on top of your existing IAM user credentials (username and password), making your AWS account virtually impossible to penetrate without the MFA generated passcode.

Learn about MFA for IAM users with Console Sign-in

Root user without MFA Category: Security Risk level: High Rule ID: IAM-002Collapse

Ensure that Multi-Factor Authentication (MFA) is enabled for your root account in order to secure your AWS environment and adhere to IAM security best practices.

This rule can help you with the following compliance standards:

This rule can help you work with the AWS Well-Architected Framework

Having an MFA-protected root account is the best way to protect your AWS resources and services against attackers. An MFA device signature adds an extra layer of protection on top of your existing root credentials making your AWS root account virtually impossible to penetrate without the MFA generated passcode.

Learn about Root user without MFA

Unencrypted AWS S3 Buckets Category: Security Risk level: High Rule ID: S3-001Collapse

S3 Buckets should be encrypted to keep your stored data secure. nOps recommends you encrypt your AWS S3 Buckets to protect data at rest. This can be accomplished using AWS S3-managed keys (SSE-S3) or AWS KMS-managed keys (SSE-KMS)forServer-Side Encryption.

This rule can help you with the following:

Compliance frameworks

SOC 2 Readiness Report
HIPAA Readiness Report
CIS Readiness Report

AWS Well-Architected Lens

AWS Well-Architected Framework Lens
FTR Lens

AWS S3 default encryption setting directs AWS to automatically encrypt your S3 data as it is stored in S3 buckets to prevent unauthorized attackers from accessing it.

Learn about Unencrypted AWS S3 Buckets

Inactive IAM account keys detected Category: Security Risk level: Medium Rule ID: IAM-001Collapse

Identify and deactivate any unnecessary IAM access keys as a security best practice. AWS allows you to assign maximum two active access keys but it is recommended only during the key rotation process. nOps strongly recommends to deactivate the old key once the new one has been created so that only one access key remain active for a given IAM user.

This rule can help you with the following compliance standards:

This rule can help you work with the AWS Well-Architected Framework

Learn about Inactive IAM account keys detected

Unencrypted AWS RDS Instances Category: Security Risk level: High Rule ID: RDS-002Collapse

Ensure that your RDS database instances are encrypted to ensure encryption at rest data compliance. The RDS data encryption and decryption is handled transparently and does not require any additional action from you or your application.

This rule can help you with the following compliance standards:

This rule can help you work with the AWS Well-Architected Framework

When dealing with production databases that hold sensitive and critical data, it is highly recommended to implement encryption in order to protect your data from unauthorised access. When you enable RDS encryption, the data stored on the instance, the underlying storage, the automated backups, Read Replicas, and snapshots, all are encrypted. The RDS encryption keys implement AES-256 algorithm and are entirely managed and protected by the AWS key management infrastructure through AWS Key Management Service (AWS KMS).

Learn about Unencrypted AWS RDS Instances

Unused IAM Role Category: Security Risk level: Medium Rule ID: IAM-001Collapse

AWS Identity and Access Management (IAM) roles are essential to providing permissions to teams and applications using your provisioned AWS infrastructure. As time passes and needs change, some created roles might be left unused in your AWS account. An IAM role is considered unused if there has been no usage/activity for this role in the past 90 days. It is highly recommended to remove these unused roles from your AWS account to prevent unauthorized access.

This rule can help you with the following:

Compliance Frameworks

SOC 2 Readiness Report
HIPAA Readiness Report
CIS Readiness Report

AWS Well-Architected Lens

AWS Well-Architected Framework Lens
FTR Lens

To help you identify these unused roles, IAM now reports the last-used timestamp that represents when a role was last used to make an AWS request. You or your security team can use this information to identify, analyze, and then confidently remove unused roles. This helps you improve the security posture of your AWS environments. Additionally, by removing unused roles, you can simplify your monitoring and auditing efforts by focusing only on roles that are in use.

Learn about IAM-001 Unused IAM Role

AWS EC2 with public subnets with open ports Category: Security Risk level: High Rule ID: EC2-009Collapse

For instances provisioned in Public subnets, you must ensure that no inbound rules exist in any security group that allows unrestricted access (i.e., 0.0.0.0/0 or::/0) to TCP port 22.

To apply the concept of least privilege, traffic must be authorized from only known hosts, services needed IP addresses or other security groups.

Allowing unlimited access to an EC2 instance on port 22 allows an attacker to brute force their way into the system and potentially acquire access to the entire network. This can result in malicious activities such as hacking and man-in-the-middle (MITM) assaults.

Port 22 is used to establish an SSH connection to an EC2 instance and access a shell.

This rule can help you with the following:

Compliance Frameworks’ reports

SOC 2 Readiness Report
CIS Readiness Report

AWS Well-Architected Lens

AWS Well-Architected Framework Lens

Learn about AWS EC2 with public subnets with open ports

Active root access key and secret key Category: Security Risk level: High Rule ID: IAM-007Collapse

This rule checks for and lists AWS Accounts with root access key and secret key activated. nOps recommends that you use IAM roles and disable active keys when performing programmatic queries to keep your cloud environment safe and conform with the Well-Architected security best practices.

Learn about Active root access key and secret key

Detected weak password policy Category: Security Risk level: High Rule ID: IAM-008Collapse

This rule checks and lists all AWS accounts with a weak password policy. nOps strongly recommends you consider configuring a strong password policy for all your AWS accounts. The policy should contain essential specifications like minimum character length, expiration, etc.

This rule can help you with the following:

AWS Well-Architected Framework Lens

The AWS account root user password and IAM user access keys are not covered by the IAM password policy. If a password expires, the IAM user can no longer sign in to the AWS Management Console but still use their access keys.

Default AWS Password Policy

If an administrator does not configure a custom password policy, IAM user passwords must adhere to the AWS default password policy. The default password policy enforces the following conditions:

minimum of 8 characters and a maximum of 128 characters

minimum of three of the following character types: uppercase, lowercase, numbers, and ‘! @ # $ % & * () + – = [] | ” symbols

Must not be the same as your AWS account name or email address.

nOps Recommended Password Policy

nOps recommends that you must configure a custom password policy for IAM users with the following conditions :

Minimum Password Length: Specify a minimum character length for the passwords (6 – 128)
Password strength: You can select any of the following specifications below to define the strength of your IAM user passwords:
- Require at least one uppercase letter from the Latin alphabet (A–Z)
- Require at least one lowercase letter from the Latin alphabet (a–z)
- Require at least one number
- Require at least one nonalphanumeric character ! @ # $ % ^ & * ( ) _ + - = [ ] { } | '
Enable password expiration: The user’s password expires after specific days (e.g., 90 days), and a user must set a new password to access AWS Management Console.
Password expiration requires administrator reset: Prevent IAM users from updating their passwords after the password expires.
Allow users to change their own password: You can allow all IAM users in your account to change their passwords via the IAM console.
Prevent password reuse: prevent IAM users from reusing a specified number of previous passwords.

Learn about Detected weak password policy

Disabled CloudTrail File Integrity Checks Category: Security Risk level: Medium Rule ID: CT-001Collapse

Validated AWS CloudTrail log files are essential in security and forensic investigations. This rule checks and lists AWS accounts that don’t have the CloudTrail bucket protected from deletion or overwrite.

When you activate the log file integrity validation option, CloudTrail will generate a hash using industry-standard algorithms for each log file that it delivers to your specified S3 bucket.

This rule can help you with the following:

AWS Well-Architected Framework Lens
AWS Foundational Technical Review (FTR) Lens

Learn about Disabled CloudTrail File Integrity Checks

AWS CloudTrail Event log disabled Category: Security Risk level: High Rule ID: CT-002Collapse

To boost your API security and governance posture, you must consider enabling AWS CloudTrail Event Log for all AWS regions.

This rule checks for and lists AWS Accounts that don’t have AWS CloudTrail Event log enabled.

CloudTrail is enabled by default when you establish an AWS account. CloudTrail events are produced anytime an AWS account event occurs. In the CloudTrail console, click Event history to see the previous 90 days’ occurrences.

However, if you want to manage ongoing events efficiently, you should create a trail, which is just a configuration that permits events to be sent to a specified S3 bucket.

A CloudTrail might be regional or global. Regional trails exclusively record occurrences from a specified region, whereas global trails, which are recommended, record events from all regions.

Learn about AWS CloudTrail Event log disabled

Disabled AWS GuardDuty Accounts Category: Security Risk level: Medium Rule ID: GD-001Collapse

This rule ensures that AWS GuardDuty Service is enabled for your AWS Accounts.

Amazon GuardDuty is an intelligent threat detection service that continuously monitors your provisioned AWS workloads for malicious activities like API requests from harmful IP addresses and unauthorized data S3 access.

It also provides comprehensive security insights for visibility and remediation. To identify and prioritize potential threats, GuardDuty leverages various techniques, like machine learning (ML), anomaly detection, and integrated threat intelligence. GuardDuty can analyze tens of billions of events curated from AWS CloudTrail event logs, Amazon Virtual Private Cloud (VPC) flow logs, and DNS query logs, among many other data sources.

This rule can help you with the following:

AWS Well-Architected Framework Lens

Learn about Disabled AWS GuardDuty Accounts

Disabled AWS Config for Regions Category: Security Risk level: Medium Rule ID: CONFIG-001Collapse

This rule checks whether AWS config is enabled in your AWS account.

AWS Config is a service that allows you to inspect, audit, and review your AWS resource configurations. Config monitors and records all AWS resource configurations in real-time, enabling you to match recorded configurations against desired configurations seamlessly.

AWS Config also helps to analyze changes in AWS resource configurations, dig into particular resource configuration histories, and evaluate compliance with the configuration defined in your internal policies.

nOps recommends you consider enabling AWS Config for better security.

This rule can help you with the following:

AWS Well-Architected Framework Lens

Learn about Disabled AWS Config for Regions

Detected usage of root account Category: Security Risk level: High Rule ID: IAM-006Collapse

Root user credentials provide unrestricted access to all AWS resources, including billing details, the root user password, and the power to alter account settings and terminate the account. You must never use AWS root user credentials for your routine operations, including administrative ones. Instead, adhere to the best practice of using the root user only to create your first IAM user. You should use root accounts to perform only a few account and service management tasks as specified here.

nOps suggests enforcing the least privilege principle by defining IAM users/roles and restricting them to only the actions they need to do their tasks.

This rule can help you with the following:

Compliance Frameworks

SOC 2 Readiness Report
HIPAA Readiness Report
CIS Readiness Report

AWS Well-Architected Lens

AWS Well-Architected Framework Lens
FTR Lens

Learn about Detected usage of root account

AWS RDS with public subnets with open ports Category: Security Risk level: High Rule ID: RDS-007Collapse

You should provision AWS RDS instances in private subnets to shield them from direct internet traffic. However, suppose you must deploy an RDS instance on public subnets for any reason. In that case, you must verify that no inbound rules exist in any security group that permits unfettered access (i.e., 0.0.0.0/0 or::/0) (particularly on the TCP/IP port that your Database listens on).

The table below lists the default endpoint ports for each RDS database engine:

Database Engine	Default Port
Aurora/MySQL/MariaDB	3306
PostgreSQL	5432
SQL Server	1433
Oracle	1521

To use the least privilege principle, only known hosts, services, IP addresses, or security groups should be permitted. Unrestricted access to an RDS instance allows malicious attackers to brute force their way in and potentially get network access. This can lead to harmful activities like hacking and man-in-the-middle (MITM) attacks.

This rule can help you with the following:

Compliance Frameworks’ reports

SOC 2 Readiness Report
HIPAA Readiness Report
CIS Readiness Report

AWS Well-Architected Lens

AWS Well-Architected Framework Lens
AWS MSP Partner Program Validation Audit Checklist

Learn about AWS RDS with public subnets with open ports

Reliability

DynamoDB Continuous Backup Category: Reliability Risk level: Medium Rule ID: DynamoDB-001Collapse

Point-In-Time-Recovery (PITR) is an automatic continuous backup that lets you restore your DynamoDB table and secondary indexes, global and local, to any point in time during the past 35 days. This setting does not interfere with on-demand backups but instead acts as an additional defence layer.

This rule can help you with the following compliance standards:

NIST 800-53 (Rev. 4)
This rule can help you work with the AWS Well-Architected Framework

Learn How to Backup Continuously With DynamoDB

Enable Multi-AZ for RDS Instances Category: Reliability Risk level: Medium Rule ID: RDS-01Collapse

Enable Multi-AZ deployment configurations on your RDS Instances for high availability and automatic failover support , fully managed by AWS.

Amazon RDS Multi-AZ deployments provide enhanced availability for databases within a single region. In the event of a planned or unplanned outage of your DB instance, Amazon RDS automatically switches to a standby replica in another Availability Zone if you have enabled Multi-AZ.

This rule can help you with the following compliance standards:

NIST 800-53 (Rev. 4)

Learn How to Enable Multi-AZ for RDS Instances

Autoscaling Not Enabled on EC2 Instance Category: Reliability Risk level: Medium Rule ID: EC2-01Collapse

Amazon EC2 Auto Scaling helps you maintain application availability and allows you to automatically add or remove EC2 instances according to conditions you define. You can use the fleet management features of EC2 Auto Scaling to maintain the health and availability of your fleet. You can also use the dynamic and predictive scaling features of EC2 Auto Scaling to add or remove EC2 instances. Dynamic scaling responds to changing demand and predictive scaling automatically schedules the right number of EC2 instances based on predicted demand. Dynamic scaling and predictive scaling can be used together to scale faster.

This rule can help you with the following compliance standards which aligns with AWS Well-Architected Framework:

MAS
NIST 800-53 (Rev. 4)

Learn How to Enable Autoscaling for EC2 Instances

Enable Point-in-Time Recovery for RDS Instance Category: Reliability Risk level: High Rule ID: RDS-02Collapse

The automated backup feature of Amazon RDS enables point-in-time recovery of your DB instance. When automated backups are turned on for your DB Instance, Amazon RDS automatically performs a full daily snapshot of your data (during your preferred backup window) and captures transaction logs (as updates to your DB Instance are made). When you initiate a point-in-time recovery, transaction logs are applied to the most appropriate daily backup in order to restore your DB instance to the specific time you requested.

This rule can help you with the following compliance standards:

- NIST 800-53 (Rev. 4)
- This rule can help you work with the AWS Well-Architected Framework.

Learn How to Enable Point-in-Time recovery for RDS

Missing Snapshots For EBS Volumes Category: Reliability Risk level: Medium Rule ID: EC2-002Collapse

An EBS snapshot is a point-in-time copy of your Amazon EBS volume, which is copied to Amazon Simple Storage Service (Amazon S3). EBS snapshots are incremental copies of data. This means that only unique blocks of EBS volume data that have changed since the last EBS snapshot are stored in the next EBS snapshot.

Creating point-in-time EBS snapshots periodically will allow you to handle efficiently your data recovery process in the event of a failure, save your data before shutting down an EC2 instance, back up data for geographical expansion, and maintain your disaster recovery stack up to date.

This rule can help you with the following compliance standards:

NIST 800-53 (Rev. 4)
This rule can help you work with the AWS Well-Architected Framework

Learn More about Missing Snapshots For EBS Volumes

Disabled Multi-AZ ElastiCache Redis Instances Category: Reliability Risk level: Medium Rule ID: EC-001Collapse

nOps recommends that provisioned ElastiCache resources have a Multi-AZ deployment configuration to enhance High Availability (HA). This ensures that the service can automatically failover to a read replica when the primary cache node fails, for example, in case of planned maintenance, the unlikely event of a primary node, or Availability Zone failure.

ElastiCache will handle this failover transparently, and there is no need to create or provision a new primary node. You can resume writing to the new primary as soon as read replica promotion to the primary node is complete.

This rule can help you with:

AWS Well-Architected Lens

AWS Well-Architected Framework Lens

Please note that:

Redis Cache Multi-AZ with automatic failover does not support T1 and T2 cache node types or cache clusters with the Redis engine version earlier than 2.8.6

Redis Cache Multi-AZ with automatic failover is only available if the cluster has at least one read replica.

Learn More about Disabled Multi-AZ ElastiCache Redis instances

Disabled Bucket versioning Category: Reliability Risk level: Low Rule ID: S3-003Collapse

Make sure the versioning flag is enabled in your AWS S3 buckets so you can recover items if they are accidentally deleted or overwritten. After versioning is enabled for a bucket:

Amazon S3 inserts a delete marker on the object if you delete an object rather than removing it permanently. The delete marker then becomes the current version of the object.
If you overwrite an object, Amazon S3 will create a new version of the object in the bucket.
S3 versioning can also be used to archive objects to low-cost storage classes using lifecycle policies to save some costs.

This rule is used by the following::

AWS Well-Architected Framework Lens
FTR Lens

Learn More about S3-003 Disabled Bucket versioning

Operations and Fault Tolerance

Missing Tags for EBS Resources Category: Operations and Fault Tolerance Risk level: Low Rule ID: EBS-006Collapse

Ensure that user-defined tags (metadata) are being used for labelling, collecting and organising EBS resources available within your AWS environment. nClouds recommends that your resources must have some user-defined tags (and not just the default Key and Value) to follow best practices. We highly recommend the following tagging schema to help you identify and manage your resources:

Name: used to identify individual resources.
Role: used to describe the function of a specific resource (e.g. web tier, database tier).
Environment: used to distinguish between different stages (e.g. development, production).
Owner: used to identify the person / team responsible for the resource.

This rule can help you with the following compliance standards:

APRA
MAS

Naming (tagging) your AWS EBS volumes logically and consistently has several advantages such as providing additional information about the volume location and usage, promoting consistency within the selected environment, distinguishing fast similar resources from one another, avoiding naming collisions, improving clarity in cases of potential ambiguity and enhancing the aesthetic and professional appearance.

Learn More about Missing Tags for EBS Resources

Disabled AWS Enterprise Support Category: Operations and Fault Tolerance Risk level: Medium Rule ID: SUPP-001Collapse

Ensure that the appropriate level of AWS Support Plan is enabled for the productions accounts and critical workloads.

For example, if an AWS account is hosting production systems and critical workloads, it is highly recommended that your AWS Support Plan should be Business or Enterprise.

Amazon Web Services provides the following support plans:

- Basic – The plan is included for all AWS customers and includes the following:
  - 24×7 access to customer service, documentation, white papers, and support forums.
  - Access to the 7 core Trusted Advisor checks and guidance to provision your resources following best practices to increase performance and improve security.
  - A personalised view of the health of AWS services, and alerts when your resources are impacted.

- Developer – This plan is recommended for customers that are experimenting or testing in AWS. This plan includes the following additional features on top of basic plan:
  - You get enhanced Technical Support to quickly get started with AWS services and resources. You have email access to Cloud Support Associates (Business hours** ) and can raise unlimited support cases with 1 primary contact.
  - You have access to general architectural guidance as well from AWS.

- Business– This plan is recommended and suitable for most of the production workloads in AWS. This plan includes the following additional features on top of the Developer Support Plan:
  - Full set of Trusted Advisor Checks
  - You can raise unlimited cases with unlimited contacts
  - You will get fast support response times on your Production System Impaired/Down cases. i.e. less than 4 hours for impaired production systems and less than 1 hour for production systems that are experience downtime
  - You can raise support cases programatically via access to AWS Support API
  - You will get support and troubleshooting support for 3rd Party Softwares too.

Enterprise – This plan is recommended for business and/or mission critical workloads in AWS. If you are an enterprise businesses that are running mission critical workloads on AWS and require high-touch proactive/preventive support, then this plan is for you. This plan includes the following additional features on top of the Business Support Plan:
- Faster response times for your Business-critical system. i.e. less than 15 minutes for a business critical system down.
- Consultative review and guidance based on your applications
- Designated Technical Account Manager (TAM) to proactively monitor your environment and assist with optimisation and coordinate access to programs and AWS experts.

You can find up-to-date information and pricing on these AWS Support Plans here.

The purpose of this nOps rule is to validate the support plan required for your AWS account/environment.

This rule can help you with the following compliance standards:

NIST 800-53 (Rev. 4)

Learn More about Disabled AWS Enterprise Support

Out of date AMIs Category: Operations and Fault Tolerance Risk level: High Rule ID: EC2-008Collapse

This rule checks for AMIs that are older than six months old. An Amazon Machine Image (AMI) contains all the necessary information needed to launch instances in AWS. When you launch an instance, it is mandatory to specify an AMI. If you require the identical configuration for all of your instances, you may launch them from the same AMI. To ensure that your instances are up-to-date with the latest security patches, software versions, you must regularly update your AMIs.

Any AMI older than 180 days is considered obsolete and is missing important patches and security updates required for reliable operations.

This rule can help you with the following:

AWS Well-Architected Framework Lens

Learn More about Out of date AMIs

Performance

Unused AWS ELB Resources Category: Performance Risk level: Low Rule ID: ELBv2-001Collapse

Find any unused Amazon Application Load Balancers (ALBs) and Network Load Balancers (NLBs) and remove them from your account in order to help lower the cost of your monthly AWS bill.

An AWS ELBv2 load balancer is considered “unused” when the associated target group has no EC2 target instance registered or when the registered target instances are not healthy anymore.

This rule can help you work with the AWS Well-Architected Framework.

You are charged for each hour or partial hour that an AWS ELBv2 load balancer is running, regardless whether you are using the resource or not. Removing unused AWS resources like an Application Load Balancer (ALB) or a Network Load Balancer (NLB) will help you avoid unexpected charges on your AWS bill.

Learn More about Unused AWS ELB Resources

Under-utilized Redshift Cluster Nodes Category: Performance Risk level: High Rule ID: RS-001Collapse

Identify any Amazon Redshift clusters that appear to be under-utilised . Either downsize them or reduce the number of nodes to help lower the cost of your monthly AWS bill.

By default, an AWS Redshift cluster is considered under-utilised when matches the following criteria:

The average CPU utilization has been less than 60% for the last 30 days.
The total number of ReadIOPS and WriteIOPS registered per day for the last 30 days has been less than 100 on average.

The AWS CloudWatch metrics utilized to detect underused Redshift clusters are:

CPUUtilization – the percentage of CPU utilization (Units: Percent).
ReadIOPS – The average number of disk read operations per second. (Units: Count/Second)
WriteIOPS – The average number of disk write operations per second. (Units: Count/Second)

You can change the default threshold values for this rule on the nOps console and set your own values for CPU utilization, the total number of ReadIOPS and WriteIOPS to configure the underuse level for your Redshift clusters.

This rule can help you work with the AWS Well-Architected Framework

Learn More about Under-utilized Redshift Cluster Nodes

Unused NAT Resources Category: Performance Risk level: Low Rule ID: VPC-002Collapse

Identify and remove any unused NAT Gateways to adhere to best practices and avoid unnecessary costs. NAT gateways are used to connect a private instance with outside networks. When a NAT gateway is provisioned, AWS charges you based on the number of hours it was available and the data (GB) it processes.

This rule can help you with the following:

Compliance Frameworks

SOC 2 Readiness Report

AWS Well-Architected Lens

AWS Well-Architected Framework Lens
FTR Lens

CloudWatch, AWS monitoring service can be used monitor your NAT gateway via information it collects from the specified NAT gateway. This information is collected and presented in readable metrics at 1 minute intervals and are stored for 15 months. nOps uses one such metric to determine if a NAT Gateway is considered unused or not. This metric is BytesOutToDestination which is The number of bytes sent out through the NAT gateway to the destination.

A NAT gateway is considered unused if the value of BytesOutToDestination is 0 for the last 7 days.

Learn More about Unused NAT Resources

AWS Knowledge Base

Cloud Security, Cost Optimization, Well-Architected Framework and More

Cost Optimization

Optimize Your Cost by Scheduling Idle Resources

What is Elastic Block Storage (EBS)

What is Elastic IP address

Security

Default AWS Password Policy

nOps Recommended Password Policy

Reliability

Operations and Fault Tolerance

Performance

Products

Solutions

Resources

Company

Documentation

Solutions

Platform

Resources

Documentation

Company

The Best Tools By Category

Cloud Cost Guides

Karpenter

Commitment Management

Request Invitation