Concepts
Storage in AWS is a crucial component for any architect planning to design and deploy systems in the cloud. AWS offers a range of storage services tailored for different needs, which include:
Amazon S3
Amazon Simple Storage Service (Amazon S3) is an object storage service that offers scalability, data availability, security, and performance. It’s used for a variety of storage solutions including backup and restore, archive, enterprise applications, IoT devices, and big data analytics.
Characteristics:
- Durability: S3 promises 99.999999999% (11 9’s) of durability over a given year. It achieves this by automatically replicating data across multiple devices in multiple facilities within a region.
- Replication: S3 provides different replication options such as S3 Standard, S3 Intelligent-Tiering, S3 Glacier, and S3 Glacier Deep Archive. For cross-region replication, S3 also supports CRR (Cross-Region Replication).
Amazon EBS
Amazon Elastic Block Store (Amazon EBS) provides persistent block storage volumes for use with Amazon EC2 instances. EBS volumes are network-attached and remain independent from the life of an instance.
Characteristics:
- Durability: EBS volumes offer a high level of durability and are designed to protect against failures by replicating within the same Availability Zone (AZ).
- Replication: EBS does not automatically replicate data across regions or AZs, but you can take snapshots of your volumes and replicate those across other regions manually.
Amazon EFS
Amazon Elastic File System (Amazon EFS) offers file storage for use with Amazon EC2. You can also access EFS from on-premises servers via AWS Direct Connect or AWS VPN.
Characteristics:
- Durability: It is designed to be highly durable and provides a file storage infrastructure for multiple EC2 instances.
- Replication: EFS automatically replicates data within and across multiple AZs.
Amazon FSx
Amazon FSx provides fully managed third-party file systems with the native compatibility and feature sets for workloads such as Microsoft Windows-based infrastructure, High-Performance Computing (HPC), machine learning, and electronic design automation.
Characteristics:
- Durability: Amazon FSx for Windows File Server and Amazon FSx for Lustre are designed to be highly durable and offer data replication within a single AZ or across multiple AZs, depending on the deployment option.
- Replication: FSx for Windows File Server provides options for Multi-AZ deployments that automatically replicate data synchronously across AZs.
Amazon RDS
Amazon Relational Database Service (Amazon RDS) makes it easy to set up, operate, and scale a relational database in the cloud. It provides cost-efficient and resizable capacity while managing time-consuming database administration tasks.
Characteristics:
- Durability: RDS is configured for automated backups and snapshots are stored durably in Amazon S3.
- Replication: RDS supports several replication features to enhance availability and reliability for your databases, including Multi-AZ deployments and Read Replicas.
Amazon DynamoDB
Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability.
Characteristics:
- Durability: DynamoDB stores data across multiple AZs within a region and automatically maintains the desired level of replication and consistency.
- Replication: With DynamoDB Global Tables, you can have a fully replicated database across multiple regions.
Comparison Table
Storage Service | Durability | Replication Options | Use Case |
---|---|---|---|
Amazon S3 | 99.999999999% | CRR, S3 Standard, Intelligent-Tiering, S3 Glacier | Backup, Archive, Data lakes |
Amazon EBS | High | Snapshots (manual cross-region replication) | EC2 Block Storage |
Amazon EFS | High | Automatic across AZs | Shared file system for EC2 |
Amazon FSx | High | Multi-AZ deployment for FSx for Windows File Server | Managed third-party file systems |
Amazon RDS | High | Multi-AZ, Read Replicas | Managed relational database |
Amazon DynamoDB | High | DynamoDB Global Tables | Managed NoSQL database |
Each storage option on AWS has its characteristics that should be evaluated based on the specific requirements of workload durability, cost, performance, and the level of management required. When preparing for the AWS Certified Solutions Architect – Associate exam, it’s important to understand these characteristics so that you can select the most appropriate storage option for your architecture.
Answer the Questions in Comment Section
T/F: Amazon Elastic Block Store (EBS) volumes are automatically replicated within their Availability Zone to prevent data loss due to hardware failures.
- Answer: True
Explanation: Amazon EBS volumes are indeed automatically replicated within the same Availability Zone to provide high durability.
T/F: Amazon S3 Standard-Infrequent Access (S3 Standard-IA) has a lower durability than Amazon S3 Standard.
- Answer: False
Explanation: Amazon S3 Standard and S3 Standard-IA offer the same durability of 999999999% (11 9’s), but S3 Standard-IA is designed for less frequently accessed data with a lower storage price and a higher retrieval fee.
What is the durability of Amazon S3?
- A) 99%
- B) 999999999%
- C) 9999%
- D) 100%
Answer: B) 999999999%
Explanation: Amazon S3 offers 999999999% (11 9’s) of durability for the objects stored within the service.
T/F: Amazon Glacier is a suitable storage option for frequently accessed data that requires millisecond access times.
- Answer: False
Explanation: Amazon Glacier is designed for long-term archival storage with retrieval times ranging from minutes to hours, not suitable for frequently accessed data.
Which AWS storage option offers provisioned IOPS?
- A) Amazon S3
- B) Amazon Glacier
- C) Amazon EFS
- D) Amazon EBS Provisioned IOPS (io2 & io1)
Answer: D) Amazon EBS Provisioned IOPS (io2 & io1)
Explanation: Amazon EBS offers Provisioned IOPS (io2 & io1) volumes that are designed to meet the needs of I/O-intensive workloads, particularly database workloads, that are sensitive to storage performance and consistency.
T/F: Amazon RDS Automated Backups are stored in Amazon S3 with the same durability as standard S3 objects.
- Answer: True
Explanation: Amazon RDS Automated Backups are indeed stored in Amazon S3 and inherit the same high durability of standard S3 objects.
Which AWS storage service primarily serves as a file system which multiple EC2 instances can access simultaneously?
- A) Amazon S3
- B) Amazon EBS
- C) Amazon EFS
- D) Amazon Glacier
Answer: C) Amazon EFS
Explanation: Amazon EFS is designed to provide a simple, scalable, fully managed elastic NFS file system for use with AWS Cloud services and on-premises resources.
T/F: Data stored in S3 One Zone-Infrequent Access (S3 One Zone-IA) is replicated across multiple Availability Zones.
- Answer: False
Explanation: Unlike S3 Standard and S3 Standard-IA, S3 One Zone-IA stores data in a single Availability Zone, which makes it less expensive but also means that data is not replicated across multiple zones and is therefore at a higher risk if the single zone is compromised.
Which of the following options is a feature of Amazon S3 but not Amazon EBS?
- A) Snapshots
- B) Static website hosting
- C) Volume encryption
- D) Provisioned IOPS
Answer: B) Static website hosting
Explanation: Static website hosting is a feature specific to Amazon S3, allowing users to host static websites without the need for a server.
T/F: Amazon EBS volumes can be attached to any Amazon EC2 instance within the same region.
- Answer: False
Explanation: Amazon EBS volumes can only be attached to instances within the same Availability Zone.
How often does Amazon S3 automatically check the integrity of stored data using checksums?
- A) Continuously
- B) Daily
- C) Weekly
- D) Monthly
Answer: A) Continuously
Explanation: Amazon S3 continuously uses checksums to monitor data integrity and promptly repairs any lost data.
T/F: Amazon S3 Intelligent-Tiering moves objects between two access tiers based on changing access patterns.
- Answer: False
Explanation: Amazon S3 Intelligent-Tiering moves objects between multiple access tiers, not just two, based on changing access patterns to save costs without performance impact or operational overhead.
Great post! The section on S3 durability was particularly informative.
Thanks for the detailed explanation on Glacier. It really helped me understand its use case.
Can someone elaborate on the difference between S3 Standard and S3 Intelligent-Tiering in terms of cost and performance?
The replication options for RDS were a bit confusing. Could someone clarify?
Appreciate the in-depth look at EFS and its provisioned throughput!
What are the main considerations when choosing between EBS and EFS for persistent storage?
Great blog post! Really helped clarify a lot of my doubts about AWS storage options.
Could someone explain DynamoDB’s durability and replication in brief?