Last updated Jul 9, 2023 Edit Source

# Simple Storage Service (S3)

#aws #s3 #storage #serverless #cloud

Object storage solution provided by AWS. Serverless storage in the cloud. Do not have to worry about the underlying infrastructure

Object storage is a a simple data storage architecture that manages data as objects, as opposed to other storage architectures which manages data as files and file hierarchy (file systems) and block storage which manages data as blocks within sectors and tracks.

S3 provides unlimited storage.

# Objects

Objects contains data and acts similar to files. they contain:

Key (name)
Value (data itself)
Version ID (if enabled the version)
Metadata (additional information i.e. file-type)

# Buckets

Bucket holds #Objects Buckets can have folders to organise Objects. These should be unique universally similar to domain names. (i.e. there cannot be bucket with same name even in cross-account scenario)

# Storage Classes

Pasted image 20220713123659

# S3-Standard (Default)

The default storage class. If you don’t specify the storage class when you upload an object, Amazon S3 assigns the S3 Standard storage class.

Fast (Low latency and high throughput performance.)
99.99% Availability
11 9’s Durability (Resilient against events that impact an entire Availability Zone.)
Replicated across at least 3
Backed with the Amazon S3 Service Level Agreement for availability.
Supports SSL for data in transit and encryption of data at rest.
S3 Life-cycle management for automatic migration of objects to other S3 Storage classes

# S3-Intelligent Tiering

Uses ML/AI to analyse object usage and automate cost savings by moving objects between four access tiers when accessing patterns change. data is moved to most suitable tier, without any performance impact or added overhead.

Automatically optimises the storage costs for data with changing access patterns.
Stores objects in four access tiers, optimised for frequent, infrequent, archive, and deep archive access.
Frequent and In-frequent Access tiers have the same low latency and high throughput performance as S3 Standard.
Designed for durability of 99.999999999% of objects across multiple Availability zones.
Designed for 99.9% availability over a given year.
Backed with the Amazon S3 Service Level Agreement for availability.
Small monthly monitoring and auto-tiering free.

# S3-Standard Infrequently Accessed (IA)

# S3-Onezone IA

1 AZ only
Avilability is decreased to 99.5%

# S3-Glacier

# S3-Glacier-Deep Archive

Slowest storage class ()
Cheapest tier here

# S3 Security

All new buckets are PRIVATE by default.
Logging per request can be turned on to see the access log of the users
Access Control is configured using Bucket Policies and Access Control Lists (ACL)
ACL are Leagacy feature of controlling access to buckets and objects
Bucket Policies are newer compared to ACL’s and use JSON format policies

# Encryption

# S3 Encryption at transit

SSL/TLS is enabled at transit by default

# S3 Encryption at rest

there are three types of S3 SSE and also supports Client Side Encryption

# Server Side Encryption (SSE)

SSE-AES S3 handles the key and uses AES-256 algorithm
SSE-KMS Envelope the encryption using two passes. Both Amazon KMS and the client manages the keys
SSE-C Similar to KMS but the customer is responsible for managing the key in the aws.

# Client Side Encryption (CSE)

we the clients are responsible for the encryption of the files being stored in the S3. i.e. the file will be encrypted locally even before being transferred to the s3

# Data Consistency

Read After Write Consistency When you upload a new object

Eventual Consistency when you overwrite or delete an object it takes time for S3 to replicate versions to multiple AZ’s. Fetching the updated object from S3 which was just updated might result in returning the old object instead of the newer copy.

# S3- Versioning

Stores all version of S3 Objects Once enabled it cannot be disabled, only suspended on the bucket Fully integrates with Rules MFA Delete feature provides extra protection against accidental deletions

# Replication

Requires Versioning enabled

# Cross Region Replication

When enabled, any object that is uploaded will be automatically replicated to another region/s. It provides higher durability and potential disaster recovery for objects. This requires versioning turned on on both Source and Destination Bucket. Cross account replication is also possible.

# Single Region Replication

When enabled, the objects will be replicated cross-account on the same region as shown above.

# S3 Life-cycle Management

Automated the process of moving objects to different Storage Classes or deleting. Sort of like a cronjob. Can be used together with versioning and can apply changes and commands to both current and previous versions.

# Transfer Acceleration

Fast and secure transfer of files over long distances between your end users and an S3 bucket. Utilises CloudFront’s Edge Locations Instead of uploading directly to the bucket it will use a distinct URL of an Edge Location to upload it there. When the data is uploaded in the Edge Location it is automatically routed to S3 over a optimised network path

Shrestha Rajat