AWS Whitepaper
AWS Whitepaper
Analytics services
• Amazon Athena
• Amazon CloudSearch
• Amazon DataZone
• Amazon EMR
• Amazon FinSpace
• Amazon Kinesis
• Amazon Data Firehose
• Amazon Managed Service for Apache Flink
• Amazon Kinesis Data Streams
• Amazon Kinesis Video Streams
• Amazon OpenSearch Service
• Amazon OpenSearch Serverless
• Amazon Redshift
• Amazon Redshift Serverless
Analytics 10
Overview of Amazon Web Services AWS Whitepaper
• Amazon QuickSight
• AWS Clean Rooms
• AWS Data Exchange
• AWS Data Pipeline
• AWS Entity Resolution
• AWS Glue
• AWS Lake Formation
• Amazon Managed Streaming for Apache Kafka (Amazon MSK)
Amazon Athena
Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3
using standard SQL. Athena is serverless, so there is no infrastructure to manage, and you pay only
for the queries that you run.
Athena is easy to use. Simply point to your data in Amazon S3, define the schema, and start
querying using standard SQL. Most results are delivered within seconds. With Athena, there’s no
need for complex extract, transform, and load (ETL) jobs to prepare your data for analysis. This
makes it easy for anyone with SQL skills to quickly analyze large-scale datasets.
Athena is out-of-the-box integrated with AWS Glue Data Catalog, allowing you to create a
unified metadata repository across various services, crawl data sources to discover schemas and
populate your Catalog with new and modified table and partition definitions, and maintain schema
versioning.
Amazon CloudSearch
Amazon CloudSearch is a managed service in the AWS Cloud that makes it simple and cost-
effective to set up, manage, and scale a search solution for your website or application.
Amazon CloudSearch supports 34 languages and popular search features such as highlighting,
autocomplete, and geospatial search.
Amazon DataZone
Amazon DataZone is a data management service that you can use to publish data and make
it available to the business data catalog through your personalized web application. You can
access your data more securely regardless of where it is stored—on AWS, on premises, or in
Amazon Athena 11
Overview of Amazon Web Services AWS Whitepaper
SaaS applications such as Salesforce. Amazon DataZone simplifies your experience across AWS
services such as Amazon Redshift, Amazon Athena, AWS Glue, AWS Lake Formation, and Amazon
QuickSight.
Amazon EMR
Amazon EMR is the industry-leading cloud big data platform for processing vast amounts of
data using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink,
Apache Hudi, and Presto. Amazon EMR makes it easy to set up, operate, and scale your big data
environments by automating time-consuming tasks such as provisioning capacity and tuning
clusters. With Amazon EMR, you can run petabyte-scale analysis at less than half of the cost of
traditional on-premises solutions and over 3x faster than standard Apache Spark. You can run
workloads on Amazon EC2 instances, on Amazon Elastic Kubernetes Service (Amazon EKS) clusters,
or on-premises using Amazon EMR on AWS Outposts.
Amazon FinSpace
Amazon FinSpace is a data management and analytics service purpose-built for the financial
services industry (FSI). FinSpace reduces the time you spend finding and preparing petabytes of
financial data to be ready for analysis from months to minutes.
Financial services organizations analyze data from internal data stores such as portfolio, actuarial,
and risk management systems as well as petabytes of data from third-party data feeds, such as
historical securities prices from stock exchanges. It can take months to find the right data, get
permissions to access the data in a compliant way, and prepare it for analysis.
FinSpace removes the heavy lifting of building and maintaining a data management system for
financial analytics. With FinSpace, you collect data and catalog it by relevant business concepts
such as asset class, risk classification, or geographic region. FinSpace makes it easy to discover and
share data across your organization in accordance with your compliance requirements. You define
your data access policies in one place and FinSpace enforces them while keeping audit logs to allow
for compliance and activity reporting. FinSpace also includes a library of 100+ functions, such as
time bars and Bollinger bands, for you to prepare data for analysis.
Amazon Kinesis
Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can
get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities
to cost-effectively process streaming data at any scale, along with the flexibility to choose the
Amazon EMR 12
Overview of Amazon Web Services AWS Whitepaper
tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-
time data such as video, audio, application logs, website clickstreams, and IoT telemetry data for
machine learning (ML), analytics, and other applications. Amazon Kinesis enables you to process
and analyze data as it arrives and respond instantly instead of having to wait until all your data is
collected before the processing can begin.
Amazon Kinesis currently offers four services: Firehose, Managed Service for Apache Flink, Kinesis
Data Streams, and Kinesis Video Streams.
You can easily create a Firehose delivery stream from the AWS Management Console, configure
it with a few clicks, and start sending data to the stream from hundreds of thousands of data
sources to be loaded continuously to AWS—all in just a few minutes. You can also configure your
delivery stream to automatically convert the incoming data to columnar formats such as Apache
Parquet and Apache ORC, before the data is delivered to Amazon S3, for cost-effective storage and
analytics.
Amazon Managed Service for Apache Flink takes care of everything required to run your queries
continuously and scales automatically to match the volume and throughput rate of your incoming
data.
Amazon Kinesis Data Streams is a massively scalable and durable real-time data streaming service.
Kinesis Data Streams can continuously capture gigabytes of data per second from hundreds of
thousands of sources such as website clickstreams, database event streams, financial transactions,
social media feeds, IT logs, and location-tracking events. The data collected is available in
milliseconds to enable real-time analytics use cases such as real-time dashboards, real-time
anomaly detection, dynamic pricing, and more.
Amazon Kinesis Video Streams makes it easy to securely stream video from connected devices
to AWS for analytics, ML, playback, and other processing. Kinesis Video Streams automatically
provisions and elastically scales all the infrastructure needed to ingest streaming video data from
millions of devices. It also durably stores, encrypts, and indexes video data in your streams, and
allows you to access your data through easy-to-use APIs. Kinesis Video Streams enables you to
playback video for live and on-demand viewing, and quickly build applications that take advantage
of computer vision and video analytics through integration with Amazon Rekognition Video, and
libraries for ML frameworks such as Apache MxNet, TensorFlow, and OpenCV.
Amazon OpenSearch Service (OpenSearch Service) makes it easy to deploy, secure, operate, and
scale OpenSearch to search, analyze, and visualize data in real-time. With Amazon OpenSearch
Service, you get easy-to-use APIs and real-time analytics capabilities to power use-cases such as
log analytics, full-text search, application monitoring, and clickstream analytics, with enterprise-
grade availability, scalability, and security. The service offers integrations with open-source
tools such as OpenSearch Dashboards and Logstash for data ingestion and visualization. It also
integrates seamlessly with other AWS services such as Amazon Virtual Private Cloud (Amazon VPC),
AWS Key Management Service (AWS KMS), Amazon Data Firehose, AWS Lambda, AWS Identity and
Access Management (IAM), Amazon Cognito, and Amazon CloudWatch, so that you can go from
raw data to actionable insights quickly.
configuring, managing, and scaling OpenSearch clusters. You get the same interactive millisecond
response times as OpenSearch Service with the simplicity of a serverless environment.
The vector engine for Amazon OpenSearch Serverless, adds a simple, scalable, and high-
performing vector storage and search capability to help developers build ML-augmented
search experiences and generative AI applications without having to manage vector database
infrastructure. Use cases for vector search collections include image search, document search,
music retrieval, product recommendation, video search, location-based search, fraud detection, and
anomaly detection.
Amazon Redshift
Amazon Redshift is the most widely used cloud data warehouse. It makes it fast, simple and cost-
effective to analyze all your data using standard SQL and your existing Business Intelligence (BI)
tools. It allows you to run complex analytic queries against terabytes to petabytes of structured
and semi-structured data, using sophisticated query optimization, columnar storage on high-
performance storage, and massively parallel query completion. Most results come back in seconds.
You can start small for just $0.25 per hour with no commitments and scale out to petabytes
of data for $1,000 per terabyte per year, less than a tenth the cost of traditional on-premises
solutions.
Amazon QuickSight
Amazon QuickSight is a fast, cloud-powered business intelligence (BI) service that makes it easy
for you to deliver insights to everyone in your organization. QuickSight lets you create and publish
interactive dashboards that can be accessed from browsers or mobile devices. You can embed
Amazon Redshift 15
Overview of Amazon Web Services AWS Whitepaper
dashboards into your applications, providing your customers with powerful self-service analytics.
Amazon QuickSight easily scales to tens of thousands of users without any software to install,
servers to deploy, or infrastructure to manage.
Once subscribed to a data product, you can use the AWS Data Exchange API to load data directly
into Amazon S3 and then analyze it with a wide variety of AWS analytics and ML services. For
example, property insurers can subscribe to data to analyze historical weather patterns to calibrate
insurance coverage requirements in different geographies; restaurants can subscribe to population
and location data to identify optimal regions for expansion; academic researchers can conduct
studies on climate change by subscribing to data on carbon dioxide emissions; and healthcare
professionals can subscribe to aggregated data from historical clinical trials to accelerate their
research activities.
For data providers, AWS Data Exchange makes it easy to reach the millions of AWS customers
migrating to the cloud by removing the need to build and maintain infrastructure for data storage,
delivery, billing, and entitling.
intervals. With AWS Data Pipeline, you can regularly access your data where it’s stored, transform
and process it at scale, and efficiently transfer the results to AWS services such as Amazon S3,
Amazon RDS, Amazon DynamoDB, and Amazon EMR.
AWS Data Pipeline helps you easily create complex data processing workloads that are fault
tolerant, repeatable, and highly available. You don’t have to worry about ensuring resource
availability, managing inter-task dependencies, retrying transient failures or timeouts in individual
tasks, or creating a failure notification system. AWS Data Pipeline also allows you to move and
process data that was previously locked up in on-premises data silos.
AWS Entity Resolution is a service that helps you match and link related records stored across
multiple applications, channels, and data stores without building a custom solution. Using
flexible, configurable ML and rule-based techniques, AWS Entity Resolution can remove duplicate
records, create customer profiles by connecting different customer interactions, and personalize
experiences across advertising and marketing campaigns, loyalty programs, and e-commerce. For
example, you can create a unified view of customer interactions by linking recent events, such as ad
clicks, cart abandonment, and purchases, into a unique match ID.
AWS Glue
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy for
customers to prepare and load their data for analytics. You can create and run an ETL job with
a few clicks in the AWS Management Console. You simply point AWS Glue to your data stored
in AWS, and AWS Glue discovers your data and stores the associated metadata (such as table
definition and schema) in the AWS Glue Data Catalog. Once cataloged, your data is immediately
searchable, queryable, and available for ETL.
AWS Glue Data Integration Engines provide access to data using Apache Spark, PySpark, and
Python. With the addition of AWS Glue for Ray, you can further scale your workloads using Ray, an
open-source unified compute framework.
AWS Glue Data Quality can measure and monitor the data quality of Amazon S3 based data lakes,
data warehouses, and other data repositories. It automatically computes statistics, recommends
quality rules, and can monitor and alert you when it detects missing, stale, or bad data. You can
access it in the AWS Glue Data Catalog and in AWS Glue Data Catalog ETL jobs.
However, setting up and managing data lakes today involves a lot of manual, complicated, and
time-consuming tasks. This work includes loading data from diverse sources, monitoring those data
flows, setting up partitions, turning on encryption and managing keys, defining transformation
jobs and monitoring their operation, re-organizing data into a columnar format, configuring access
control settings, deduplicating redundant data, matching linked records, granting access to data
sets, and auditing access over time.
Creating a data lake with Lake Formation is as simple as defining where your data resides and what
data access and security policies you want to apply. Lake Formation then collects and catalogs data
from databases and object storage, moves the data into your new Amazon S3 data lake, cleans
and classifies data using ML algorithms, and secures access to your sensitive data. Your users can
then access a centralized catalog of data which describes available data sets and their appropriate
usage. Your users then leverage these data sets with their choice of analytics and ML services, such
as Amazon EMR for Apache Spark, Amazon Redshift, Amazon Athena, SageMaker AI, and Amazon
QuickSight.
Apache Kafka clusters are challenging to setup, scale, and manage in production. When you run
Apache Kafka on your own, you need to provision servers, configure Apache Kafka manually,
replace servers when they fail, orchestrate server patches and upgrades, architect the cluster for
high availability, ensure data is durably stored and secured, setup monitoring and alarms, and
carefully plan scaling events to support load changes. Amazon MSK makes it easy for you to build
and run production applications on Apache Kafka without needing Apache Kafka infrastructure
management expertise. That means you spend less time managing infrastructure and more time
building applications.
With a few clicks in the Amazon MSK console you can create highly available Apache Kafka clusters
with settings and configuration based on Apache Kafka’s deployment best practices. Amazon MSK
automatically provisions and runs your Apache Kafka clusters. Amazon MSK continuously monitors
cluster health and automatically replaces unhealthy nodes with no downtime to your application.
In addition, Amazon MSK secures your Apache Kafka cluster by encrypting data at rest.
Application integration
Each service is described after the diagram. To help you decide which service best meets your
needs, see Choosing an AWS application integration service or Amazon SQS, Amazon SNS, or
Amazon EventBridge?. For general information, see Application Integration on AWS.
Application integration 19
Overview of Amazon Web Services AWS Whitepaper
Services
• Amazon AppFlow
• Amazon MQ
Application integration 20
Overview of Amazon Web Services AWS Whitepaper
AWS Step Functions is a fully managed service that makes it easy to coordinate the components
of distributed applications and microservices using visual workflows. Building applications from
individual components that each perform a discrete function lets you scale easily and change
applications quickly. Step Functions is a reliable way to coordinate components and step through
the functions of your application. Step Functions provides a graphical console to arrange and
visualize the components of your application as a series of steps. This makes it simple to build and
run multi-step applications. Step Functions automatically initiates and tracks each step, and retries
when there are errors, so your application runs in order and as expected. Step Functions logs the
state of each step, so when things do go wrong, you can diagnose and debug problems quickly. You
can change and add steps without even writing code, so you can easily evolve your application and
innovate faster.
Amazon AppFlow
Amazon AppFlow is a fully managed integration service that enables you to securely transfer
data between Software-as-a-Service (SaaS) applications such as Salesforce, Zendesk, Slack, and
ServiceNow, and AWS services such as Amazon S3 and Amazon Redshift, in just a few clicks. With
Amazon AppFlow, you can run data flows at enterprise scale at the frequency you choose - on a
schedule, in response to a business event, or on demand. You can configure data transformation
capabilities such as filtering and validation to generate rich, ready-to-use data as part of the flow
itself, without additional steps. Amazon AppFlow; automatically encrypts data in motion, and
allows users to restrict data from flowing over the public internet for SaaS applications that are
integrated with AWS PrivateLink, reducing exposure to security threats.
AWS B2B Data Interchange (B2Bi) automates the transformation of Electronic Data Interchange
(EDI) documents into JSON and XML formats to simplify your downstream data integrations.
Businesses use EDI documents to exchange transactional data with trading partners, such as
suppliers and end customers, using standardized formats such as X12.
With B2Bi, you can onboard and manage your trading partners and automate the transformation
of EDI documents into common data representations such as JSON and XML using a low-code
interface. This approach reduces the time, complexity, and cost associated with preparing and
integrating EDI data into their business applications and purpose-built data lakes. As a result,
you can concentrate on using transactional data to drive business insights using the AWS suite of
analytics, AI, and ML services.
Amazon EventBridge
Amazon EventBridge is a serverless event bus that makes it easier to build event-driven
applications at scale using events generated from your applications, integrated Software-as-
a-Service (SaaS) applications, and AWS services. EventBridge delivers a stream of real-time
data from event sources such as Zendesk or Shopify to targets such as AWS Lambda and other
SaaS applications. You can set up routing rules to determine where to send your data to build
application architectures that react in real-time to your data sources with event publisher and
consumer completely decoupled.
Amazon Managed Workflows for Apache Airflow (MWAA) is a managed orchestration service for
Apache Airflow that makes it easier to set up and operate end-to-end data pipelines in the cloud
at scale. Apache Airflow is an open-source tool used to programmatically author, schedule, and
monitor sequences of processes and tasks referred to as “workflows.” With Managed Workflows,
you can use Airflow and Python to create workflows without having to manage the underlying
infrastructure for scalability, availability, and security. Managed Workflows automatically scales its
workflow capacity to meet your needs, and is integrated with AWS security services to help provide
you with fast and secure access to data.
Amazon MQ
Amazon MQ is a managed message broker service for Apache ActiveMQ Classic and RabbitMQ
that makes it easy to set up and operate message brokers in the cloud. Message brokers allow
different software systems–often using different programming languages, and on different
platforms–to communicate and exchange information. Amazon MQ reduces your operational load
by managing the provisioning, setup, and maintenance of ActiveMQ and RabbitMQ, popular open-
source message brokers. Connecting your current applications to Amazon MQ is easy because it
uses industry-standard APIs and protocols for messaging, including JMS, NMS, AMQP, STOMP,
MQTT, and WebSocket. Using standards means that in most cases, there’s no need to rewrite any
messaging code when you migrate to AWS.
Amazon EventBridge 22
Overview of Amazon Web Services AWS Whitepaper
Amazon Simple Notification Service (Amazon SNS) is a highly available, durable, secure, fully
managed pub/sub messaging service that enables you to decouple microservices, distributed
systems, and serverless applications. Amazon SNS provides topics for high-throughput, push-
based, many-to-many messaging. Using Amazon SNS topics, your publisher systems can fan out
messages to a large number of subscriber endpoints for parallel processing, including Amazon SQS
queues, AWS Lambda functions, and HTTP/S webhooks. Additionally, SNS can be used to fan out
notifications to end users using mobile push, SMS, and email.
Amazon Simple Queue Service (Amazon SQS) is a fully managed message queuing service that
enables you to decouple and scale microservices, distributed systems, and serverless applications.
SQS eliminates the complexity and overhead associated with managing and operating message
oriented middleware, and empowers developers to focus on differentiating work. Using Amazon
SQS, you can send, store, and receive messages between software components at any volume,
without losing messages or requiring other services to be available. Get started with Amazon SQS
in minutes using the AWS Management Console, AWS CLI, or SDK of your choice, and three simple
commands.
Amazon SQS offers two types of message queues. Standard queues offer maximum throughput,
best-effort ordering, and at-least-once delivery. Amazon SQS FIFO queues are designed to
guarantee that messages are processed exactly once, in the exact order that they are sent.
Amazon Simple Workflow Service (Amazon SWF) helps developers build, run, and scale background
jobs that have parallel or sequential steps. You can think of Amazon SWF as a fully managed
state tracker and task coordinator in the cloud. If your application’s steps take more than 500
milliseconds to complete, you need to track the state of processing. If you need to recover or retry
if a task fails, Amazon SWF can help you.
Blockchain
Amazon Managed Blockchain is a fully managed service that makes it easy to create and
manage scalable blockchain networks using the popular open source frameworks Hyperledger
Fabric and Ethereum.
Blockchain makes it possible to build applications where multiple parties can run transactions
without the need for a trusted, central authority. Today, building a scalable blockchain network
with existing technologies is complex to set up and hard to manage. To create a blockchain
network, each network member needs to manually provision hardware, install software, create
and manage certificates for access control, and configure networking components. Once the
blockchain network is running, you need to continuously monitor the infrastructure and adapt
to changes, such as an increase in transaction requests, or new members joining or leaving the
network.
Amazon Managed Blockchain is a fully managed service that allows you to set up and manage
a scalable blockchain network with just a few clicks. Amazon Managed Blockchain eliminates
the overhead required to create the network, and automatically scales to meet the demands
of thousands of applications running millions of transactions. Once your network is up and
running, Managed Blockchain makes it easy to manage and maintain your blockchain network.
It manages your certificates, lets you easily invite new members to join the network, and tracks
operational metrics such as usage of compute, memory, and storage resources. In addition,
Managed Blockchain can replicate an immutable copy of your blockchain network activity into
Amazon Quantum Ledger Database (Amazon QLDB), a fully managed ledger database. This
allows you to easily analyze the network activity outside the network and gain insights into
trends.
Business applications
Innovative business applications with the same on-demand scalability, reliability, pay-as-you go
pricing, and machine learning that drives AWS cloud infrastructure.
Applications
Business applications 24
Overview of Amazon Web Services AWS Whitepaper
Alexa for Business is a service that enables organizations and employees to use Alexa to get more
work done. With Alexa for Business, employees can use Alexa as their intelligent assistant to be
more productive in meeting rooms, at their desks, and even with the Alexa devices they already
have at home.
AWS AppFabric
AWS AppFabric is a fully managed service that aggregates and normalizes security data across
software as a service (SaaS) applications. Previously, integrating SaaS applications with existing
security tools required teams to build, manage, and maintain their own point-to-point (P2P)
integrations so that security teams could monitor event logs and understand activity from each
application. With AppFabric, you can quickly connect multiple SaaS applications to increase
observability, productivity, and security—with no coding required.
After the SaaS applications are authorized and connected, AppFabric ingests the data and
normalizes it using the Open Cybersecurity Schema Framework (OCSF). OCSF allows you to set
common policies, standardize security alerts, and quickly manage user access across multiple
applications.
Amazon Chime
Amazon Chime is a communications service that transforms online meetings with a secure, easy-
to-use application that you can trust. Amazon Chime works seamlessly across your devices so that
you can stay connected. You can use Amazon Chime for online meetings, video conferencing, calls,
chat, and to share content, both inside and outside your organization.
Amazon Chime works with Alexa for Business, which means you can use Alexa to start your
meetings with your voice. Alexa can start your video meetings in large conference rooms, and
automatically dial into online meetings in smaller huddle rooms and from your desk.
With the Amazon Chime SDK, builders can easily add real-time voice, video, and messaging
powered by ML into their applications.
Amazon Connect
Amazon Connect is a self-service, omnichannel cloud contact center service that makes it easy
for any business to deliver better customer service at lower cost. Amazon Connect is based on
the same contact center technology used by Amazon customer service associates around the
world to power millions of customer conversations. The self-service graphical interface in Amazon
Connect makes it easy for non-technical users to design contact flows, manage agents, and track
performance metrics – no specialized skills required. There are no up-front payments or long-
term commitments and no infrastructure to manage with Amazon Connect; customers pay by the
minute for Amazon Connect usage plus any associated telephony services.
Amazon Pinpoint
Amazon Pinpoint makes it easy to send targeted messages to your customers through multiple
engagement channels. Examples of targeted campaigns are promotional alerts and customer
retention campaigns, and transactional messages are messages such as order confirmations and
password reset messages.
You can integrate Amazon Pinpoint into your mobile and web apps to capture usage data to
provide you with insight into how customers interact with your apps. Amazon Pinpoint also tracks
the ways that your customers respond to the messages you send—for example, by showing you the
number of messages that were delivered, opened, or clicked.
You can develop custom audience segments and send them pre-scheduled targeted campaigns
via email, SMS, and push notifications. Targeted campaigns are useful for sending promotional or
educational content to re-engage and retain your users.
You can send transactional messages using the console or the Amazon Pinpoint REST API.
Transactional campaigns can be sent via email, SMS, push notifications, and voice messages.
You can also use the API to build custom applications that deliver campaign and transactional
messages.
Amazon SES
Amazon Simple Email Service (Amazon SES) is a cost-effective, flexible, and scalable email service
that enables developers to send mail from within any application. You can configure Amazon
SES quickly to support several email use cases, including transactional, marketing, or mass email
communications. The Amazon SES flexible IP deployment and email authentication options help
drive higher deliverability and protect sender reputation, while sending analytics measure the
impact of each email. With Amazon SES, you can send email securely, globally, and at scale.
Amazon WorkDocs
Notice
New customer sign-ups and account upgrades are no longer available for Amazon
WorkDocs. Learn about migration steps here: How to migrate data from Amazon WorkDocs.
Amazon WorkDocs is a fully managed, secure enterprise storage and sharing service with strong
administrative controls and feedback capabilities that improve user productivity.
Users can comment on files, send them to others for feedback, and upload new versions without
having to resort to emailing multiple versions of their files as attachments. Users can take
advantage of these capabilities wherever they are, using the device of their choice, including PCs,
Macs, tablets, and phones. Amazon WorkDocs offers IT administrators the option of integrating
with existing corporate directories, flexible sharing policies and control of the location where data
is stored.
Amazon WorkMail
Amazon WorkMail is a secure, managed business email and calendar service with support
for existing desktop and mobile email client applications. Amazon WorkMail gives users the
ability to seamlessly access their email, contacts, and calendars using the client application
of their choice, including Microsoft Outlook, native iOS and Android email applications, any
Amazon SES 27
Overview of Amazon Web Services AWS Whitepaper
client application supporting the IMAP protocol, or directly through a web browser. You can
integrate Amazon WorkMail with your existing corporate directory, use email journaling to meet
compliance requirements, and control both the keys that encrypt your data and the location in
which your data is stored. You can also set up interoperability with Microsoft Exchange Server, and
programmatically manage users, groups, and resources using the Amazon WorkMail SDK.
Whether you were born in the cloud, or you are just starting your migration journey to the cloud,
AWS has a set of solutions to help you manage and optimize your spend.
Each service is described after the diagram. To help you decide which service best meets your
needs, see Choosing an AWS cost management strategy. For general information, see Cloud
Financial Management with AWS.
Services
• AWS Application Cost Profiler
With the proportionate cost insights of shared AWS resources, organizations running applications
can establish the data foundation for accurate cost allocation model, and ISV selling applications
can better understand your profitability and customize pricing strategies for your end customers.
AWS Billing Conductor doesn't change the way that you're billed by Amazon Web Services each
month. Instead, it provides you with a mechanism to configure, generate, and display rates to
certain customers over a given billing period. You can also use it to analyze the difference between
the rates you apply to your accounting groupings relative to your actual rates from AWS. As a result
of your AWS Billing Conductor configuration, the payer account can also see the custom rate that's
applied on the billing details page of the AWS Billing console, or configure a cost and usage report
per billing group.
You can configure the billing groups and pricing plans using the AWS Billing Conductor or the AWS
Billing Conductor API. For more information about AWS Billing Conductor service quotas, refer to
Quotas and restrictions.
AWS Cost Explorer has an easy-to-use interface that lets you visualize, understand, and manage
your AWS costs and usage over time. Get started quickly by creating custom reports (including
charts and tabular data) that analyze cost and usage data, both at a high level (such as total costs
and usage across all accounts) and for highly-specific requests (such as m2.2xlarge costs within
account Y that are tagged “project: secretProject”).
AWS Budgets
AWS Budgets gives you the ability to set custom budgets that alert you when your costs or usage
exceed (or are forecasted to exceed) your budgeted amount. You can also use AWS Budgets to set
RI utilization or coverage targets and receive alerts when your utilization drops below the threshold
you define. RI alerts support Amazon EC2, Amazon RDS, Amazon Redshift, and Amazon ElastiCache
reservations.
Budgets can be tracked at the monthly, quarterly, or yearly level, and you can customize the
start and end dates. You can further refine your budget to track costs associated with multiple
dimensions, such as AWS service, linked account, tag, and others. Budget alerts can be sent via
email and/or Amazon Simple Notification Service (Amazon SNS) topic.
Budgets can be created and tracked from the AWS Budgets dashboard or via the AWS Budgets API.
The AWS Cost and Usage Report is a single location for accessing comprehensive information
about your AWS costs and usage.
The AWS Cost and Usage Report lists AWS usage for each service category used by an account
and its IAM users in hourly or daily line items, as well as any tags that you have activated for cost
allocation purposes. You can also customize the AWS Cost and Usage Report to aggregate your
usage data to the daily or monthly level.
AWS provides a number of RI-specific cost management solutions out-of-the-box to help you
better understand and manage your RIs. Using the RI Utilization and Coverage reports available
in AWS Cost Explorer, you can visualize your RI data at an aggregate level or inspect a particular
RI subscription. To access the most detailed RI information available, you can leverage the AWS
Cost and Usage Report. You can also set a custom RI utilization target via AWS Budgets and receive
alerts when your utilization drops below the threshold you define.
Savings Plans
Savings Plans is a flexible pricing model offering lower prices compared to On-Demand pricing, in
exchange for a specific usage commitment (measured in $/hour) for a one or three-year period.
AWS offers three types of Savings Plans – Compute Savings Plans, Amazon EC2 Instance Savings
Plans, and Amazon SageMaker AI Savings Plans. Compute Savings Plans apply to usage across
Amazon EC2, AWS Lambda, and AWS Fargate. The Amazon EC2 Instance Savings Plans apply to
EC2 usage, and Amazon SageMaker AI Savings Plans apply to Amazon SageMaker AI usage. You can
easily sign up a one- or three-year term Savings Plans in AWS Cost Explorer and manage your plans
by taking advantage of recommendations, performance reporting, and budget alerts.
Compute
Each service is described after the diagram. To help you decide which service best meets your
needs, see Choosing an AWS compute service or Amazon Lightsail or Amazon EC2?. For general
information, see Compute on AWS.
Savings Plans 31
Overview of Amazon Web Services AWS Whitepaper
Topics
• Compare AWS compute services
• Amazon EC2
• Amazon EC2 Auto Scaling
• Amazon EC2 Image Builder
• Amazon Lightsail
• Amazon Linux 2023
• AWS App Runner
• AWS Batch
• AWS Elastic Beanstalk
• AWS Fargate
• AWS Lambda
• AWS Serverless Application Repository
• AWS Outposts
• AWS Wavelength
Compute 32
Overview of Amazon Web Services AWS Whitepaper
Cost and capacity management • AWS Savings Plan — Flexible pricing model
that provides savings of up to 72% on AWS
compute usage
• AWS Compute Optimizer — Recommend
s optimal AWS compute resources for your
workloads to reduce costs and improve
performance
• AWS Elastic Beanstalk — Easy-to-use service
for deploying and scaling web applications
and services
• EC2 Image Builder — Build and maintain
secure Linux or Windows Server images
• Elastic Load Balancing (ELB) — Automatic
ally distribute incoming application traffic
across multiple targets
Amazon EC2
Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides secure, resizable
compute capacity in the cloud. It is designed to make web-scale computing easier for developers.
The simple web interface of Amazon EC2 allows you to obtain and configure capacity with minimal
friction. It provides you with complete control of your computing resources and lets you run on
Amazon’s proven computing environment. Amazon EC2 reduces the time required to obtain and
boot new server instances (called Amazon EC2 instances) to minutes, allowing you to quickly scale
capacity, both up and down, as your computing requirements change. Amazon EC2 changes the
economics of computing by allowing you to pay only for capacity that you actually use. Amazon
EC2 provides developers and system administrators the tools to build failure resilient applications
and isolate themselves from common failure scenarios.
Amazon EC2 35
Overview of Amazon Web Services AWS Whitepaper
Instance types
Amazon EC2 passes on to you the financial benefits of Amazon scale. You pay a very low rate for
the compute capacity you actually consume. For a more detailed description, refer to Amazon EC2
pricing.
Amazon EC2 instance types are named based on their family, generation, processor family,
additional capabilities, and size.
• On-Demand Instances — With On-Demand Instances, you pay for compute capacity by the hour
or the second depending on which instances you run. No longer-term commitments or upfront
payments are needed. You can increase or decrease your compute capacity depending on the
demands of your application and only pay the specified per hourly rates for the instance you use.
On-Demand Instances are recommended for:
• Users that prefer the low cost and flexibility of Amazon EC2 without any up-front payment or
long-term commitment
• Applications being developed or tested on Amazon EC2 for the first time
• Spot Instances —Spot Instances are available at up to a 90% discount compared to On-Demand
prices and let you take advantage of unused Amazon EC2 capacity in the AWS Cloud. You can
significantly reduce the cost of running your applications, grow your application’s compute
capacity and throughput for the same budget, and enable new types of cloud computing
applications. Spot Instances are recommended for:
• Users with urgent computing needs for large amounts of additional capacity
• Reserved Instances — Reserved Instances provide you with a significant discount (up to 72%)
compared to On-Demand Instance pricing. You have the flexibility to change families, operating
system types, and tenancies while benefiting from Reserved Instance pricing when you use
Convertible Reserved Instances.
• C7g Instances — C7g Instances, powered by the latest generation AWS Graviton3 processors,
provide the best price performance in Amazon EC2 for compute-intensive workloads. C7g
instances are ideal for high performance computing (HPC), batch processing, electronic design
automation (EDA), gaming, video encoding, scientific modeling, distributed analytics, CPU-based
ML inference, and ad serving.
Amazon EC2 36
Overview of Amazon Web Services AWS Whitepaper
• Inf2 Instances — Inf2 Instances are purpose--built for deep learning inference. They deliver
high performance at the lowest cost in Amazon EC2 for generative AI models, including large
language models (LLMs) and vision transformers. Inf2 instances are powered by AWS Inferentia2,
the second-generation AWS Inferentia accelerator.
• M7g Instances — M7g instances, powered by the latest generation AWS Graviton3 processors,
provide the best price performance in Amazon EC2 for general purpose workloads. M7g
instances are ideal for applications built on open-source software such as application servers,
microservices, gaming servers, mid-size data stores, and caching fleets.
• R7g Instances — R7g Instances, powered by the latest generation AWS Graviton3 processors,
provide the best price performance in Amazon EC2 for memory-intensive workloads. R7g
instances are ideal for memory-intensive workloads such as open-source databases, in-memory
caches, and near real-time big data analytics.
• Trn1 Instances — Trn1 Instances, powered by AWS Trainium accelerators, are purpose-built for
high-performance deep learning training of generative AI models, including LLMs and latent
diffusion models. Trn1 instances offer up to 50% cost-to-train savings over other comparable
Amazon EC2 instances.
• Savings Plans — Savings Plans are a flexible pricing model that offer low prices on EC2 and
Fargate usage, in exchange for a commitment to a consistent amount of usage (measured in $/
hour) for a one or three year term.
• Dedicated Hosts — A Dedicated Host is a physical EC2 server dedicated for your use. Dedicated
Hosts can help you reduce costs by allowing you to use your existing server-bound software
licenses, including Windows Server, Microsoft SQL Server, and SUSE Linux Enterprise Server
(subject to your license terms), and can also help you meet compliance requirements.
Amazon EC2 Auto Scaling helps you maintain application availability and allows you to
automatically add or remove EC2 instances according to conditions you define. You can use the
fleet management features of Amazon EC2 Auto Scaling to maintain the health and availability
of your fleet. You can also use the dynamic and predictive scaling features of Amazon EC2 Auto
Scaling to add or remove EC2 instances. Dynamic scaling responds to changing demand and
predictive scaling automatically schedules the right number of EC2 instances based on predicted
demand. Dynamic scaling and predictive scaling can be used together to scale faster.
EC2 Image Builder simplifies the building, testing, and deployment of VMs and container images
for use on AWS or on-premises.
Keeping virtual machine (VM) and container images up-to-date can be time consuming, resource
intensive, and error-prone. Currently, customers either manually update and snapshot VMs or have
teams that build automation scripts to maintain images.
EC2 Image Builder significantly reduces the effort of keeping images up-to-date and secure by
providing a simple graphical interface, built-in automation, and AWS-provided security settings.
With Image Builder, there are no manual steps for updating an image nor do you have to build your
own automation pipeline.
Image Builder is offered at no cost, other than the cost of the underlying AWS resources used to
create, store, and share the images.
Amazon Lightsail
Amazon Lightsail is designed to be the easiest way to launch and manage a virtual private server
with AWS. Lightsail plans include everything you need to jumpstart your project – a VM, SSD-based
storage, data transfer, DNS management, and a static IP address – for a low, predictable price.
Amazon Linux 2023 (AL2023) is our new Linux-based operating system for AWS that is designed
to provide a secure, stable, high-performance environment to develop and run your cloud
applications. AL2023 provides seamless integration with various AWS services and development
tools, and offers optimized performance for Amazon EC2 Graviton-based instances and AWS
Support at no additional licensing cost. Starting with AL2023, a new Amazon Linux major release
will be available every two years. This cadence provides you with a more predictable release cycle
and up to 5 years of support, making it easier for you to plan your upgrades.
AL2023 offers several improvements over Amazon Linux 2 (AL2). For example, AL2023 takes a
security-by-default approach to help improve your security posture with preconfigured security
policies, SELinux in permissive mode and IMDSv2 enabled by default, and the availability of kernel
live patching. With deterministic upgrades through versioned repositories, you can lock to a specific
version of the Amazon Linux package repository, giving you control over how and when you absorb
updates. With this capability, you can adhere to operational best practices more efficiently by
ensuring consistency between package versions and updates across your environment. For a full
comparison, refer to Comparing Amazon Linux 2 and Amazon Linux 2023.
Amazon Linux 2023 is generally available in all AWS Regions, including the AWS GovCloud (US) and
the China Regions.
AWS App Runner is a fully managed service that makes it easy for developers to quickly deploy
containerized web applications and APIs, at scale and with no prior infrastructure experience
required. Start with your source code or a container image. AWS App Runner automatically builds
and deploys the web application and load balances traffic with encryption. App Runner also scales
up or down automatically to meet your traffic needs. With App Runner, rather than thinking about
servers or scaling, you have more time to focus on your applications.
AWS Batch
AWS Batch enables developers, scientists, and engineers to easily and efficiently run hundreds
of thousands of batch computing jobs on AWS. AWS Batch dynamically provisions the optimal
quantity and type of compute resources (such as CPU or memory-optimized instances) based
on the volume and specific resource requirements of the batch jobs submitted. With AWS Batch,
there is no need to install and manage batch computing software or server clusters that you use
to run your jobs, allowing you to focus on analyzing results and solving problems. AWS Batch
plans, schedules, and runs your batch computing workloads across the full range of AWS compute
services and features, such as Amazon EC2 and Spot Instances.
AWS Elastic Beanstalk is an easy-to-use service for deploying and scaling web applications and
services developed with Java, .NET, PHP, Node.js, Python, Ruby, Go, and Docker on familiar servers
such as Apache, Nginx, Passenger, and Internet Information Services (IIS).
You can simply upload your code, and AWS Elastic Beanstalk automatically handles the
deployment, from capacity provisioning, load balancing, and auto scaling to application health
monitoring. At the same time, you retain full control over the AWS resources powering your
application and can access the underlying resources at any time.
AWS Fargate
AWS Fargate is a compute engine for Amazon ECS that allows you to run containers without having
to manage servers or clusters. With AWS Fargate, you no longer have to provision, configure, and
scale clusters of VMs to run containers. This removes the need to choose server types, decide when
to scale your clusters, or optimize cluster packing. Fargate removes the need for you to interact
with or think about servers or clusters. Fargate lets you focus on designing and building your
applications instead of managing the infrastructure that runs them.
Amazon ECS has two modes: Fargate launch type and EC2 launch type. With Fargate launch
type, all you have to do is package your application in containers, specify the CPU and memory
requirements, define networking and IAM policies, and launch the application. EC2 launch type
allows you to have server-level, more granular control over the infrastructure that runs your
container applications. With EC2 launch type, you can use Amazon ECS to manage a cluster of
servers and schedule placement of containers on the servers. Amazon ECS keeps track of all the
CPU, memory and other resources in your cluster, and also finds the best server for a container to
run on based on your specified resource requirements.
You are responsible for provisioning, patching, and scaling clusters of servers. You can decide which
type of server to use, which applications and how many containers to run in a cluster to optimize
utilization, and when you should add or remove servers from a cluster. EC2 launch type gives you
more control of your server clusters and provides a broader range of customization options, which
might be required to support some specific applications or possible compliance and government
requirements.
AWS Lambda
AWS Lambda lets you run code without provisioning or managing servers. You pay only for the
compute time you consume—there is no charge when your code is not running. With Lambda, you
can run code for virtually any type of application or backend service—all with zero administration.
Just upload your code, and Lambda takes care of everything required to run and scale your code
with high availability. You can set up your code to automatically run from other AWS services, or
you can call it directly from any web or mobile app.
The AWS Serverless Application Repository enables you to quickly deploy code samples,
components, and complete applications for common use cases such as web and mobile backends,
AWS Fargate 40