Encryption SDK Developer Guide
Encryption SDK Developer Guide
Developer Guide
AWS Encryption SDK Developer Guide
Table of Contents
What Is the AWS Encryption SDK? ....................................................................................................... 1
Where to find more information .................................................................................................. 2
How the SDK Works ................................................................................................................... 2
Symmetric Key Encryption .................................................................................................. 2
Envelope Encryption ........................................................................................................... 3
AWS Encryption SDK Encryption Workflows .......................................................................... 4
Concepts ................................................................................................................................... 5
Data Keys .......................................................................................................................... 6
Master key ........................................................................................................................ 6
Master key operations: Generate, Encrypt, Decrypt ................................................................. 6
Master key provider ............................................................................................................ 7
Cryptographic Materials Manager ......................................................................................... 7
Algorithm Suite ................................................................................................................. 7
Encryption Context ............................................................................................................. 8
Encrypted Message ............................................................................................................. 8
Getting Started .................................................................................................................................. 9
Supported Algorithm Suites .............................................................................................................. 10
Recommended: AES-GCM with Key Derivation and Signing ............................................................ 10
Other Supported Algorithm Suites ............................................................................................. 11
Programming Languages ................................................................................................................... 12
Java ........................................................................................................................................ 12
Prerequisites .................................................................................................................... 12
Installation ...................................................................................................................... 13
Example Code .................................................................................................................. 13
Python .................................................................................................................................... 20
Prerequisites .................................................................................................................... 20
Installation ...................................................................................................................... 20
Example Code .................................................................................................................. 21
Command Line Interface ........................................................................................................... 26
Installing the CLI .............................................................................................................. 27
How to Use the CLI .......................................................................................................... 29
Examples ......................................................................................................................... 36
Syntax and Parameter Reference ........................................................................................ 49
Data Key Caching ............................................................................................................................. 55
How to Implement Data Key Caching ......................................................................................... 55
Implement Data Key Caching: Step-by-Step ......................................................................... 56
Data Key Caching Example: Encrypt a String ........................................................................ 58
Setting Cache Security Thresholds .............................................................................................. 60
Data Key Caching Details .......................................................................................................... 61
How Data Key Caching Works ............................................................................................ 62
Creating a Cryptographic Materials Cache ............................................................................ 64
Creating a Caching Cryptographic Materials Manager ............................................................ 65
What Is in a Data Key Cache Entry? .................................................................................... 65
Encryption Context: How to Select Cache Entries ................................................................. 66
Data Key Caching Example ........................................................................................................ 66
LocalCryptoMaterialsCache Results ..................................................................................... 67
Java Example ................................................................................................................... 68
Python Example ............................................................................................................... 72
AWS CloudFormation Template .......................................................................................... 75
Frequently Asked Questions ............................................................................................................... 79
Reference ........................................................................................................................................ 82
Message Format Reference ........................................................................................................ 82
Header Structure .............................................................................................................. 83
Body Structure ................................................................................................................. 87
iii
AWS Encryption SDK Developer Guide
iv
AWS Encryption SDK Developer Guide
The AWS Encryption SDK answers questions like the following for you:
Without the AWS Encryption SDK, you might spend more effort on building an encryption solution
than on the core functionality of your application. The AWS Encryption SDK answers these questions by
providing the following things.
By default, the AWS Encryption SDK generates a unique data key for each data object that it
encrypts. This follows the cryptography best practice of using unique data keys for each encryption
operation.
The AWS Encryption SDK encrypts your data using a secure, authenticated, symmetric key algorithm.
For more information, see Supported Algorithm Suites (p. 10).
A Framework for Protecting Data Keys with Master Keys
The AWS Encryption SDK protects the data keys that encrypt your data by encrypting them under
one or more master keys. By providing a framework to encrypt data keys with more than one master
key, the AWS Encryption SDK helps make your encrypted data portable.
For example, you can encrypt data under multiple AWS Key Management Service (AWS KMS)
customer master keys (CMKs), each in a different AWS Region. Then you can copy the encrypted data
to any of the regions and use the CMK in that region to decrypt it. You can also encrypt data under
a CMK in AWS KMS and a master key in an on-premises HSM, enabling you to later decrypt the data
even if one of the options is unavailable.
A Formatted Message that Stores Encrypted Data Keys with the Encrypted Data
The AWS Encryption SDK stores the encrypted data and encrypted data key together in an
encrypted message (p. 8) that uses a defined data format. This means you don't need to keep
track of or protect the data keys that encrypt your data because the AWS Encryption SDK does it for
you.
With the AWS Encryption SDK, you define a master key provider (p. 7) that returns one or more
master keys (p. 6). Then you encrypt and decrypt your data using straightforward methods provided
by the AWS Encryption SDK. The AWS Encryption SDK does the rest.
1
AWS Encryption SDK Developer Guide
Where to find more information
For information about implementations of the AWS Encryption SDK in different programming languages.
• Java: See AWS Encryption SDK for Java (p. 12), the AWS Encryption SDK Javadocs, and the aws-
encryption-sdk-java repository on GitHub.
• Python: See AWS Encryption SDK for Python (p. 20), the AWS Encryption SDK Python
documentation, and the aws-encryption-sdk-python repository on GitHub.
• Command Line Interface: See AWS Encryption SDK Command Line Interface (p. 26), Read the Docs
for the AWS Encryption CLI, and the aws-encryption-sdk-cli repository on GitHub.
If you have questions or comments about this guide, let us know! Use the feedback links in the lower
right corner of this page.
The AWS Encryption SDK is provided for free under the Apache license.
Topics
• Symmetric Key Encryption (p. 2)
• Envelope Encryption (p. 3)
• AWS Encryption SDK Encryption Workflows (p. 4)
To decrypt the encrypted message, the AWS Encryption SDK provides the encrypted message to a
decryption algorithm that uses those inputs to return the plaintext data.
Because the same data key is used to encrypt and decrypt the data, the operations are known as
symmetric key encryption and decryption. The following figure shows symmetric key encryption and
decryption in the AWS Encryption SDK.
2
AWS Encryption SDK Developer Guide
Envelope Encryption
Envelope Encryption
The security of your encrypted data depends on protecting the data key that can decrypt it. One
accepted best practice for protecting the data key is to encrypt it. To do this, you need another
encryption key, known as a master key (p. 6). This practice of using a master key to encrypt data keys
is known as envelope encryption. Some of the benefits of envelope encryption include the following.
When you encrypt a data key, you don't have to worry about where to store it because the data
key is inherently protected by encryption. You can safely store the encrypted data key with the
encrypted data. The AWS Encryption SDK does this for you. It saves the encrypted data and the
encrypted data key together in an encrypted message (p. 8).
Encrypting the Same Data Under Multiple Master Keys
Encryption operations can be time-consuming, particularly when the data being encrypted are large
objects. Instead of reencrypting raw data multiple times with different keys, you can reencrypt only
the data keys that protect the raw data.
Combining the Strengths of Multiple Algorithms
In general, symmetric key encryption algorithms are faster and produce smaller ciphertexts than
assymetric or public key encryption. But, public key algorithms provide inherent separation of roles
and easier key management. You might want to combine the strengths of each. For example, you
might encrypt raw data with symmetric key encryption, and then encrypt the data key with public
key encryption.
The AWS Encryption SDK uses envelope encryption. It encrypts your data with a data key. Then, it
encryptes the data key with a master key. The AWS Encryption SDK returns the encrypted data and the
encrypted data keys in a single encrypted message, as shown in the following diagram.
3
AWS Encryption SDK Developer Guide
AWS Encryption SDK Encryption Workflows
If you have multiple master keys, each of them can encrypt the plaintext data key. Then, the AWS
Encryption SDK returns an encrypted message that contains the encrypted data and the collection of
encrypted data keys. Any one of the master keys can decrypt one of the encrypted data keys, which can
then decrypt the data.
When you use envelope encryption, you must protect your master keys from unauthorized access. You
can do this in one of the following ways:
• Use a web service designed for this purpose, such as AWS Key Management Service (AWS KMS).
• Use a hardware security module (HSM) such as those offered by AWS CloudHSM.
• Use your existing key management tools.
If you don't have a key management system, we recommend AWS KMS. The AWS Encryption SDK
integrates with AWS KMS to help you protect and use your master keys. You can also use the AWS
Encryption SDK with other master key providers, including custom ones that you define. Even if you
don't use AWS, you can still use this AWS Encryption SDK.
To indicate the source of the data keys (p. 6) that you want to use to encrypt your data, your
request specifies a cryptographic materials manager (CMM) or a master key provider. (If you specify a
master key provider, the AWS Encryption SDK creates a default CMM that interacts with your chosen
master key provider.)
2. The encryption method asks the CMM for data keys (and related cryptographic material).
4
AWS Encryption SDK Developer Guide
Concepts
3. The CMM gets a master key (p. 6) from its master key provider.
Note
If you are using AWS Key Management Service (AWS KMS), the KMS master key object that is
returned identifies the CMK, but the actual CMK never leaves the AWS KMS service.
4. The CMM asks the master key to generate a data key. The master key returns two copies of the data
key, one in plaintext and one encrypted under the master key.
5. The CMM returns the plaintext and encrypted data keys to the encryption method.
6. The encryption method uses the plaintext data key to encrypt the data, and then discards the
plaintext data key.
7. The encryption method returns an encrypted message (p. 8) that contains the encrypted data and
the encrypted data key.
To indicate the source of the data keys (p. 6) that were used to encrypt your data, your request
specifies a cryptographic materials manager (CMM) or a master key provider. (If you specify a master
key provider, the AWS Encryption SDK creates a default CMM that interacts with the specified master
key provider.)
2. The decryption method extracts the encrypted data key from the encrypted message. Then, it asks the
cryptographic materials manager (CMM) for a data key to decrypt the encrypted data key.
3. The CMM asks its master key provider for a master key that can decrypt the encrypted data key.
4. The CMM uses the master key to decrypt the encrypted data key. Then, it returns the plaintext data
key to the decryption method.
5. The decryption method uses the plaintext data key to decrypt the data, then discards the plaintext
data key.
6. The decryption method returns the plaintext data.
Topics
• Data Keys (p. 6)
• Master key (p. 6)
• Master key operations: Generate, Encrypt, Decrypt (p. 6)
• Master key provider (p. 7)
• Cryptographic Materials Manager (p. 7)
• Algorithm Suite (p. 7)
• Encryption Context (p. 8)
• Encrypted Message (p. 8)
5
AWS Encryption SDK Developer Guide
Data Keys
Data Keys
A data key consists of cryptographic material. It is the secret key that protects the data that you encrypt.
Data keys are generated by master keys (p. 6). You do not need to implement or extend data keys
to use the AWS Encryption SDK. When a master key generates a data key, it returns two copies of the
data key; one in plaintext and one that is encrypted by the master key that generated it. The plaintext
data key can be encrypted by multiple master keys, each of which returns an encrypted copy of the data
key. Every encrypted data key is associated with the master key that encrypted it and the master key
provider (p. 7) that supplied the master key.
When you encrypt data in the AWS Encryption SDK, the encrypted data keys are stored in an encrypted
message (p. 8) along with the encrypted data.
In the AWS Encryption SDK, we distinguish data keys from data encryption keys. Several of the supported
algorithm suites (p. 7), including the default suite, use a key derivation function that prevents the
data key from hitting its cryptographic limits. The key derivation function takes the data key as input and
returns a data encryption key that is actually used to encrypt the data. For this reason, we often say that
data is encrypted "under" a data key rather than "by" the data key.
Master key
A master key encrypts, decrypts, and generates data keys (p. 2).
The AWS Encryption SDK represents master keys as abstract classes or interfaces so you can implement
the master key operations in the way that best meets the security requirements of your organization. For
example, although they are called "keys," master keys might not have their own cryptographic material.
Also, unlike data keys, whose use and algorithm suite (p. 7) are strictly defined by AWS Encryption
SDK, master keys can use any algorithm suite or implementation.
Master keys are instrumental to envelope encryption (p. 3). In envelope encryption, one master key
generates and encrypts a data key that is used to encrypt data. Other master keys then re-encrypt the
plaintext data key. As a result, any master key is sufficient to decrypt the data.
Each master key is associated with one master key provider (p. 7) that returns one or more master
keys to the caller.
The AWS Encryption SDK provides several commonly used master keys, such as AWS Key Management
Service (AWS KMS) customer master keys (CMKs), raw AES-GCM (Advanced Encryption Standard / Galois
Counter Mode) keys, and RSA keys. You can implement your own master keys for other cryptographic
algorithms and services. For example, you could implement master keys backed by implementations of
Elliptical Curve Integrated Encryption Scheme (ECIES), Key Management Interoperability Program (KMIP),
tokenization services, or other proprietary systems.
You can implement the master key methods in the way that works best for your organization. For
example, when asked to generate a data key, a master key can create or return a key in any way that
fulfills the requirements of the algorithm suite that they use. Master keys can generate data keys locally
or remotely. They can derive the keys algorithmically, call a service that generates the cryptographic
material, or return previously-generated data keys. The SDK requires only that they return a valid data
key object.
6
AWS Encryption SDK Developer Guide
Master key provider
Also, although master keys must implement all three methods, you can create master keys that actually
perform only one or two of the three operations. Calls to the remaining methods just fail or return
errors. These limited master keys might be useful in a system with strict access controls that do not let
the same users encrypt and decrypt data.
All master key operations take an encryption context (p. 8) as input. For optimal security, master key
operations that encrypt data keys should cryptographically bind the encryption context to the encrypted
data so that changing any key or value in the encryption context invalidates the encryption. Master
key operations that decrypt should verify the encryption context and fail unless they include the same
encryption context used to encrypt. The encryption context is most useful when there are users who
have permission to decrypt, but not encrypt.
The simplest master key provider always returns the same master key (p. 6). In fact, master keys are
implemented as master keys providers that only return themselves. More complex master key providers
might use key rotation, the encryption context, application permissions, and other factors to select
master keys from among the set they can provide.
Many master keys providers wrap or extend other master key providers to customize their behavior and
functionality. For example, a custom master key provider might select a master key provider from a
collection, delegate requests, and combine their results.
Each Default CMM is associated with a master key provider (p. 7). When it gets a materials request,
the Default CMM gets master keys from its master key provider and uses them to generate the
requested cryptographic material. This might involve a call to a cryptographic service, such as AWS Key
Management Service (AWS KMS).
In each call to encrypt or decrypt data, you specify a CMM or a master key provider. This lets you choose
a particular set of master keys for the operation. You can create a CMM explicitly and specify its master
key provider, but that is not required. If you specify a master key provider in an encryption request, the
SDK creates a Default CMM for the master key provider.
Because the CMM acts as a liaison between the SDK and a master key provider, it is an ideal point for
customization and extension, such as support for policy enforcement and caching.
Algorithm Suite
The AWS Encryption SDK supports several (p. 12) algorithm suites (p. 10), all of which use
Advanced Encryption Standard (AES) as the primary algorithm, and combine it with other algorithm and
values.
The AWS Encryption SDK establishes a recommended algorithm suite as the default for all encryption
operations. The default might change as standards and best practices improve. You can specify an
alternate algorithm suite in requests to encrypt data or when creating a cryptographic materials
manager (CMM) (p. 7), but unless an alternate is required for your situation, it is best to use the
default. The current default is AES-GCM with an HMAC-based extract-and-expand key derivation
7
AWS Encryption SDK Developer Guide
Encryption Context
function (HKDF), Elliptic Curve Digital Signature Algorithm (ECDSA) signing, and a 256-bit encryption
key.
If you specify an algorithm suite, we recommend an algorithm suite that uses a key derivation function
and a message signing algorithm. Algorithm suites that have neither feature are supported only for
backward compatibility.
Encryption Context
To improve the security of your cryptographic operations, use an encryption context in all requests to
encrypt data. The encryption context is optional, but recommended.
An encryption context is a set of key–value pairs that contain arbitrary nonsecret data. The encryption
context can contain any data you choose, but it typically consists of data that is useful in logging and
tracking, such as data about the file type, purpose, or ownership.
In requests to encrypt data, you can include an encryption context along with the plaintext data and
a master key provider. The AWS Encryption SDK cryptographically binds the encryption context to the
encrypted data so that the same encryption context is required to decrypt the data. The AWS Encryption
SDK also includes the encryption context in the encrypted message (p. 8) that it returns, along with
the encrypted data and data keys. The encryption context in the encrypted message always includes the
encryption context that you specified in the encryption request, along with elements that the operation
might add, such as a public signing key.
To decrypt the data, you pass in the encrypted message. Because the AWS Encryption SDK can extract
the encryption context from the message, you do not need to pass it in separately. After decrypting
the data, the AWS Encryption SDK returns a result that includes that encryption context along with
the plaintext data. The functions in your application that decrypt data should always verify that the
encryption context in the decrypt result includes the values that you expect before it returns the
plaintext data.
When choosing an encryption context, remember that it is not a secret. The encryption context is
displayed in plaintext in the header of the encrypted message (p. 8) that the SDK returns. If you
are using AWS Key Management Service, the encryption context also might appear in plaintext in audit
records and logs, such as AWS CloudTrail.
Encrypted Message
Encrypt operations in the AWS Encryption SDK return an encrypted message and decrypt operations
take an encrypted message as input. An encrypted message, a formatted data structure (p. 82)
that includes the encrypted data along with encrypted copies of the data keys, the algorithm ID, and,
optionally, an encryption context and a message signature.
Combining the encrypted data and its encrypted data keys streamlines the decryption operation and
frees you from having to store and manage encrypted data keys independently of the data that they
encrypt.
For technical information about the encrypted message, see Encrypted Message Format (p. 82).
8
AWS Encryption SDK Developer Guide
To interact with AWS KMS, you need to use the AWS SDK for your preferred programming language, such
as the AWS SDK for Java or the AWS SDK for Python (Boto). The AWS Encryption SDK client library works
with the AWS SDKs to support master keys stored in AWS KMS.
1. Create an AWS account. To learn how, see How do I create and activate a new Amazon Web Services
account? in the AWS Knowledge Center.
2. Create a customer master key (CMK) in AWS KMS. To learn how, see Creating Keys in the AWS Key
Management Service Developer Guide.
Tip
To use the CMK programmatically, you will need the ID or Amazon Resource Name (ARN)
of the CMK. For help finding the ID or ARN of a CMK, see Viewing Keys in the AWS Key
Management Service Developer Guide.
3. Create an IAM user with an access key. To learn how, see Creating IAM Users in the IAM User Guide.
When you create the user, for Access type, choose Programmatic access. After you create the user,
choose Download.csv to save the AWS access key that represents your user credentials. Store the file
in a secure location.
We recommend that you use AWS Identity and Access Management (IAM) access keys instead of AWS
(root) account access keys. IAM lets you securely control access to AWS services and resources in your
AWS account. For detailed best practice guidance, see Best Practices for Managing AWS Access Keys
The Download.csv file contains an AWS access key ID and a secret access key that represents the
AWS credentials of the user that you created. When you write code without using an AWS SDK, you
use your access key to sign your requests to AWS. The signature assures AWS that the request came
from you unchanged. However, when you use an AWS SDK, such as the AWS SDK for Java, the SDK
signs all requests to AWS for you.
4. Set your AWS credentials using the instructions for Java or Python and the AWS access key in the
Download.csv file that you downloaded in Step 3.
This procedure allows AWS SDKs to sign requests to AWS for you. Code samples in the AWS
Encryption SDK that interact with AWS KMS assume that you have completed this step.
5. Download and install the AWS Encryption SDK. To learn how, see the installation instructions for the
programming language (p. 12) that you want to use.
9
AWS Encryption SDK Developer Guide
Recommended: AES-GCM with Key Derivation and Signing
The AWS Encryption SDK algorithm suite uses the Advanced Encryption Standard (AES) algorithm in
Galois/Counter Mode (GCM), known as AES-GCM, to encrypt raw data. The SDK supports 256-bit, 192-
bit, and 128-bit encryption keys. The length of the initialization vector (IV) is always 12 bytes; the length
of the authentication tag is always 16 bytes.
The SDK implements AES-GCM in one of three ways. By default, the SDK uses AES-GCM with an HMAC-
based extract-and-expand key derivation function (HKDF), signing, and a 256-bit encryption key.
The HKDF helps you avoid accidental reuse of a data encryption key.
This algorithm suite uses ECDSA and a message signing algorithm (SHA-384 or SHA-256). ECDSA is used
by default, even when it is not specified by the policy for the underlying master key. Message signing
verifies the identity of the message sender and adds message authenticity to the envelope encrypted
data. It is particularly useful when the authorization policy for a master key allows one set of users to
encrypt data and a different set of users to decrypt data.
The following table lists the variations of the recommended algorithm suites.
10
AWS Encryption SDK Developer Guide
Other Supported Algorithm Suites
This algorithm suite uses a key derivation function, but lacks the ECDSA signature that provides
authenticity and nonrepudiation. Use this suite when the users who encrypt data and those who
decrypt it are equally trusted.
AES-GCM without Key Derivation or Signing
This algorithm suite uses the data encryption key as the AES-GCM encryption key, instead of using a
key derivation function to derive a unique key. We discourage using this suite to generate ciphertext,
but the SDK supports it for compatibility reasons.
For more information about how these suites are represented and used in the library, see the section
called “Algorithms Reference” (p. 96).
11
AWS Encryption SDK Developer Guide
Java
Topics
• AWS Encryption SDK for Java (p. 12)
• AWS Encryption SDK for Python (p. 20)
• AWS Encryption SDK Command Line Interface (p. 26)
Topics
• Prerequisites (p. 12)
• Installation (p. 13)
• AWS Encryption SDK for Java Example Code (p. 13)
Prerequisites
Before you install the AWS Encryption SDK for Java, be sure you have the following prerequisites.
You will need Java 8 or later. On the Oracle website, go to Java SE Downloads, and then download
and install the Java SE Development Kit (JDK).
If you use the Oracle JDK, you must also download and install the Java Cryptography Extension (JCE)
Unlimited Strength Jurisdiction Policy Files.
Bouncy Castle
Bouncy Castle provides a cryptography API for Java. If you don't have Bouncy Castle, go to Bouncy
Castle latest releases to download the provider file that corresponds to your JDK.
If you use Apache Maven, Bouncy Castle is available with the following dependency definition.
<dependency>
<groupId>org.bouncycastle</groupId>
<artifactId>bcprov-ext-jdk15on</artifactId>
<version>1.58</version>
</dependency>
12
AWS Encryption SDK Developer Guide
Installation
Although you don't need the AWS SDK for Java to use the AWS Encryption SDK for Java, you do
need it to use AWS Key Management Service (AWS KMS) as a master key provider, and to use
some of the example Java code (p. 13) in this guide. For more information about installing and
configuring the AWS SDK for Java, see AWS SDK for Java.
Installation
You can install the AWS Encryption SDK for Java in the following ways.
Manually
To install the AWS Encryption SDK for Java, clone or download the aws-encryption-sdk-java GitHub
repository.
Using Apache Maven
The AWS Encryption SDK for Java is available through Apache Maven with the following dependency
definition.
<dependency>
<groupId>com.amazonaws</groupId>
<artifactId>aws-encryption-sdk-java</artifactId>
<version>1.3.1</version>
</dependency>
After you install the SDK, get started by looking at the example Java code (p. 13) in this guide and the
Javadoc on GitHub.
Topics
• Encrypting and Decrypting Strings (p. 13)
• Encrypting and Decrypting Byte Streams (p. 15)
• Encrypting and Decrypting Byte Streams with Multiple Master Key Providers (p. 17)
This example uses an AWS Key Management Service (AWS KMS) customer master key (CMK) as the
master key. For help creating a key, see Creating Keys in the AWS Key Management Service Developer
Guide.
To find the Amazon Resource name (ARN) of an existing CMK, go to the Encryption keys section of the
AWS Management Console, select the region, and then click the CMK alias. You can also use the AWS
KMS ListKeys operation. For details, see Viewing Keys in the AWS Key Management Service Developer
Guide.
/*
13
AWS Encryption SDK Developer Guide
Example Code
package com.amazonaws.crypto.examples;
import java.util.Collections;
import java.util.Map;
import com.amazonaws.encryptionsdk.AwsCrypto;
import com.amazonaws.encryptionsdk.CryptoResult;
import com.amazonaws.encryptionsdk.kms.KmsMasterKey;
import com.amazonaws.encryptionsdk.kms.KmsMasterKeyProvider;
/**
* <p>
* Encrypts and then decrypts a string under a KMS key
*
* <p>
* Arguments:
* <ol>
* <li>Key ARN: For help finding the Amazon Resource Name (ARN) of your KMS customer
master
* key (CMK), see 'Viewing Keys' at http://docs.aws.amazon.com/kms/latest/
developerguide/viewing-keys.html
* <li>String to encrypt
* </ol>
*/
public class StringExample {
private static String keyArn;
private static String data;
14
AWS Encryption SDK Developer Guide
Example Code
// Before returning the plaintext, verify that the customer master key that
// was used in the encryption operation was the one supplied to the master key
provider.
if (!decryptResult.getMasterKeyIds().get(0).equals(keyArn)) {
throw new IllegalStateException("Wrong key ID!");
}
// Also, verify that the encryption context in the result contains the
// encryption context supplied to the encryptString method. Because the
// SDK can add values to the encryption context, don't require that
// the entire context matches.
for (final Map.Entry<String, String> e : context.entrySet()) {
if (!e.getValue().equals(decryptResult.getEncryptionContext().get(e.getKey())))
{
throw new IllegalStateException("Wrong Encryption Context!");
}
}
/*
* Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
*
* Licensed under the Apache License, Version 2.0 (the "License"). You may not use this
file except
* in compliance with the License. A copy of the License is located at
*
* http://aws.amazon.com/apache2.0
*
* or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
* specific language governing permissions and limitations under the License.
*/
package com.amazonaws.crypto.examples;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.security.SecureRandom;
import java.util.Collections;
import java.util.Map;
import javax.crypto.SecretKey;
import javax.crypto.spec.SecretKeySpec;
import com.amazonaws.encryptionsdk.AwsCrypto;
import com.amazonaws.encryptionsdk.CryptoInputStream;
import com.amazonaws.encryptionsdk.MasterKey;
15
AWS Encryption SDK Developer Guide
Example Code
import com.amazonaws.encryptionsdk.jce.JceMasterKey;
import com.amazonaws.util.IOUtils;
/**
* <p>
* Encrypts and then decrypts a file under a random key.
*
* <p>
* Arguments:
* <ol>
* <li>Name of file containing plaintext data to encrypt
* </ol>
*
* <p>
* This program demonstrates using a standard Java {@link SecretKey} object as a {@link
MasterKey} to
* encrypt and decrypt streaming data.
*/
public class FileStreamingExample {
private static String srcFile;
// Create a JCE master key provider using the random key and an AES-GCM encryption
algorithm
JceMasterKey masterKey = JceMasterKey.getInstance(cryptoKey, "Example",
"RandomKey", "AES/GCM/NoPadding");
// Because the file might be to large to load into memory, we stream the data,
instead of
//loading it all at once.
FileInputStream in = new FileInputStream(srcFile);
CryptoInputStream<JceMasterKey> encryptingStream =
crypto.createEncryptingStream(masterKey, in, context);
// Decrypt the file. Verify the encryption context before returning the plaintext.
in = new FileInputStream(srcFile + ".encrypted");
CryptoInputStream<JceMasterKey> decryptingStream =
crypto.createDecryptingStream(masterKey, in);
// Does it contain the expected encryption context?
if
(!"FileStreaming".equals(decryptingStream.getCryptoResult().getEncryptionContext().get("Example")))
{
throw new IllegalStateException("Bad encryption context");
}
16
AWS Encryption SDK Developer Guide
Example Code
out.close();
}
/**
* In practice, this key would be saved in a secure location.
* For this demo, we generate a new random key for each operation.
*/
private static SecretKey retrieveEncryptionKey() {
SecureRandom rnd = new SecureRandom();
byte[] rawKey = new byte[16]; // 128 bits
rnd.nextBytes(rawKey);
return new SecretKeySpec(rawKey, "AES");
}
}
/*
* Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
*
* Licensed under the Apache License, Version 2.0 (the "License"). You may not use this
file except
* in compliance with the License. A copy of the License is located at
*
* http://aws.amazon.com/apache2.0
*
* or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
* specific language governing permissions and limitations under the License.
*/
package com.amazonaws.crypto.examples;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.security.GeneralSecurityException;
import java.security.KeyPair;
import java.security.KeyPairGenerator;
import java.security.PrivateKey;
import java.security.PublicKey;
import com.amazonaws.encryptionsdk.AwsCrypto;
import com.amazonaws.encryptionsdk.CryptoOutputStream;
import com.amazonaws.encryptionsdk.MasterKeyProvider;
import com.amazonaws.encryptionsdk.jce.JceMasterKey;
import com.amazonaws.encryptionsdk.kms.KmsMasterKeyProvider;
import com.amazonaws.encryptionsdk.multi.MultipleProviderFactory;
import com.amazonaws.util.IOUtils;
/**
* <p>
* Encrypts a file using both KMS and an asymmetric key pair.
*
* <p>
* Arguments:
* <ol>
17
AWS Encryption SDK Developer Guide
Example Code
* <li>Key ARN: For help finding the Amazon Resource Name (ARN) of your KMS customer
master
* key (CMK), see 'Viewing Keys' at http://docs.aws.amazon.com/kms/latest/
developerguide/viewing-keys.html
* <li>Name of file containing plaintext data to encrypt
* </ol>
*
* You might use AWS Key Management Service (KMS) for most encryption and decryption
operations, but
* still want the option of decrypting your data offline independently of KMS. This sample
* demonstrates one way to do this.
*
* The sample encrypts data under both a KMS customer master key (CMK) and an "escrowed"
RSA key pair
* so that either key alone can decrypt it. You might commonly use the KMS CMK for
decryption. However,
* at any time, you can use the private RSA key to decrypt the ciphertext independent of
KMS.
*
* This sample uses the JCEMasterKey class to generate a RSA public-private key pair
* and saves the key pair in memory. In practice, you would store the private key in a
secure offline
* location, such as an offline HSM, and distribute the public key to your development
team.
*
*/
public class EscrowedEncryptExample {
private static PublicKey publicEscrowKey;
private static PrivateKey privateEscrowKey;
standardEncrypt(kmsArn, fileName);
standardDecrypt(kmsArn, fileName);
escrowDecrypt(fileName);
}
private static void standardEncrypt(final String kmsArn, final String fileName) throws
Exception {
// Encrypt with the KMS CMK and the escrowed public key
// 1. Instantiate the SDK
final AwsCrypto crypto = new AwsCrypto();
18
AWS Encryption SDK Developer Guide
Example Code
IOUtils.copy(in, encryptingStream);
in.close();
encryptingStream.close();
}
private static void standardDecrypt(final String kmsArn, final String fileName) throws
Exception {
// Decrypt with the KMS CMK and the escrow public key. You can use a combined
provider,
// as shown here, or just the KMS master key provider.
19
AWS Encryption SDK Developer Guide
Python
// use an encryption context. For an example, see the other SDK samples.
final FileInputStream in = new FileInputStream(fileName + ".encrypted");
final FileOutputStream out = new FileOutputStream(fileName + ".deescrowed");
final CryptoOutputStream<?> decryptingStream =
crypto.createDecryptingStream(escrowPriv, out);
IOUtils.copy(in, decryptingStream);
in.close();
decryptingStream.close();
}
}
Topics
• Prerequisites (p. 20)
• Installation (p. 20)
• AWS Encryption SDK for Python Example Code (p. 21)
Prerequisites
Before you install the AWS Encryption SDK for Python, be sure you have the following prerequisites.
To use this SDK, you need Python 2.7, or Python 3.3 or later. To download Python, see Python
downloads.
The pip installation tool for Python
If you have Python 2.7.9 or later, or Python 3.4 or later, you already have pip, although you might
want to upgrade it. For more information about upgrading or installing pip, see Installation in the
pip documentation.
Installation
Use pip to install the AWS Encryption SDK for Python, as shown in the following examples.
20
AWS Encryption SDK Developer Guide
Example Code
For more details about using pip to install and upgrade packages, see Installing Packages.
The SDK requires the cryptography library on all platforms. All versions of pip install and build the
cryptography library on Windows. pip 8.1 and later installs and builds cryptography on Linux. If you
are using an earlier version of pip and your Linux environment doesn't have the tools needed to build
the cryptography library, you need to install them. For more information, see Building cryptography on
Linux.
For the latest development version of this SDK, go to the aws-encryption-sdk-python GitHub repository.
After you install the SDK, get started by looking at the example Python code (p. 21) in this guide.
Topics
• Encrypting and Decrypting Strings (p. 21)
• Encrypting and Decrypting Byte Streams (p. 22)
• Encrypting and Decrypting Byte Streams with Multiple Master Key Providers (p. 24)
"""
Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License"). You may not use this file
except
in compliance with the License. A copy of the License is located at
https://aws.amazon.com/apache-2-0/
or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
specific language governing permissions and limitations under the License.
"""
import aws_encryption_sdk
:param str key_arn: Amazon Resource Name (ARN) of the KMS CMK
(http://docs.aws.amazon.com/kms/latest/developerguide/viewing-keys.html)
:param bytes source_plaintext: Data to encrypt
:param botocore_session: Existing Botocore session instance
:type botocore_session: botocore.session.Session
"""
21
AWS Encryption SDK Developer Guide
Example Code
# Verify that the "cycled" (encrypted, then decrypted) plaintext is identical to the
source
# plaintext
assert cycled_plaintext == source_plaintext
# Verify that the encryption context used in the decrypt operation includes all key
pairs from
# the encrypt operation. (The SDK can add pairs, so don't require an exact match.)
#
# In production, always use a meaningful encryption context. In this sample, we omit
the
# encryption context (no key pairs).
assert all(
pair in decrypted_header.encryption_context.items()
for pair in encryptor_header.encryption_context.items()
)
"""
Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License"). You may not use this file
except
in compliance with the License. A copy of the License is located at
https://aws.amazon.com/apache-2-0/
or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
specific language governing permissions and limitations under the License.
"""
import filecmp
import os
import aws_encryption_sdk
22
AWS Encryption SDK Developer Guide
Example Code
class StaticRandomMasterKeyProvider(RawMasterKeyProvider):
"""Randomly and consistently generates 256-bit keys for each unique key ID."""
provider_id = 'static-random'
def cycle_file(source_plaintext_filename):
"""Encrypts and then decrypts a file under a custom static master key provider.
23
AWS Encryption SDK Developer Guide
Example Code
# Verify that the "cycled" (encrypted, then decrypted) plaintext is identical to the
source
# plaintext
assert filecmp.cmp(source_plaintext_filename, cycled_plaintext_filename)
# Verify that the encryption context used in the decrypt operation includes all key
pairs from
# the encrypt operation
#
# In production, always use a meaningful encryption context. In this sample, we omit
the
# encryption context (no key pairs).
assert all(
pair in decryptor.header.encryption_context.items()
for pair in encryptor.header.encryption_context.items()
)
return ciphertext_filename, cycled_plaintext_filename
"""
Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License"). You may not use this file
except
in compliance with the License. A copy of the License is located at
https://aws.amazon.com/apache-2-0/
or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
specific language governing permissions and limitations under the License.
"""
import filecmp
import os
import aws_encryption_sdk
from aws_encryption_sdk.internal.crypto import WrappingKey
from aws_encryption_sdk.key_providers.raw import RawMasterKeyProvider
from aws_encryption_sdk.identifiers import WrappingAlgorithm, EncryptionKeyType
from cryptography.hazmat.backends import default_backend
from cryptography.hazmat.primitives import serialization
from cryptography.hazmat.primitives.asymmetric import rsa
class StaticRandomMasterKeyProvider(RawMasterKeyProvider):
provider_id = 'static-random'
24
AWS Encryption SDK Developer Guide
Example Code
:param str key_arn: Amazon Resource Name (ARN) of the KMS Customer Master Key (CMK)
(http://docs.aws.amazon.com/kms/latest/developerguide/viewing-keys.html)
:param str source_plaintext_filename: Filename of file to encrypt
:param botocore_session: existing botocore session instance
:type botocore_session: botocore.session.Session
"""
# Create a master key provider that includes the KMS and static master key providers
kms_master_key_provider.add_master_key_provider(static_master_key_provider)
25
AWS Encryption SDK Developer Guide
Command Line Interface
# Verify that the "cycled" (encrypted, then decrypted) plaintext is identical to the
source
# plaintext
assert filecmp.cmp(source_plaintext_filename, cycled_kms_plaintext_filename)
assert filecmp.cmp(source_plaintext_filename, cycled_static_plaintext_filename)
# Verify that the encryption context in the decrypt operation includes all key pairs
from the
# encrypt operation.
#
# In production, always use a meaningful encryption context. In this sample, we omit
the
# encryption context (no key pairs).
assert all(
pair in kms_decryptor.header.encryption_context.items()
for pair in encryptor.header.encryption_context.items()
)
assert all(
pair in static_decryptor.header.encryption_context.items()
for pair in encryptor.header.encryption_context.items()
)
return ciphertext_filename, cycled_kms_plaintext_filename,
cycled_static_plaintext_filename
Like all implementations of the AWS Encryption SDK, the AWS Encryption CLI offers advanced data
protection features. These include envelope encryption, additional authenticated data (AAD), and secure,
authenticated, symmetric key algorithm suites, such as 256-bit AES-GCM with key derivation and signing.
The AWS Encryption CLI is built on the AWS Encryption SDK for Python and is supported on Linux,
macOS, and Windows. You can run commands and scripts to encrypt and decrypt your data in your
26
AWS Encryption SDK Developer Guide
Installing the CLI
preferred shell on Linux or macOS, in a Command Prompt window (cmd.exe) on Windows, and in a
PowerShell console on any system.
All language-specific implementations of the AWS Encryption SDK, including the AWS Encryption CLI,
are interoperable. For example, you can encrypt data with the AWS Encryption SDK for Java (p. 12)
and decrypt it with the AWS Encryption CLI.
This topic introduces the AWS Encryption CLI, explains how to install and use it, and provides several
examples to help you get started. For a quick start, see How to Encrypt and Decrypt Your Data with the
AWS Encryption CLI in the AWS Security Blog. For more detailed information, see Read The Docs, and
join us in developing the AWS Encryption CLI in the aws-encryption-sdk-cli repository on GitHub.
Topics
• Installing the AWS Encryption SDK Command Line Interface (p. 27)
• How to Use the AWS Encryption SDK Command Line Interface (p. 29)
• Examples of the AWS Encryption SDK Command Line Interface (p. 36)
• AWS Encryption SDK CLI Syntax and Parameter Reference (p. 49)
Topics
• Installing the Prerequisites (p. 27)
• Installing the AWS Encryption CLI (p. 28)
Before you install the AWS Encryption CLI, be sure that you have the following prerequisites.
Python
The AWS Encryption CLI requires Python 2.7, or Python 3.4 or later. Python is included in most
Linux and macOS installations, although you might need to upgrade to one of the required versions.
However, you have to install Python on Windows, if it is not already installed. To download Python,
see Python downloads.
python
python -V
27
AWS Encryption SDK Developer Guide
Installing the CLI
On Windows, you need to install Python. Then, add the path to the Python.exe file to the value of
the Path environment variable.
By default, Python is installed in the all users directory or in a user profile directory ($home or
%userprofile%) in the AppData\Local\Programs\Python subdirectory. To find the location
of the Python.exe file on your system, check one of the following registry keys. You can use
PowerShell to search the registry.
pip
pip is the Python package manager. To install the AWS Encryption CLI and its dependencies, you
need pip 8.1 or later.
For help installing or upgrading pip, see Installation in the pip documentation.
AWS Command Line Interface
The AWS Command Line Interface (AWS CLI) is required only if you are using AWS Key Management
Service (AWS KMS) customer master keys (CMKs) with the AWS Encryption CLI. If you are using a
different master key provider (p. 7), the AWS CLI is not required.
To use AWS KMS CMKs with the AWS Encryption CLI, you need to install and configure the AWS CLI.
The configuration makes the credentials that you use to authenticate to AWS KMS available to the
AWS Encryption CLI.
The AWS Encryption CLI requires the cryptography library on all platforms. All versions of pip install and
build the cryptography library on Windows and OS X.
On Linux, pip 8.1 and later installs and builds the cryptography library. If you are using an earlier version
of pip and your Linux environment doesn't have the tools needed to build the cryptography library, you
must install them. For more information, see Building cryptography on Linux.
To find the version number of your AWS Encryption CLI and AWS Encryption SDK
aws-encryption-cli --version
aws-encryption-sdk-cli/1.1.0 aws-encryption-sdk/1.3.2
To install the version of the AWS Encryption CLI currently in development, see the aws-encryption-sdk-cli
repository on GitHub.
28
AWS Encryption SDK Developer Guide
How to Use the CLI
For more details about using pip to install and upgrade Python packages, see the pip documentation.
Topics
• How to Encrypt and Decrypt Data (p. 29)
• How to Specify a Master Key (p. 30)
• How to Provide Input (p. 32)
• How to Specify the Output Location (p. 32)
• How to Use an Encryption Context (p. 33)
• How to Store Parameters in a Configuration File (p. 34)
• When you encrypt data in the AWS Encryption CLI, you specify your plaintext data and a master
key (p. 6), such as an AWS Key Management Service (AWS KMS) customer master key (CMK). If you are
using a custom master key provider, you need to specify the provider. You also specify output locations
for the encrypted message (p. 8) and for metadata about the encryption operation. An encryption
context (p. 8) is optional, but recommended.
The AWS Encryption CLI gets a unique data key from the master key and encrypts your data. It returns
an encrypted message (p. 8) and metadata about the operation. The encrypted message contains your
encrypted data (ciphertext) and an encrypted copy of the data key. You don't have to worry about
storing, managing, or losing the data key.
• When you decrypt data, you pass in your encrypted message, the optional encryption context, and
location for the plaintext output and the metadata. If you are using a custom master key provider, you
also supply the master key. If you are using an AWS KMS CMK, AWS KMS derives the master key from
the encrypted message.
The AWS Encryption CLI uses the master key to decrypt the data key in the encrypted message. Then
it uses the data key to decrypt your data. It returns your plaintext data and metadata about the
operation.
29
AWS Encryption SDK Developer Guide
How to Use the CLI
To specify a master key, use the --master-keys parameter (-m). Its value is a collection of
attributes (p. 30) with the attribute=value format. The attributes that you use depend on the
master key provider and the command.
• AWS KMS. In encrypt commands, you must specify a --master-keys parameter with a key attribute.
The other attributes are optional. In decrypt commands, the --master-keys parameter is optional
and it can only have a profile attribute.
• Custom master key provider. You must specify the --master-keys parameter in every command.
The parameter value must have key and provider attributes.
You can include multiple --master-keys parameters (p. 31) in the same command.
If an attribute name or value includes spaces or special characters, enclose both the name and value in
quotation marks. For example, --master-keys key=12345 "provider=my cool provider".
Use the key attribute to identify a master key. The value can be any key identifier that the master
key provider recognizes.
--master-keys key=1234abcd-12ab-34cd-56ef-1234567890ab
In an encrypt command, each --master-keys parameter value must include at least one key
attribute and value. You can use multiple key attributes (p. 31) in each --master-keys
parameter value.
In encrypt commands that use AWS KMS CMKs, the value of key can be the CMK ID, its Amazon
Resource Name (ARN), an alias name, or alias ARN. For example, this encrypt command uses an alias
ARN in the value of the key attribute.
In decrypt commands that use a custom master key provider, key and provider attributes are
required. The key attribute is not permitted in decrypt commands that use an AWS KMS CMK.
The provider attribute identifies the master key provider (p. 7). The default value is aws-kms, which
represents AWS KMS. If you are using a different master key provider, the provider attribute is
required.
30
AWS Encryption SDK Developer Guide
How to Use the CLI
For more information about using custom (non-AWS KMS) master key providers, see the Advanced
Configuration topic in the README file for the AWS Encryption SDK CLI repository.
Region: Specify an AWS Region
Use the region attribute to specify the AWS Region of an AWS KMS CMK. This attribute is valid only
in encrypt commands and only when the master key provider is AWS KMS.
AWS Encryption CLI commands use the AWS Region that is specified in the key attribute value if it
includes a region, such as an ARN. if the key value specifies a AWS Region, the region attribute is
ignored.
The region attribute takes precedence over other region specifications. If you don't use a region
attribute, AWS Encryption CLI commands uses the AWS Region specified in your AWS CLI named
profile, if any, or your default profile.
Profile: Specify a Named Profile
Use the profile attribute to specify an AWS CLI named profile. Named profiles can include
credentials and an AWS Region. This attribute is valid only when the master key provider is AWS
KMS.
You can use the profile attribute to specify alternate credentials in encrypt and decrypt commands.
In an encrypt command, the AWS Encryption CLI uses the AWS Region in the named profile
only when the key value does not include a region and there is no region attribute. In a decrypt
command, the AWS Region in the name profile is ignored.
If you specify more than one master key, the first master key generates (and encrypts) the data key that
is used to encrypt your data. The other master keys only encrypt the data key. The resulting encrypted
message (p. 8) contains the encrypted data ("ciphertext") and a collection of encrypted data keys, one
encrypted by each master key. Any of the master keys can decrypt one data key and then decrypt the
data.
$cmk_oregon=arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
$cmk_ohio=arn:aws:kms:us-east-2:111122223333:key/0987ab65-43cd-21ef-09ab-87654321cdef
• Include multiple --master-keys parameters in the same command. Use this syntax when the
attribute values that you specify do not apply to all of the master keys in the command.
31
AWS Encryption SDK Developer Guide
How to Use the CLI
The --input parameter (-i) , which tells the AWS Encryption CLI where to find the input, is required in
all AWS Encryption CLI commands.
• Use a file.
--input myData.txt
--input testdir/*.xml
• Use a directory or directory name pattern. When the input is a directory, the --recursive parameter
(-r, -R) is required.
• Pipe input to the command (stdin). Use a value of - for the --input parameter. (The --input
parameter is always required.)
If an output file already exists, by default, the AWS Encryption CLI prints a warning, then overwrites the
file. To prevent overwriting, use the --interactive parameter, which prompts you for confirmation
before overwriting, or --no-overwrite, which skips the input if the output would cause an overwrite.
To suppress the overwrite warning, use --quiet. To capture errors and warnings from the AWS
Encryption CLI, use the 2>&1 redirection operator to write them to the output stream.
Note
Commands that overwrite output files begin by deleting the output file. If the command fails,
the output file might already be deleted.
• Specify a file name. If you specify a path to the file, all directories in the path must exist before the
command runs.
--output myEncryptedData.txt
• Specify a directory. The output directory must exist before the command runs.
32
AWS Encryption SDK Developer Guide
How to Use the CLI
If the input contains subdirectories, the command reproduces the subdirectories under the specified
directory.
--output Test
When the output location is a directory (without file names), the AWS Encryption CLI creates output
file names based on the input file names plus a suffix. Encrypt operations append .encrypted to
the input file name and the decrypt operations append .decrypted. To change the suffix, use the --
suffix parameter.
For example, if you encrypt file.txt, the encrypt command creates file.txt.encrypted. If you
decrypt file.txt.encrypted, the decrypt command creates file.txt.encrypted.decrypted.
• Write to the command line (stdout). Enter a value of - for the --output parameter. You can use --
output - to pipe output to another command or program.
--output -
An encryption context is a type of arbitrary, non-secret additional authenticated data. In the AWS
Encryption CLI, the encryption context consists of a collection of name=value pairs. You can use any
content in the pairs, including information about the files, data that helps you to find the encryption
operation in logs, or data that your grants or policies require.
In an Encrypt Command
The encryption context that you specify in an encrypt command, along with any additional encryption
context that the encryption components add, is cryptographically bound to the encrypted data. It is also
included (in plaintext) in the encrypted message (p. 8) that the command returns. If you are using an
AWS KMS customer master key (CMK), the encryption context also might appear in plaintext in audit
records and logs, such as AWS CloudTrail.
The following example shows a encryption context with three name=value pairs.
In a Decrypt Command
In a decrypt command, the encryption context helps you to confirm that you are decrypting the right
encrypted message.
You are not required to provide an encryption context in a decrypt command, even if an encryption
context was used on encrypt. However, if you do, the AWS Encryption CLI verifies that every element in
the encryption context of the decrypt command matches an element in the encryption context of the
encrypted message. If any element does not match, the decrypt command fails.
For example, the following command decrypts the encrypted message only if its encryption context
includes dept=IT.
33
AWS Encryption SDK Developer Guide
How to Use the CLI
An encryption context is an important part of your security strategy. However, when choosing an
encryption context, remember that its values are not secret. Do not include any confidential data in the
encryption context.
• In an encrypt command, use the --encryption-context parameter with one or more name=value
pairs. Use a space to separate each pair.
• In a decrypt command, the --encryption-context parameter value can include name=value pairs,
name elements (with no values), or a combination of both.
If the name or value in a name=value pair includes spaces or special characters, enclose the entire pair
in quotation marks.
For example, this encrypt command includes an encryption context with two pairs, purpose=test and
dept=23.
These decrypt command would succeed. The encryption context in each commands is a subset of the
original encryption context.
However, these decrypt commands would fail. The encryption context in the encrypted message does
not contain the specified elements.
A configuration file is a text file that contains parameters and values for an AWS Encryption CLI
command. When you refer to a configuration file in a AWS Encryption CLI command, the reference is
34
AWS Encryption SDK Developer Guide
How to Use the CLI
replaced by the parameters and values in the configuration file. The effect is the same is if you typed the
file content at the command line. A configuration file can have any name and it can be located in any
directory that the current user can access.
The following example configuration file, cmk.conf, specifies two AWS KMS CMKs in different regions.
--master-keys key=arn:aws:kms:us-
west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
--master-keys key=arn:aws:kms:us-
east-2:111122223333:key/0987ab65-43cd-21ef-09ab-87654321cdef
To use the configuration file in a command, prefix the file name with an at sign (@). In a PowerShell
console, use a backtick character to escape the at sign (`@).
Bash
PowerShell
• You can include multiple parameters in each configuration file and list them in any order. List each
parameter with its values (if any) on a separate line.
• Use # to add a comment to all or part of a line.
• You can include references to other configuration files. Do not use a backtick to escape the @ sign, even
in PowerShell.
• If you use quotes in a configuration file, the quoted text cannot span multiple lines.
# Archive Files
--encrypt
--output /archive/logs
--recursive
--interactive
--encryption-context class=unclassified dept=IT
--suffix # No suffix
--metadata-output ~/metadata
@caching.conf # Use limited caching
You can also include multiple configuration files in a command. This example command uses both the
encrypt.conf and master-keys.conf configurations files.
Bash
35
AWS Encryption SDK Developer Guide
Examples
PowerShell
Topics
• Encrypting a File (p. 36)
• Decrypting a File (p. 38)
• Encrypting All Files in a Directory (p. 39)
• Decrypting All Files in a Directory (p. 40)
• Encrypting and Decrypting on the Command Line (p. 42)
• Using Multiple Master Keys (p. 43)
• Encrypting and Decrypting in Scripts (p. 45)
• Using Data Key Caching (p. 47)
Encrypting a File
This example uses the AWS Encryption CLI to encrypt the contents of the hello.txt file, which
contains a "Hello World" string.
When you run an encrypt command on a file, the AWS Encryption CLI gets the contents of the file,
generates a unique data key (p. 6), encrypts the file contents under the data key, and then writes the
encrypted message (p. 8) to a new file.
The first command saves the Amazon Resource Name (ARN) of an AWS KMS customer master key (CMK)
in the $cmkArn variable.
The second command encrypts the file contents. The command uses the --encrypt parameter to
specify the operation and the --input parameter to indicate the file to encrypt. The --master-keys
parameter (p. 30), and its required key attribute, tell the command to use the master key represented
by the CMK ARN.
The command uses the --metadata-output parameter to specify a text file for the metadata
about the encryption operation. As a best practice, the command uses the --encryption-context
parameter to specify an encryption context (p. 33).
The value of the --output parameter, a dot (.), tells the command to write the output file to the current
directory.
Bash
\\ To run this example, replace the fictitious CMK ARN with a valid value.
36
AWS Encryption SDK Developer Guide
Examples
$ cmkArn=arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
$ aws-encryption-cli --encrypt \
--input hello.txt \
--master-keys key=$cmkArn \
--metadata-output ~/metadata \
--encryption-context purpose=test \
--output .
PowerShell
# To run this example, replace the fictitious CMK ARN with a valid value.
PS C:\> $CmkArn = arn:aws:kms:us-
west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
When the encrypt command succeeds, it does not return any output. To determine whether the
command succeeded, check the Boolean value in the $? variable. When the command succeeds, the
value of $? is 0 (Bash) or True (PowerShell). When the command fails, the value of $? is non-zero (Bash)
or False (PowerShell).
Bash
$ echo $?
0
PowerShell
PS C:\> $?
True
You can also use a directory listing command to see that the encrypt command created a new file,
hello.txt.encrypted. Because the encrypt command did not specify a file name for the output, the
AWS Encryption CLI wrote the output to a file with the same name as the input file plus a .encrypted
suffix. To use a different suffix, or suppress the suffix, use the --suffix parameter.
The hello.txt.encrypted file contains an encrypted message (p. 8) that includes the ciphertext
of the hello.txt file, an encrypted copy of the data key, and additional metadata, including the
encryption context.
Bash
$ ls
hello.txt hello.txt.encrypted
PowerShell
PS C:\> dir
37
AWS Encryption SDK Developer Guide
Examples
Directory: C:\TestCLI
Decrypting a File
This example uses the AWS Encryption CLI to decrypt the contents of the Hello.txt.encrypted file
that was encrypted in the previous example.
The decrypt command uses the --decrypt parameter to indicate the operation and --input
parameter to identify the file to decrypt. The value of the --output parameter is a dot that represents
the current directory.
This command does not have a --master-keys parameter. A --master-keys parameter is required in
decrypt commands only when you are using a custom master key provider. If you are using an AWS KMS
CMK, you cannot specify a master key, because AWS KMS derives it from the encrypted message.
The --encryption-context parameter is optional in the decrypt command, even when an encryption
context (p. 33) is provided in the encrypt command. In this case, the decrypt command uses the same
encryption context that was provided in the encrypt command. Before decrypting, the AWS Encryption
CLI verifies that the encryption context in the encrypted message includes a purpose=test pair. If it
does not, the decrypt command fails.
The --metadata-output parameter specifies a file for metadata about the decryption operation. The
value of the --output parameter, a dot (.), writes the output file to the current directory.
Bash
$ aws-encryption-cli --decrypt \
--input hello.txt.encrypted \
--encryption-context purpose=test \
--metadata-output ~/metadata \
--output .
PowerShell
When a decrypt command succeeds, it does not return any output. To determine whether the command
succeeded, get the value of the $? variable. You can also use a directory listing command to see that the
command created a new file with a .decrypted suffix. To see the plaintext content, use a command to
get the file content, such as cat or Get-Content.
Bash
$ ls
hello.txt hello.txt.encrypted hello.txt.encrypted.decrypted
$ cat hello.txt.encrypted.decrypted
38
AWS Encryption SDK Developer Guide
Examples
Hello World
PowerShell
PS C:\> dir
Directory: C:\TestCLI
When a command affects multiple files, the AWS Encryption CLI processes each file individually. It gets
the file contents, gets a unique data key (p. 6) for the file from the master key, encrypts the file contents
under the data key, and writes the results to a new file in the output directory. As a result, you can
decrypt the output files independently.
This listing of the TestDir directory shows the plaintext files that we want to encrypt.
Bash
$ ls testdir
cool-new-thing.py hello.txt employees.csv
PowerShell
Directory: C:\TestDir
The first command saves the Amazon Resource Name (ARN) of an AWS KMS customer master key (CMK)
in the $cmkArn variable.
The second command encrypts the content of files in the TestDir directory and writes the files of
encrypted content to the TestEnc directory. If the TestEnc directory doesn't exist, the command fails.
Because the input location is a directory, the --recursive parameter is required.
The --master-keys parameter (p. 30), and its required key attribute, specify the master key. The
encrypt command includes an encryption context (p. 33), dept=IT. When you specify an encryption
context in a command that encrypts multiple files, the same encryption context is used for all of the
files.
39
AWS Encryption SDK Developer Guide
Examples
The command also has a --metadata-output parameter to tell the AWS Encryption CLI where to write
the metadata about the encryption operations. The AWS Encryption CLI writes one metadata record for
each file that was encrypted.
When the command completes, the AWS Encryption CLI writes the encrypted files to the TestEnc
directory, but it does not return any output.
The final command lists the files in the TestEnc directory. There is one output file of encrypted content
for each input file of plaintext content. Because the command did not specify an alternate suffix, the
encrypt command appended .encrypted to each of the input file names.
Bash
# To run this example, replace the fictitious CMK ARN with a valid master key
identifier.
$ cmkArn=arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
$ aws-encryption-cli --encrypt \
--input testdir --recursive\
--master-keys key=$cmkArn \
--encryption-context dept=IT \
--metadata-output ~/metadata \
--output testenc
$ ls testenc
cool-new-thing.py.encrypted employees.csv.encrypted hello.txt.encrypted
PowerShell
# To run this example, replace the fictitious CMK ARN with a valid master key
identifier.
PS C:\> $cmkArn = arn:aws:kms:us-
west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
Directory: C:\TestEnc
Bash
$ ls testenc
40
AWS Encryption SDK Developer Guide
Examples
PowerShell
Directory: C:\TestEnc
This decrypt command decrypts all of the files in the TestEnc directory and writes the plaintext files to
the TestDec directory. Because the encrypted files were encrypted under an AWS KMS CMK, there is no
--master-keys parameter in the command. The command uses the --interactive parameter to tell
the AWS Encryption CLI to prompt you before overwriting a file with the same name.
This command also uses the encryption context that was provided when the files were encrypted. When
decrypting multiple files, the AWS Encryption CLI checks the encryption context of every file. If the
encryption context check on any file fails, the AWS Encryption CLI rejects the file, writes a warning,
records the failure in the metadata, and then continues checking the remaining files. If the AWS
Encryption CLI fails to decrypt a file for any other reason, the entire decrypt command fails immediately.
In this example, the encrypted messages in all of the input files contain the dept=IT encryption context
element. However, if you were decrypting messages with different encryption contexts, you might still
be able to verify part of the encryption context. For example, if some messages had an encryption
context of dept=finance and others had dept=IT, you could verify that the encryption context always
contains a dept name without specifying the value. If you wanted to be more specific, you could decrypt
the files in separate commands.
The decrypt command does not return any output, but you can use a directory listing command to see
that it created new files with the .decrypted suffix. To see the plaintext content, use a command to get
the file content.
Bash
$ ls testdec
cool-new-thing.py.encrypted.decrypted hello.txt.encrypted.decrypted
employees.csv.encrypted.decrypted
PowerShell
41
AWS Encryption SDK Developer Guide
Examples
This example pipes a plaintext string to an encrypt command and saves the encrypted message in a
variable. Then, it pipes the encrypted message in the variable to a decrypt command, which writes its
output to the pipeline (stdout).
• The first command saves the Amazon Resource Name (ARN) of an AWS KMS customer master key
(CMK) in the $cmkArn variable.
Bash
$ cmkArn=arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
PowerShell
• The second command pipes the Hello World string to the encrypt command and saves the result in
the $encrypted variable.
The --input and --output parameters are required in all AWS Encryption CLI commands. To
indicate that input is being piped to the command (stdin), use a hyphen (-) for the value of the --
input parameter. To send the output to the command line (stdout), use a hyphen for the value of the
--output parameter.
The --encode parameter Base64-encodes the output before returning it. This prevents the shell from
misinterpreting the non-ASCII characters in the encrypted message.
Because this command is just a proof of concept, we omit the encryption context and suppress the
metadata (-S).
Bash
PowerShell
42
AWS Encryption SDK Developer Guide
Examples
• The third command pipes the encrypted message in the $encrypted variable to the decrypt
command.
This decrypt command uses --input - to indicate that input is coming from the pipeline (stdin) and
--output - to send the output to the pipeline (stdout). (The input parameter takes the location of
the input, not the actual input bytes, so you cannot use the $encrypted variable as the value of the
--input parameter.)
Because the output was encrypted and then encoded, the decrypt command uses the --decode
parameter to decode Base64-encoded input before decrypting it. You can also use the --decode
parameter to decode Base64-encoded input before encrypting it.
Again, the command omits the encryption context and suppresses the metadata (-S).
Bash
PowerShell
You can also perform the encrypt and decrypt operations in a single command without the intervening
variable.
As in the previous example, the --input and --output parameters have a - value and the command
uses the --encode parameter to encode the output and the --decode parameter to decode the input.
Bash
$ cmkArn=arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
PowerShell
43
AWS Encryption SDK Developer Guide
Examples
When you use multiple master keys to encrypt data, any one of the master keys can be used to
decrypt the data. This strategy assures that you can decrypt the data even if one of the master keys is
unavailable. If you are storing the encrypted data in multiple AWS Regions, this strategy lets you use a
master key in the same Region to decrypt the data.
When you encrypt with multiple master keys, the first master key plays a special role. It generates the
data key that is used to encrypt the data. The remaining master keys encrypt the plaintext data key. The
resulting encrypted message (p. 8) includes the encrypted data and a collection of encrypted data keys,
one for each master key. Although the first master key generated the data key, any of the master keys
can decrypt one of the data keys, which can be used to decrypt the data.
This example command uses three master keys to encrypt the Finance.log file, one in each of three
AWS Regions.
It writes the encrypted message to the Archive directory. The command uses the --suffix parameter
with no value to suppress the suffix, so the input and output files names will be the same.
The command uses the --master-keys parameter with three key attributes. You can also use multiple
--master-keys parameters in the same command.
To encrypt the log file, the AWS Encryption CLI asks the first master key in the list, $cmk1, to generate
the data key that it uses to encrypt the data. Then, it uses each of the other master keys to encrypt
the plaintext copy of the data key. The encrypted message in the output file includes all three of the
encrypted data keys.
Bash
$ cmk1=arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
$ cmk2=arn:aws:kms:us-east-2:111122223333:key/0987ab65-43cd-21ef-09ab-87654321cdef
$ cmk3=arn:aws:kms:ap-southeast-1:111122223333:key/1a2b3c4d-5e6f-1a2b-3c4d-5e6f1a2b3c4d
PowerShell
This command decrypts the encrypted copy of the Finance.log file and writes it to a
Finance.log.clear file in the Finance directory.
When you decrypt data that was encrypted under AWS KMS CMKs, you cannot tell AWS KMS to use a
particular CMK to decrypt the data. The key attribute of the --master-keys parameter is not valid in
a decrypt command with the aws-kms provider. The AWS Encryption CLI can use any of the CMKs that
44
AWS Encryption SDK Developer Guide
Examples
were used to encrypt the data, provided that the AWS credentials you are using have permission to call
the Decrypt API on the master key. For more information, see Authentication and Access Control for
AWS KMS.
Bash
PowerShell
In this example, the script gets a collection of log files, compresses them, encrypts them, and then copies
the encrypted files to an Amazon S3 bucket. This script processes each file separately, so that you can
decrypt and expand them independently.
When you compress and encrypt files, be sure to compress before you encrypt. Properly encrypted data
is not compressible.
Warning
Be careful when compressing data that includes both secrets and data that might be controlled
by a malicious actor. The final size of the compressed data might inadvertently reveal sensitive
information about its contents.
You can find the complete scripts in the Examples directory of the aws-encryption-sdk-cli repository in
GitHub.
PowerShell
[Parameter()]
[Switch]
$Recurse,
[Parameter(Mandatory=$true)]
[String]
$masterKeyID,
[Parameter()]
[String]
$masterKeyProvider = 'aws-kms',
45
AWS Encryption SDK Developer Guide
Examples
[Parameter(Mandatory)]
[ValidateScript({Test-Path $_})]
[String]
$ZipDirectory,
[Parameter(Mandatory)]
[ValidateScript({Test-Path $_})]
[String]
$EncryptDirectory,
[Parameter()]
[String]
$EncryptionContext,
[Parameter(Mandatory)]
[ValidateScript({Test-Path $_})]
[String]
$MetadataDirectory,
[Parameter(Mandatory)]
[ValidateScript({Test-S3Bucket -BucketName $_})]
[String]
$S3Bucket,
[Parameter()]
[String]
$S3BucketFolder
)
BEGIN {}
PROCESS {
if ($files = dir $FilePath -Recurse:$Recurse)
{
# Step 1: Compress
foreach ($file in $files)
{
$fileName = $file.Name
try
{
Microsoft.PowerShell.Archive\Compress-Archive -Path $file.FullName -
DestinationPath $ZipDirectory\$filename.zip
}
catch
{
Write-Error "Zip failed on $file.FullName"
}
# Step 2: Encrypt
if (-not (Test-Path "$ZipDirectory\$filename.zip"))
{
Write-Error "Cannot find zipped file: $ZipDirectory\$filename.zip"
}
else
{
# 2>&1 captures command output
$err = (aws-encryption-cli -e -i "$ZipDirectory\$filename.zip" `
-o $EncryptDirectory `
-m key=$masterKeyID provider=
$masterKeyProvider `
-c $EncryptionContext `
--metadata-output $MetadataDirectory `
-v) 2>&1
46
AWS Encryption SDK Developer Guide
Examples
}
else
{
Write-S3Object -BucketName $S3Bucket -File "$EncryptDirectory\
$fileName.zip.encrypted"
}
}
}
}
}
}
By default, the AWS Encryption CLI (and other versions of the AWS Encryption SDK) generates a
unique data key for each file that it encrypts. Although using a unique data key for each operation is
a cryptographic best practice, limited reuse of data keys is acceptable for some situations. If you are
considering data key caching, consult with a security engineer to understand the security requirements
of your application and determine security thresholds that are right for you.
In this example, data key caching speeds up the encryption operation by reducing the frequency of
requests to the master key provider.
The command in this example encrypts a large directory with multiple subdirectories that contain a total
of approximately 800 small log files. The first command saves the ARN of the CMK in a cmkARN variable.
The second command encrypts all of the files in the input directory (recursively) and writes them to an
archive directory. The command uses the --suffix parameter to specify the .archive suffix.
The --caching parameter enables data key caching. The capacity attribute, which limits the number
of data keys in the cache, is set to 1, because serial file processing never uses more than one data key at
a time. The max_age attribute, which determines how long the cached data key can used, is set to 10
seconds.
The optional max_messages_encrypted attribute is set to 10 messages, so a single data key is never
used to encrypt more than 10 files. Limiting the number of files encrypted by each data key reduces the
number of files that would be affected in the unlikely event that a data key was compromised.
To run this command on log files that your operating system generates, you might need administrator
permissions (sudo in Linux; Run as Administrator in Windows).
Bash
$ cmkArn=arn:aws:kms:us-west-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
47
AWS Encryption SDK Developer Guide
Examples
$ aws-encryption-cli --encrypt \
--input /var/log/httpd --recursive \
--output ~/archive --suffix .archive \
--master-keys key=$cmkArn \
--encryption-context class=log \
--suppress-metadata \
--caching capacity=1 max_age=10 max_messages_encrypted=10
PowerShell
To test the effect of data key caching, this example uses the Measure-Command cmdlet in PowerShell.
When you run this example without data key caching, it takes about 25 seconds to complete. This
process generates a new data key for each file in the directory.
Days : 0
Hours : 0
Minutes : 0
Seconds : 25
Milliseconds : 453
Ticks : 254531202
TotalDays : 0.000294596298611111
TotalHours : 0.00707031116666667
TotalMinutes : 0.42421867
TotalSeconds : 25.4531202
TotalMilliseconds : 25453.1202
Data key caching makes the process quicker, even when you limit each data key to a maximum of 10
files. The command now takes less than 12 seconds to complete and reduces the number of calls to the
master key provider to 1/10 of the original value.
Days : 0
Hours : 0
48
AWS Encryption SDK Developer Guide
Syntax and Parameter Reference
Minutes : 0
Seconds : 11
Milliseconds : 813
Ticks : 118132640
TotalDays : 0.000136727592592593
TotalHours : 0.00328146222222222
TotalMinutes : 0.196887733333333
TotalSeconds : 11.813264
TotalMilliseconds : 11813.264
If you eliminate the max_messages_encrypted restriction, all files are encrypted under the same
data key. This change increases the risk of reusing data keys without making the process much faster.
However, it reduces the number of calls to the master key provider to 1.
Days : 0
Hours : 0
Minutes : 0
Seconds : 10
Milliseconds : 252
Ticks : 102523367
TotalDays : 0.000118661304398148
TotalHours : 0.00284787130555556
TotalMinutes : 0.170872278333333
TotalSeconds : 10.2523367
TotalMilliseconds : 10252.3367
Topics
• AWS Encryption CLI Syntax (p. 49)
• AWS Encryption CLI Command Line Parameters (p. 51)
• Advanced Parameters (p. 54)
Get Help
To get the full AWS Encryption CLI syntax with parameter descriptions, use --help or -h.
49
AWS Encryption SDK Developer Guide
Syntax and Parameter Reference
To get the version number of your AWS Encryption CLI installation, use --version. Be sure to
include the version when you ask questions, report problems, or share tips about using the AWS
Encryption CLI.
aws-encryption-cli --version
Encrypt Data
The following syntax diagram shows the parameters that an encrypt command uses.
aws-encryption-cli --encrypt
--input <input> [--recursive] [--decode]
--output <output> [--interactive] [--no-overwrite] [--suffix
[<suffix>]] [--encode]
--master-keys [--master-keys ...]
key=<keyID> [provider=<provider-name>] [region=<aws-region>]
[profile=<aws-profile>]
--metadata-output <location> [--overwrite-metadata] | --suppress-
metadata
[--encryption-context <encryption_context> [<encryption_context>
...]]
[--algorithm <algorithm_suite>]
[--caching <attributes>]
[--frame-length <length>]
[-v | -vv | -vvv | -vvvv]
[--quiet]
Decrypt Data
The following syntax diagram shows the parameters that a decrypt command uses.
aws-encryption-cli --decrypt
--input <input> [--recursive] [--decode]
--output <output> [--interactive] [--no-overwrite] [--suffix
[<suffix>]] [--encode]
--metadata-output <location> [--overwrite-metadata] | --suppress-
metadata
[--master-keys [--master-keys ...]
[key=<keyID>] [provider=<provider-name>] [region=<aws-region>]
[profile=<aws-profile>]]
[--encryption-context <encryption_context> [<encryption_context>
...]]
[--caching <attributes>]
[--max-length <length>]
[-v | -vv | -vvv | -vvvv]
[--quiet]
You can refer to configuration files that contain parameters and their values. This is equivalent to
typing the parameters and values in the command. For an example, see How to Store Parameters in
a Configuration File (p. 34).
aws-encryption-cli @<configuration_file>
50
AWS Encryption SDK Developer Guide
Syntax and Parameter Reference
--encrypt (-e)
Encrypts the input data. Every command must have an --encrypt or --decrypt parameter.
--decrypt (-d)
Decrypts the input data. Every command must have an --encrypt or --decrypt parameter.
--master-keys (-m)
Specifies the master keys (p. 6) used in encryption and decryption operations. You can use multiple
master keys parameters in each command.
Attributes: The value of the --master-keys parameter consists of the following attributes. The
format is attribute_name=value.
key
The key attribute is required in all encrypt commands. When you use an AWS KMS customer
master key (CMK) in an encrypt command, the value of the key attribute can be a CMK ID or
Amazon Resource Name (ARN), an alias, or an alias ARN.
The key attribute is required in decrypt commands when the master key provider is not AWS
KMS. The key attribute is not permitted in commands that decrypt data that was encrypted
under an AWS KMS CMK.
You can specify multiple key attributes in each --master-keys parameter value. However,
any provider, region, and profile attributes apply to all master keys in the parameter value. To
specify master keys with different attribute values, use multiple --master-keys parameters in
the command.
provider
Identifies the master key provider (p. 7). The format is a provider=ID pair. The default value,
aws-kms, represents AWS KMS. This attribute is required only when the master key provider is
not AWS KMS.
region
Identifies the AWS Region of an AWS KMS CMK. This attribute is valid only for AWS KMS CMKs. It
is used only when the key identifier does not specify a region; otherwise, it is ignored. When it is
used, it overrides the default region in the AWS CLI named profile.
profile
Identifies an AWS CLI named profile. This attribute is valid only for AWS KMS CMKs. The region
in the profile is used only when the key identifier does not specify a region and there is no
region attribute in the command.
51
AWS Encryption SDK Developer Guide
Syntax and Parameter Reference
--input (-i)
Specifies the location of the data to encrypt or decrypt. This parameter is required. The value can be
a path to a file or directory, or a file name pattern. If you are piping input to the command (stdin),
use -.
If the input does not exist, the command completes successfully without error or warning.
--recursive (-r, -R)
Performs the operation on files in the input directory and its subdirectories. This parameter is
required when the value of --input is a directory.
--decode
If you are decrypting a message that was encrypted and then encoded, you must decode the
message before decrypting it. This parameter does that for you.
For example, if you used the --encode parameter in an encrypt command, use the --decode
parameter in the corresponding decrypt command. You can also use this parameter to decode
Base64-encoded input before you encrypt it.
--output (-o)
Specifies a destination for the output. This parameter is required. The value can be a file name, an
existing directory, or -, which writes output to the command line (stdout).
If the specified output directory does not exist, the command fails. If the input contains
subdirectories, the AWS Encryption CLI reproduces the subdirectories under the output directory
that you specify.
By default, the AWS Encryption CLI overwrites files with the same name. To change that behavior,
use the --interactive or --no-overwrite parameters. To suppress the overwrite warning, use
the --quiet parameter.
Note
If a command that would overwrite an output file fails, the output file is deleted.
--interactive
Does not overwrite files. Instead, if the output file exists, the AWS Encryption CLI skips the
corresponding input.
--suffix
Specifies a custom file name suffix for files that the AWS Encryption CLI creates. To indicate no
suffix, use the parameter with no value (--suffix).
By default, when the --output parameter does not specify a file name, the output file name
has the same name as the input file name plus the suffix. The suffix for encrypt commands is
.encrypted. The suffix for decrypt commands is .decrypted.
--encode
Applies Base64 (binary to text) encoding to the output. Encoding prevents the shell host
program from misinterpreting non-ASCII characters in output text.
52
AWS Encryption SDK Developer Guide
Syntax and Parameter Reference
Use this parameter when writing encrypted output to stdout (--output -), especially in a
PowerShell console, even when you are piping the output to another command or saving it in a
variable.
--metadata-output
Specifies a location for metadata about the cryptographic operations. Enter a path and file name. If
the directory does not exist, the command fails. To write the metadata to the command line (stdout),
use -.
You cannot write command output (--output) and metadata output (--metadata-output) to
stdout in the same command. Also, when the value of --input or --output is a directory (without
file names), you cannot write the metadata output to the same directory or to any subdirectory of
that directory.
If you specify an existing file, by default, the AWS Encryption CLI appends new metadata records to
any content in the file. This feature lets you create a single file that contains the metadata for all of
your cryptographic operations. To overwrite the content in an existing file, use the --overwrite-
metadata parameter.
The AWS Encryption CLI returns a JSON-formatted metadata record for each encryption or
decryption operation that the command performs. Each metadata record includes the full paths
to the input and output file, the encryption context, the algorithm suite, and other valuable
information that you can use to review the operation and verify that it meets your security
standards.
--overwrite-metadata
Overwrites the content in the metadata output file. By default, the --metadata-output
parameter appends metadata to any existing content in the file.
--suppress-metadata (-S)
Specifies an encryption context (p. 33) for the operation. This parameter is not required, but it is
recommended.
• In an --encrypt command, enter one or more name=value pairs. Use spaces to separate the
pairs.
• In a decrypt command, enter name=value pairs, name elements with no values, or both.
If the name or value in a name=value pair includes spaces or special characters, enclose the
entire pair in quotation marks. For example, --encryption-context "department=software
development".
--help (-h)
Displays verbose information, warning, and debugging messages. The detail in the output increases
with the number of vs in the parameter. The most detailed setting (-vvvv) returns debugging-level
data from the AWS Encryption CLI and all of the components that it uses.
--quiet (-q)
Suppresses warning messages, such as the message that appears when you overwrite an output file.
53
AWS Encryption SDK Developer Guide
Syntax and Parameter Reference
Advanced Parameters
--algorithm
Specifies an alternate algorithm suite (p. 7). This parameter is optional and valid only in encrypt
commands. By default, the AWS Encryption CLI uses the default algorithm suite for the AWS
Encryption SDK, which is AES-GCM with an HKDF, an ECDSA signature, and a 256-bit encryption key.
This algorithm suite is recommended for most encryption operations. For a list of alternate values,
see Read the Docs.
--frame-length
Creates output with specified frame length. Enter a value in bytes. This parameter is optional and
valid only in encrypt commands.
--max-length
Indicates the maximum frame size (or maximum content length for non-framed messages) in bytes
to read from encrypted messages. This parameter is optional and valid only in decrypt commands. It
is designed to protect you from decrypting extremely large malicious ciphertext.
--caching
Enables the data key caching (p. 55) feature, which reuses data keys, instead of generating a new
data key for each input file. This parameter supports an advanced scenario. Be sure to read the Data
Key Caching (p. 55) documentation before using this feature.
Determine how long cache entries are used, beginning when they are added to the cache.
max_messages_encrypted
Determines the maximum number of messages that a cached entry can encrypt.
max_bytes_encrypted
Determines the maximum number of bytes that a cached entry can encrypt.
54
AWS Encryption SDK Developer Guide
How to Implement Data Key Caching
Caching can reduce your use of cryptographic services, such as AWS Key Management Service (AWS
KMS). If you are hitting your AWS KMS requests-per-second limit, caching can help. Your application can
use cached keys to service some of your data key requests instead of calling AWS KMS. (You can also
create a case in the AWS Support Center to raise the limit for your account.)
The AWS Encryption SDK helps you to create and manage your data key cache. It provides a
LocalCryptoMaterialsCache (p. 64) and a caching cryptographic materials manager (p. 65) that
interacts with the cache and enforces security thresholds (p. 60) that you set. Working together, these
components help you to benefit from the efficiency of reusing data keys while maintaining the security
of your system.
Data key caching is an optional feature of the AWS Encryption SDK that you should use cautiously.
By default, the AWS Encryption SDK generates a new data key for every encryption operation. This
technique supports cryptographic best practices, which discourage excessive reuse of data keys. In
general, use data key caching only when it is required to meet your performance goals. Then, use the
data key caching security thresholds (p. 60) to ensure that you use the minimum amount of caching
required to meet your cost and performance goals.
For a detailed discussion of these security tradeoffs, see AWS Encryption SDK: How to Decide if Data Key
Caching is Right for Your Application in the AWS Security Blog.
Topics
• How to Implement Data Key Caching (p. 55)
• Setting Cache Security Thresholds (p. 60)
• Data Key Caching Details (p. 61)
• Data Key Caching Example (p. 66)
Topics
55
AWS Encryption SDK Developer Guide
Implement Data Key Caching: Step-by-Step
Java
Python
cache = LocalCryptoMaterialsCache(MAX_CACHE_SIZE)
• Create a master key provider (p. 7). This example uses an AWS Key Management Service (AWS KMS)
master key provider.
Java
Python
key_provider = aws_encryption_sdk.KMSMasterKeyProvider(key_ids=[kms_cmk_arn])
• Create a caching cryptographic materials manager (p. 65) (caching CMM).
Associate your caching CMM with your cache and master key provider. Then, set cache security
thresholds (p. 60) on the caching CMM.
56
AWS Encryption SDK Developer Guide
Implement Data Key Caching: Step-by-Step
Java
/*
* Security thresholds
* Max entry age is required.
* Max messages (and max bytes) per entry are optional
*/
int MAX_ENTRY_AGE_SECONDS = 60;
int MAX_ENTRY_MSGS = 10;
Python
# Security thresholds
# Max entry age is required.
# Max messages (and max bytes) per entry are optional
#
MAX_ENTRY_AGE_SECONDS = 60.0
MAX_ENTRY_MESSAGES = 10
That's all you need to do. Then, let the AWS Encryption SDK manage the cache for you, or add your own
cache management logic.
When you want to use data key caching in a call to encrypt or decrypt data, specify your caching CMM
instead of a master key provider or other CMM.
Note
If you are encrypting data streams, or any data of unknown size, be sure to specify the data size
in the request. The AWS Encryption SDK does not use data key caching when encrypting data of
unknown size.
Java
Python
57
AWS Encryption SDK Developer Guide
Data Key Caching Example: Encrypt a String
source=plaintext_source,
materials_manager=caching_cmm
)
The example creates a LocalCryptoMaterialsCache (p. 64) and a master key provider (p. 7) for an AWS
KMS customer master key (CMK). Then, it uses the cache and master key provider to create a caching
CMM with appropriate security thresholds (p. 60). The encryption request specifies the caching CMM,
the plaintext data to encrypt, and an encryption context (p. 66).
To run the example, you need to supply the Amazon Resource Name (ARN) of a KMS CMK. Be sure that
you have permission to use the CMK to generate a data key.
For more detailed, real-world examples of creating and using a data key cache, see Data Key Caching
Example in Java (p. 68) and Data Key Caching Example in Python (p. 72).
Java
/*
* Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
*
* Licensed under the Apache License, Version 2.0 (the "License"). You may not use this
file except
* in compliance with the License. A copy of the License is located at
*
* http://aws.amazon.com/apache2.0
*
* or in the "license" file accompanying this file. This file is distributed on an "AS
IS" BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
License for the
* specific language governing permissions and limitations under the License.
*/
import java.nio.charset.StandardCharsets;
import java.util.Collections;
import java.util.Map;
import java.util.concurrent.TimeUnit;
import javax.xml.bind.DatatypeConverter;
import com.amazonaws.encryptionsdk.AwsCrypto;
import com.amazonaws.encryptionsdk.CryptoMaterialsManager;
import com.amazonaws.encryptionsdk.MasterKeyProvider;
import com.amazonaws.encryptionsdk.caching.CachingCryptoMaterialsManager;
import com.amazonaws.encryptionsdk.caching.CryptoMaterialsCache;
import com.amazonaws.encryptionsdk.caching.LocalCryptoMaterialsCache;
import com.amazonaws.encryptionsdk.kms.KmsMasterKey;
import com.amazonaws.encryptionsdk.kms.KmsMasterKeyProvider;
/**
* <p>
* Encrypts a string using an AWS KMS customer master key (CMK) and data key caching
*
* <p>
* Arguments:
58
AWS Encryption SDK Developer Guide
Data Key Caching Example: Encrypt a String
* <ol>
* <li>KMS CMK ARN: To find the Amazon Resource Name of your AWS KMS customer master
key (CMK),
* see 'Viewing Keys' at http://docs.aws.amazon.com/kms/latest/developerguide/
viewing-keys.html
* <li>Max entry age: Maximum time (in seconds) that a cached entry can be used
* <li>Cache capacity: Maximum number of entries in the cache
* </ol>
*/
public class SimpleDataKeyCachingExample {
/*
* Security thresholds
* Max entry age is required.
* Max messages (and max bytes) per data key are optional
*/
private static final int MAX_ENTRY_MSGS = 100;
// Encryption context
final Map<String, String> encryptionContext =
Collections.singletonMap("purpose", "test");
// Create a cache
CryptoMaterialsCache cache = new LocalCryptoMaterialsCache(cacheCapacity);
CachingCryptoMaterialsManager.newBuilder().withMasterKeyProvider(keyProvider)
.withCache(cache)
.withMaxAge(maxEntryAge, TimeUnit.SECONDS)
.withMessageUseLimit(MAX_ENTRY_MSGS)
.build();
Python
59
AWS Encryption SDK Developer Guide
Setting Cache Security Thresholds
:param str kms_cmk_arn: Amazon Resource Name (ARN) of the KMS customer master key
:param float max_age_in_cache: Maximum time in seconds that a cached entry can be
used
:param int cache_capacity: Maximum number of entries in the cache
"""
# Data to be encrypted
my_data = 'My plaintext data'
# Security thresholds
# Max messages (and max bytes) per data key are optional
MAX_ENTRY_MESSAGES = 100
# Create a cache
cache = aws_encryption_sdk.LocalCryptoMaterialsCache(cache_capacity)
return encrypted_message
The security thresholds help you to limit how long each cached data key is used and how much data is
protected under each data key. The caching CMM returns cached data keys only when the cache entry
conforms to all of the security thresholds. If the cache entry exceeds any threshold, the entry is not used
for the current operation and it is evicted from the cache.
As a rule, use the minimum amount of caching that is required to meet your cost and performance goals.
The AWS Encryption SDK only caches data keys that are encrypted by using a key derivation function.
Also, it establishes upper limits for the threshold values. These restrictions ensure that data keys are
60
AWS Encryption SDK Developer Guide
Data Key Caching Details
not reused beyond their cryptographic limits. However, because your plaintext data keys are cached (in
memory, by default), try to minimize the time that the keys are saved . Also, try to limit the data that
might be exposed if a key is compromised.
For examples of setting cache security thresholds, see AWS Encryption SDK: How to Decide if Data Key
Caching is Right for Your Application in the AWS Security Blog.
Note
The caching CMM enforces all of the following thresholds. If you do not specify an optional
value, the caching CMM uses the default value.
To disable data key caching temporarily, do not set the cache capacity (p. 64) or security
thresholds to 0. Instead, use the null cryptographic materials cache (NullCryptoMaterialsCache)
that the AWS Encryption SDK provides. The NullCryptoMaterialsCache returns a miss for every
get request and does not respond to put requests. For more information, see the SDK for your
programming language (p. 12).
Determines how long a cached entry can be used, beginning when it was added. This value is
required. Enter a value greater than 0. There is no maximum value.
The LocalCryptoMaterialsCache tries to evict cache entries as soon as possible after they reach the
maximum age value. Other conforming caches might perform differently.
Use the shortest interval that still allows your application to benefit from the cache. You can use
the maximum age threshold like a key rotation policy. Use it to limit reuse of data keys, minimize
exposure of cryptographic materials, and evict data keys whose policies might have changed while
they were cached.
Maximum messages encrypted (optional)
Specifies the maximum number of messages that a cached data key can encrypt. This value is
optional. Enter a value between 1 and 2^32 messages. The default value is 2^32 messages.
Set the number of messages protected by each cached key to be large enough to get value
from reuse, but small enough to limit the number of messages that might be exposed if a key is
compromised.
Maximum bytes encrypted (optional)
Specifies the maximum number of bytes that a cached data key can encrypt. This value is optional.
Enter a value between 0 and 2^63 - 1. The default value is 2^63 - 1. A value of 0 lets you encrypt
empty message strings.
The first use of each data key (before caching) is exempt from this threshold. Also, to enforce this
threshold, requests to encrypt data of unknown size, such as streamed data with no length specifier,
do not use the data key cache.
The bytes in the current request are included when evaluating this threshold. If the bytes processed,
plus current bytes, exceed the threshold, the cached data key is evicted from the cache, even though
it might have been used on a smaller request.
Topics
61
AWS Encryption SDK Developer Guide
How Data Key Caching Works
In addition to a cache, data key caching uses a caching cryptographic materials manager (p. 65)
(caching CMM). The caching CMM is a specialized cryptographic materials manager (CMM) (p. 7) that
interacts with a cache (p. 64) and an underlying CMM (p. 7) or master key provider (p. 7). The caching
CMM caches the data keys that its underlying CMM (or master key provider) returns. The caching CMM
also enforces cache security thresholds that you set.
To prevent the wrong data key from being selected from the cache, each caching CMM requires that
several properties of each cached data key match the materials request, as follows:
• For encryption material requests, the cached entry and the request must have the same algorithm
suite (p. 7), encryption context (p. 66) (even when empty), and partition name (a string that
identifies the caching CMM).
• For decryption material requests, the cached entry and the request must have the same algorithm
suite (p. 7), encryption context (p. 66) (even when empty), and partition name (a string that
identifies the caching CMM).
Note
The AWS Encryption SDK caches data keys only when the algorithm suite (p. 7) uses a key
derivation function.
Data key caching is not used for data of unknown size, such as streamed data. This allows
the caching CMM to properly enforce the maximum bytes threshold (p. 60). To avoid this
behavior, add the data length to the encryption request.
The following workflows show how a request to encrypt data is processed with and without data key
caching. They show how the caching components that you create, including the cache and the caching
CMM, are used in the process.
The request specifies a cryptographic materials manager (CMM) or master key provider. If you specify
a master key provider, the AWS Encryption SDK creates a default CMM that interacts with the master
key provider you specified.
2. The AWS Encryption SDK asks the CMM for a data key to encrypt the data (get cryptographic
materials).
3. The CMM asks its master key provider for master keys (p. 6) (or objects that represent master
keys). Then, it uses the master keys to generate a new data key (p. 6). This might involve a call to a
cryptographic service, such as AWS Key Management Service (AWS KMS). The CMM returns plaintext
and encrypted copies of the data key to the AWS Encryption SDK.
62
AWS Encryption SDK Developer Guide
How Data Key Caching Works
4. The AWS Encryption SDK uses the plaintext data key to encrypt the data and it returns an encrypted
message (p. 8) to the user.
The request specifies a caching cryptographic materials manager (caching CMM) (p. 65) that is
associated with a default cryptographic materials manager (CMM) or a master key provider. If you
specify a master key provider, the SDK creates a default CMM for you.
2. The SDK asks the specified caching CMM for a data key to encrypt the data (get cryptographic
materials).
3. The caching CMM requests a data key from the cache.
a. If the cache finds a match, it updates the age and use values of the matched cache entry, and
returns the cached data key to the caching CMM.
If the cache entry conforms to its security thresholds (p. 60), the caching CMM returns it to the
SDK. Otherwise, it tells the cache to evict the entry and proceeds as though there was no match.
b. If the cache cannot find a valid match, the caching CMM asks its underlying CMM to generate a new
data key.
The CMM gets master keys (or objects that represent master keys) from its master key provider and
it uses them to generate a new data key. This might involve a call to a service, such as AWS Key
Management Service. The CMM returns the plaintext and encrypted copies of the data key to the
caching CMM.
63
AWS Encryption SDK Developer Guide
Creating a Cryptographic Materials Cache
The caching CMM saves the new data key in the cache.
4. The caching CMM returns plaintext and encrypted copies of the data key to the AWS Encryption SDK.
5. The AWS Encryption SDK uses the data key to encrypt the data and it returns an encrypted
message (p. 8) to the user.
LocalCryptoMaterialsCache includes logic for basic cache management, including adding, evicting,
and matching cached entries, and maintaining the cache. You don't need to write any custom cache
management logic. You can use LocalCryptoMaterialsCache as is, customize it, or substitute any
compatible cache.
When you create a LocalCryptoMaterialsCache, you set its capacity, that is, the maximum number of
entries that the cache can hold. This setting helps you to design an efficient cache with limited data key
reuse.
The AWS Encryption SDK also provides a null cryptographic materials cache (NullCryptoMaterialsCache).
The NullCryptoMaterialsCache returns a miss for all get operations and does not respond to put
operations. You can use the NullCryptoMaterialsCache in testing or to temporarily disable caching in an
application that includes caching code.
In the AWS Encryption SDK, each cryptographic materials cache is associated with a caching
cryptographic materials manager (p. 65) (caching CMM). The caching CMM gets data keys from the
cache, puts data keys in the cache, and enforces security thresholds (p. 60) that you set. When you
64
AWS Encryption SDK Developer Guide
Creating a Caching Cryptographic Materials Manager
create a caching CMM, you specify the cache that it uses and the underlying CMM or master key provider
that generates the data keys that it caches.
There are two types of CMMs. Both get data keys (and related cryptographic material), but in different
ways, as follows:
• A CMM is associated with a master key provider. When the SDK asks the CMM for data keys (get
encryption materials), the CMM gets master keys (or objects that represent master keys) from its
master key provider. Then, it uses the master keys to generate, encrypt, or decrypt the data keys.
• A caching CMM is associated with one cache, such as a LocalCryptoMaterialsCache (p. 64), and a
CMM or master key provider. (If you specify a master key provider, the SDK creates a default CMM
for the master key provider.) When the SDK asks the caching CMM for data keys, the caching CMM
tries to get them from the cache. If it cannot find a valid, matching data key, the caching CMM asks its
underlying CMM for the data keys. Then, it caches those data keys before returning them to the caller.
The caching CMM also enforces security thresholds (p. 60) that you set for each cache entry. Because
the security thresholds are set in and enforced by the caching CMM, you can use any compatible cache,
even if the cache is not designed for sensitive material.
For details about creating and managing CMMs and caching CMMs in your application, see the SDK for
your programming language (p. 12).
The entries that are added to a data key cache as a result of a encryption operation include the following
elements:
The entries that are added to a data key cache as a result of a decryption operation include the following
elements:
65
AWS Encryption SDK Developer Guide
Encryption Context: How to Select Cache Entries
An encryption context (p. 8) is a set of key-value pairs that contain arbitrary nonsecret data. During
encryption, the encryption context is cryptographically bound to the encrypted data so that the same
encryption context is required to decrypt the data. In the AWS Encryption SDK, the encryption context is
stored in the encrypted message (p. 8) along with the encrypted data and data keys.
When you use a data key cache, you can also use the encryption context to select particular cached data
keys for your encryption operations. The encryption context is saved in the cache entry with the data key
(it's part of the cache entry ID). Cached data keys are reused only when their encryption contexts match.
If you want to reuse certain data keys for an encryption request, specify the same encryption context. If
you want to avoid those data keys, specify a different encryption context.
The encryption context is always optional, but recommended. If you don't specify an encryption context
in your request, an empty encryption context is included in the cache entry identifier and matched to
each request.
In this scenario, multiple data producers generate data, encrypt it, and write to a Kinesis stream in
each region. AWS Lambda functions (consumers) decrypt the streams and write plaintext data to a
DynamoDB table in the region. Data producers and consumers use the AWS Encryption SDK and a
KMS master key provider (p. 7). To reduce calls to KMS, each producer and consumer has their own
LocalCryptoMaterialsCache.
You can find the source code for these examples in Java (p. 68) and Python (p. 72). The sample also
includes a AWS CloudFormation template that defines the resources for the samples.
66
AWS Encryption SDK Developer Guide
LocalCryptoMaterialsCache Results
LocalCryptoMaterialsCache Results
The following table shows that LocalCryptoMaterialsCache reduces the total calls to KMS (per second per
region) in this example to 1% of its original value.
Producer requests
LocalCryptoMaterialsCache
1 rps / 100 1 rps / 100 1 rps / 100 500 5
uses uses uses
67
AWS Encryption SDK Developer Guide
Java Example
Consumer requests
LocalCryptoMaterialsCache
1 rps per 500 5 2 10
producer / 100
uses
The code creates two instances of a LocalCryptoMaterialsCache; one for data producers that are
encrypting data and another for data consumers (Lambda functions) that are decrypting data. For
implementation details, see the Javadoc for the AWS Encryption SDK.
Producer
The producer gets a map, converts it to JSON, uses the AWS Encryption SDK to encrypt it, and pushes
the ciphertext record to a Kinesis stream in each region.
The code defines a caching cryptographic materials manager (p. 65) (caching CMM) and associates
it with a LocalCryptoMaterialsCache (p. 64) and an underlying KMS master key provider (p. 7). The
caching CMM caches the data keys (and related cryptographic materials (p. 65)) from the master key
provider. It also interacts with the cache on behalf of the SDK and enforces security thresholds that you
set.
Because the call to the encryptData method specifies a caching CMM, instead of a regular
cryptographic materials manager (CMM) (p. 7) or master key provider, the method will use data key
caching.
/*
* Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
*
* Licensed under the Apache License, Version 2.0 (the "License"). You may not use this
file except
* in compliance with the License. A copy of the License is located at
*
* http://aws.amazon.com/apache2.0
*
* or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
* specific language governing permissions and limitations under the License.
*/
package com.amazonaws.crypto.examples.kinesisdatakeycaching;
import java.nio.ByteBuffer;
import java.util.ArrayList;
68
AWS Encryption SDK Developer Guide
Java Example
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.UUID;
import java.util.concurrent.TimeUnit;
import com.amazonaws.ClientConfiguration;
import com.amazonaws.auth.DefaultAWSCredentialsProviderChain;
import com.amazonaws.encryptionsdk.AwsCrypto;
import com.amazonaws.encryptionsdk.CryptoResult;
import com.amazonaws.encryptionsdk.MasterKeyProvider;
import com.amazonaws.encryptionsdk.caching.CachingCryptoMaterialsManager;
import com.amazonaws.encryptionsdk.caching.LocalCryptoMaterialsCache;
import com.amazonaws.encryptionsdk.kms.KmsMasterKey;
import com.amazonaws.encryptionsdk.kms.KmsMasterKeyProvider;
import com.amazonaws.encryptionsdk.multi.MultipleProviderFactory;
import com.amazonaws.regions.Region;
import com.amazonaws.services.kinesis.AmazonKinesis;
import com.amazonaws.services.kinesis.AmazonKinesisClientBuilder;
import com.amazonaws.util.json.Jackson;
/**
* Pushes data to Kinesis Streams in multiple regions.
*/
public class MultiRegionRecordPusher {
private static long MAX_ENTRY_AGE_MILLISECONDS = 300000;
private static long MAX_ENTRY_USES = 100;
private static int MAX_CACHE_ENTRIES = 100;
private final String streamName_;
private ArrayList<AmazonKinesis> kinesisClients_;
private CachingCryptoMaterialsManager cachingMaterialsManager_;
private AwsCrypto crypto_;
/**
* Creates an instance of this object with Kinesis clients for all target regions
* and a cached key provider containing KMS master keys in all target regions.
*/
public MultiRegionRecordPusher(final Region[] regions, final String kmsAliasName, final
String streamName){
streamName_ = streamName;
crypto_ = new AwsCrypto();
kinesisClients_ = new ArrayList<AmazonKinesis>();
masterKeys.add(regionMasterKey);
}
69
AWS Encryption SDK Developer Guide
Java Example
MasterKeyProvider<?> masterKeyProvider =
MultipleProviderFactory.buildMultiProvider(
KmsMasterKey.class,
masterKeys
);
cachingMaterialsManager_ = CachingCryptoMaterialsManager.newBuilder()
.withMasterKeyProvider(masterKeyProvider)
.withCache(new LocalCryptoMaterialsCache(MAX_CACHE_ENTRIES))
.withMaxAge(MAX_ENTRY_AGE_MILLISECONDS, TimeUnit.MILLISECONDS)
.withMessageUseLimit(MAX_ENTRY_USES)
.build();
}
/**
* JSON serializes and encrypts the received record data and pushes it to all target
streams.
*/
public void putRecord(final Map<Object, Object> data){
String partitionKey = UUID.randomUUID().toString();
Map<String, String> encryptionContext = new HashMap<String, String>();
encryptionContext.put("stream", streamName_);
// Encrypt data
CryptoResult<byte[], ?> result = crypto_.encryptData(
cachingMaterialsManager_,
jsonData.getBytes(),
encryptionContext
);
byte[] encryptedData = result.getResult();
Consumer
The data consumer is an AWS Lambda function that is triggered by Kinesis events. It decrypts and
deserializes each record, and writes the plaintext record to a DynamoDB table in the same region.
Like the producer code, the consumer code enables data key caching by using a caching cryptographic
materials manager (caching CMM) in calls to the decryptData method.
/*
* Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
*
* Licensed under the Apache License, Version 2.0 (the "License"). You may not use this
file except
* in compliance with the License. A copy of the License is located at
*
* http://aws.amazon.com/apache2.0
*
* or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
70
AWS Encryption SDK Developer Guide
Java Example
* WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
* specific language governing permissions and limitations under the License.
*/
package com.amazonaws.crypto.examples.kinesisdatakeycaching;
import java.io.UnsupportedEncodingException;
import java.nio.ByteBuffer;
import java.util.concurrent.TimeUnit;
import com.amazonaws.encryptionsdk.AwsCrypto;
import com.amazonaws.encryptionsdk.CryptoResult;
import com.amazonaws.encryptionsdk.caching.CachingCryptoMaterialsManager;
import com.amazonaws.encryptionsdk.caching.LocalCryptoMaterialsCache;
import com.amazonaws.encryptionsdk.kms.KmsMasterKey;
import com.amazonaws.encryptionsdk.kms.KmsMasterKeyProvider;
import com.amazonaws.services.dynamodbv2.AmazonDynamoDBClientBuilder;
import com.amazonaws.services.dynamodbv2.document.DynamoDB;
import com.amazonaws.services.dynamodbv2.document.Item;
import com.amazonaws.services.dynamodbv2.document.Table;
import com.amazonaws.services.lambda.runtime.Context;
import com.amazonaws.services.lambda.runtime.events.KinesisEvent;
import com.amazonaws.services.lambda.runtime.events.KinesisEvent.KinesisEventRecord;
import com.amazonaws.util.BinaryUtils;
/**
* Decrypts all incoming Kinesis records and writes records to DynamoDB.
*/
public class LambdaDecryptAndWrite {
private static final long MAX_ENTRY_AGE_MILLISECONDS = 600000;
private static final int MAX_CACHE_ENTRIES = 100;
private CachingCryptoMaterialsManager cachingMaterialsManager_;
private AwsCrypto crypto_;
private Table table_;
/**
* Because the cache is used only for decryption, the code doesn't set
* the max bytes or max message security thresholds that are are enforced
* only on on data keys used for encryption.
*/
public LambdaDecryptAndWrite() {
String cmkArn = System.getenv("CMK_ARN");
cachingMaterialsManager_ = CachingCryptoMaterialsManager.newBuilder()
.withMasterKeyProvider(new KmsMasterKeyProvider(cmkArn))
.withCache(new LocalCryptoMaterialsCache(MAX_CACHE_ENTRIES))
.withMaxAge(MAX_ENTRY_AGE_MILLISECONDS, TimeUnit.MILLISECONDS)
.build();
/**
*
* @param event
* @param context
*/
public void handleRequest(KinesisEvent event, Context context) throws
UnsupportedEncodingException{
for (KinesisEventRecord record : event.getRecords()) {
ByteBuffer ciphertextBuffer = record.getKinesis().getData();
byte[] ciphertext = BinaryUtils.copyAllBytesFrom(ciphertextBuffer);
71
AWS Encryption SDK Developer Guide
Python Example
The code creates two instances of a LocalCryptoMaterialsCache; one for data producers that are
encrypting data and another for data consumers (Lambda functions) that are decrypting data. For
implementation details, see the Python documentation for the AWS Encryption SDK.
Producer
The producer gets a map, converts it to JSON, uses the AWS Encryption SDK to encrypt it, and pushes
the ciphertext record to an Kinesis stream in each region.
The code defines a caching cryptographic materials manager (p. 65) (caching CMM) and associates
it with a LocalCryptoMaterialsCache (p. 64) and an underlying KMS master key provider (p. 7). The
caching CMM caches the data keys (and related cryptographic materials (p. 65)) from the master key
provider. It also interacts with the cache on behalf of the SDK and enforces security thresholds that you
set.
Because the call to the encrypt method specifies a caching CMM, instead of a regular cryptographic
materials manager (CMM) (p. 7) or master key provider, the method will use data key caching.
"""
Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License"). You may not use this file
except
in compliance with the License. A copy of the License is located at
https://aws.amazon.com/apache-2-0/
or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
specific language governing permissions and limitations under the License.
"""
import json
import uuid
72
AWS Encryption SDK Developer Guide
Python Example
class MultiRegionRecordPusher(object):
"""Pushes data to Kinesis Streams in multiple regions."""
CACHE_CAPACITY = 100
MAX_ENTRY_AGE_SECONDS = 300.0
MAX_ENTRY_MESSAGES_ENCRYPTED = 100
cache = LocalCryptoMaterialsCache(capacity=self.CACHE_CAPACITY)
self._materials_manager = CachingCryptoMaterialsManager(
master_key_provider=_key_provider,
cache=cache,
max_age=self.MAX_ENTRY_AGE_SECONDS,
max_messages_encrypted=self.MAX_ENTRY_MESSAGES_ENCRYPTED
)
# Encrypt data
encrypted_data, _header = encrypt(
source=json_data,
materials_manager=self._materials_manager,
encryption_context=encryption_context
)
73
AWS Encryption SDK Developer Guide
Python Example
Consumer
The data consumer is an AWS Lambda function that is triggered by Kinesis events. It decrypts and
deserializes each record, and writes the plaintext record to a DynamoDB table in the same region.
Like the producer code, the consumer code enables data key caching by using a caching cryptographic
materials manager (caching CMM) in calls to the decrypt method.
"""
Copyright 2017 Amazon.com, Inc. or its affiliates. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License"). You may not use this file
except
in compliance with the License. A copy of the License is located at
https://aws.amazon.com/apache-2-0/
or in the "license" file accompanying this file. This file is distributed on an "AS IS"
BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License
for the
specific language governing permissions and limitations under the License.
"""
import base64
import json
import logging
import os
_LOGGER = logging.getLogger(__name__)
_is_setup = False
CACHE_CAPACITY = 100
MAX_ENTRY_AGE_SECONDS = 600.0
def setup():
"""Sets up clients that should persist across Lambda invocations."""
global materials_manager
key_provider = KMSMasterKeyProvider()
cache = LocalCryptoMaterialsCache(capacity=CACHE_CAPACITY)
# Because the cache is used only for decryption, the code doesn't set
# the max bytes or max message security thresholds that are are enforced
# only on on data keys used for encryption.
materials_manager = CachingCryptoMaterialsManager(
master_key_provider=key_provider,
cache=cache,
max_age=MAX_ENTRY_AGE_SECONDS
)
global table
table_name = os.environ.get('TABLE_NAME')
table = boto3.resource('dynamodb').Table(table_name)
global _is_setup
_is_setup = True
74
AWS Encryption SDK Developer Guide
AWS CloudFormation Template
Parameters:
SourceCodeBucket:
Type: String
Description: S3 bucket containing Lambda source code zip files
PythonLambdaS3Key:
Type: String
Description: S3 key containing Python Lambda source code zip file
PythonLambdaObjectVersionId:
Type: String
Description: S3 version id for S3 key containing Python Lambda source code zip file
JavaLambdaS3Key:
Type: String
Description: S3 key containing Python Lambda source code zip file
JavaLambdaObjectVersionId:
Type: String
Description: S3 version id for S3 key containing Python Lambda source code zip file
KeyAliasSuffix:
Type: String
Description: 'Suffix to use for KMS CMK Alias (ie: alias/<KeyAliasSuffix>)'
StreamName:
Type: String
Description: Name to use for Kinesis Stream
Resources:
InputStream:
Type: AWS::Kinesis::Stream
Properties:
Name: !Ref StreamName
ShardCount: 2
PythonLambdaOutputTable:
Type: AWS::DynamoDB::Table
Properties:
AttributeDefinitions:
-
AttributeName: id
AttributeType: S
KeySchema:
-
75
AWS Encryption SDK Developer Guide
AWS CloudFormation Template
AttributeName: id
KeyType: HASH
ProvisionedThroughput:
ReadCapacityUnits: 1
WriteCapacityUnits: 1
PythonLambdaRole:
Type: AWS::IAM::Role
Properties:
AssumeRolePolicyDocument:
Version: 2012-10-17
Statement:
-
Effect: Allow
Principal:
Service: lambda.amazonaws.com
Action: sts:AssumeRole
ManagedPolicyArns:
- arn:aws:iam::aws:policy/service-role/AWSLambdaBasicExecutionRole
Policies:
-
PolicyName: PythonLambdaAccess
PolicyDocument:
Version: 2012-10-17
Statement:
-
Effect: Allow
Action:
- dynamodb:DescribeTable
- dynamodb:BatchWriteItem
Resource: !Sub arn:aws:dynamodb:${AWS::Region}:
${AWS::AccountId}:table/${PythonLambdaOutputTable}
-
Effect: Allow
Action:
- dynamodb:PutItem
Resource: !Sub arn:aws:dynamodb:${AWS::Region}:
${AWS::AccountId}:table/${PythonLambdaOutputTable}*
-
Effect: Allow
Action:
- kinesis:GetRecords
- kinesis:GetShardIterator
- kinesis:DescribeStream
- kinesis:ListStreams
Resource: !Sub arn:aws:kinesis:${AWS::Region}:
${AWS::AccountId}:stream/${InputStream}
PythonLambdaFunction:
Type: AWS::Lambda::Function
Properties:
Description: Python consumer
Runtime: python2.7
MemorySize: 512
Timeout: 90
Role: !GetAtt PythonLambdaRole.Arn
Handler: aws_crypto_examples.kinesis_datakey_caching.consumer.lambda_handler
Code:
S3Bucket: !Ref SourceCodeBucket
S3Key: !Ref PythonLambdaS3Key
S3ObjectVersion: !Ref PythonLambdaObjectVersionId
Environment:
Variables:
TABLE_NAME: !Ref PythonLambdaOutputTable
PythonLambdaSourceMapping:
Type: AWS::Lambda::EventSourceMapping
Properties:
BatchSize: 1
76
AWS Encryption SDK Developer Guide
AWS CloudFormation Template
Enabled: true
EventSourceArn: !Sub arn:aws:kinesis:${AWS::Region}:${AWS::AccountId}:stream/
${InputStream}
FunctionName: !Ref PythonLambdaFunction
StartingPosition: TRIM_HORIZON
JavaLambdaOutputTable:
Type: AWS::DynamoDB::Table
Properties:
AttributeDefinitions:
-
AttributeName: id
AttributeType: S
KeySchema:
-
AttributeName: id
KeyType: HASH
ProvisionedThroughput:
ReadCapacityUnits: 1
WriteCapacityUnits: 1
JavaLambdaRole:
Type: AWS::IAM::Role
Properties:
AssumeRolePolicyDocument:
Version: 2012-10-17
Statement:
-
Effect: Allow
Principal:
Service: lambda.amazonaws.com
Action: sts:AssumeRole
ManagedPolicyArns:
- arn:aws:iam::aws:policy/service-role/AWSLambdaBasicExecutionRole
Policies:
-
PolicyName: JavaLambdaAccess
PolicyDocument:
Version: 2012-10-17
Statement:
-
Effect: Allow
Action:
- dynamodb:DescribeTable
- dynamodb:BatchWriteItem
Resource: !Sub arn:aws:dynamodb:${AWS::Region}:
${AWS::AccountId}:table/${JavaLambdaOutputTable}
-
Effect: Allow
Action:
- dynamodb:PutItem
Resource: !Sub arn:aws:dynamodb:${AWS::Region}:
${AWS::AccountId}:table/${JavaLambdaOutputTable}*
-
Effect: Allow
Action:
- kinesis:GetRecords
- kinesis:GetShardIterator
- kinesis:DescribeStream
- kinesis:ListStreams
Resource: !Sub arn:aws:kinesis:${AWS::Region}:
${AWS::AccountId}:stream/${InputStream}
JavaLambdaFunction:
Type: AWS::Lambda::Function
Properties:
Description: Java consumer
Runtime: java8
MemorySize: 512
77
AWS Encryption SDK Developer Guide
AWS CloudFormation Template
Timeout: 90
Role: !GetAtt JavaLambdaRole.Arn
Handler:
com.amazonaws.crypto.examples.kinesisdatakeycaching.LambdaDecryptAndWrite::handleRequest
Code:
S3Bucket: !Ref SourceCodeBucket
S3Key: !Ref JavaLambdaS3Key
S3ObjectVersion: !Ref JavaLambdaObjectVersionId
Environment:
Variables:
TABLE_NAME: !Ref JavaLambdaOutputTable
CMK_ARN: !GetAtt RegionKinesisCMK.Arn
JavaLambdaSourceMapping:
Type: AWS::Lambda::EventSourceMapping
Properties:
BatchSize: 1
Enabled: true
EventSourceArn: !Sub arn:aws:kinesis:${AWS::Region}:${AWS::AccountId}:stream/
${InputStream}
FunctionName: !Ref JavaLambdaFunction
StartingPosition: TRIM_HORIZON
RegionKinesisCMK:
Type: AWS::KMS::Key
Properties:
Description: Used to encrypt data passing through Kinesis Stream in this region
Enabled: true
KeyPolicy:
Version: 2012-10-17
Statement:
-
Effect: Allow
Principal:
AWS: !Sub arn:aws:iam::${AWS::AccountId}:root
Action:
# Data plane actions
- kms:Encrypt
- kms:GenerateDataKey
# Control plane actions
- kms:CreateAlias
- kms:DeleteAlias
- kms:DescribeKey
- kms:DisableKey
- kms:EnableKey
- kms:PutKeyPolicy
- kms:ScheduleKeyDeletion
- kms:UpdateAlias
- kms:UpdateKeyDescription
Resource: '*'
-
Effect: Allow
Principal:
AWS:
- !GetAtt PythonLambdaRole.Arn
- !GetAtt JavaLambdaRole.Arn
Action: kms:Decrypt
Resource: '*'
RegionKinesisCMKAlias:
Type: AWS::KMS::Alias
Properties:
AliasName: !Sub alias/${KeyAliasSuffix}
TargetKeyId: !Ref RegionKinesisCMK
78
AWS Encryption SDK Developer Guide
How is the AWS Encryption SDK different from the AWS SDKs?
The AWS SDKs provide libraries for interacting with Amazon Web Services (AWS). They integrate
with AWS Key Management Service (AWS KMS) to generate, encrypt, and decrypt data keys.
However, in most cases you can't use them to directly encrypt or decrypt raw data.
The AWS Encryption SDK provides an encryption library that optionally integrates with AWS KMS as
a master key provider. The AWS Encryption SDK builds on the AWS SDKs to do the following things:
• Generate, encrypt, and decrypt data keys
• Use those data keys to encrypt and decrypt your raw data
• Store the encrypted data keys with the corresponding encrypted data in a single object
You can also use the AWS Encryption SDK with no AWS integration by defining a custom master key
provider.
How is the AWS Encryption SDK different from the Amazon S3 encryption client?
The Amazon S3 encryption client in the AWS SDK for Java, AWS SDK for Ruby, and AWS SDK
for .NET provides encryption and decryption for data that you store in Amazon Simple Storage
Service (Amazon S3). These clients are tightly coupled to Amazon S3 and are intended for use only
with data stored there.
The AWS Encryption SDK provides encryption and decryption for data that you can store anywhere.
The AWS Encryption SDK and the Amazon S3 encryption client are not compatible because they
produce ciphertexts with different data formats.
Which cryptographic algorithms are supported by the AWS Encryption SDK, and which one is the
default?
The AWS Encryption SDK uses the Advanced Encryption Standard (AES) algorithm in Galois/Counter
Mode (GCM), known as AES-GCM. The SDK supports 256-bit, 192-bit, and 128-bit encryption keys.
In all cases, the length of the initialization vector (IV) is 12 bytes; the length of the authentication
tag is 16 bytes. By default, the SDK uses the data key as an input to the HMAC-based extract-and-
expand key derivation function (HKDF) to derive the AES-GCM encryption key, and also adds an
Elliptic Curve Digital Signature Algorithm (ECDSA) signature.
79
AWS Encryption SDK Developer Guide
For information about choosing which algorithm to use, see Supported Algorithm Suites (p. 10).
For implementation details about the supported algorithms, see Algorithms Reference (p. 96).
How is the initialization vector (IV) generated and where is it stored?
In previous releases, the AWS Encryption SDK randomly generated a unique IV value for each
encryption operation. The SDK now uses a deterministic method to construct a different IV
value for each frame so that every IV is unique within its message. The SDK stores the IV in the
encrypted message that it returns. For more information, see AWS Encryption SDK Message Format
Reference (p. 82).
How is each data key generated, encrypted, and decrypted?
The method depends on the master key provider and the implementation of its master keys. When
AWS KMS is the master key provider, the SDK uses the AWS KMS GenerateDataKey API operation
to generate each data key in both plaintext and encrypted forms. It uses the Decrypt operation to
decrypt the data key. AWS KMS encrypts and decrypts the data key by using the customer master
key (CMK) that you specified when configuring the master key provider.
How do I keep track of the data keys that were used to encrypt my data?
The AWS Encryption SDK does this for you. When you encrypt data, the SDK encrypts the data key
and stores the encrypted key along with the encrypted data in the encrypted message (p. 8) that it
returns. When you decrypt data, the AWS Encryption SDK extracts the encrypted data key from the
encrypted method, decrypts it, and then uses it to decrypt the data.
How does the AWS Encryption SDK store encrypted data keys with their encrypted data?
The encryption operations in the AWS Encryption SDK return an encrypted message (p. 8), a single
data structure that contains the encrypted data and its encrypted data keys. The message format
consists of at least two parts: a header and a body. In some cases, the message format consists of a
third part known as a footer. The message header contains the encrypted data keys and information
about how the message body is formed. The message body contains the encrypted data. The
message footer contains a signature that authenticates the message header and message body. For
more information, see AWS Encryption SDK Message Format Reference (p. 82).
How much overhead does the AWS Encryption SDK's message format add to my encrypted data?
The amount of overhead added by the AWS Encryption SDK depends on several factors, including
the following:
• The size of the plaintext data
• Which of the supported algorithms is used
• Whether additional authenticated data (AAD) is provided, and the length of that AAD
• The number and type of master key providers
• The frame size (when framed data (p. 87) is used)
When you use the AWS Encryption SDK with its default configuration, with one CMK in AWS KMS as
the master key, with no AAD, and encrypt nonframed data, the overhead is approximately 600 bytes.
In general, you can reasonably assume that the AWS Encryption SDK adds overhead of 1 KB or less,
not including the provided AAD. For more information, see AWS Encryption SDK Message Format
Reference (p. 82).
Can I use my own master key provider?
Yes. The implementation details vary depending on which of the supported programming
languages (p. 12) you use. However, all supported languages allow you to define custom
cryptographic materials managers (CMMs) (p. 7), master key providers, and master keys.
Can I encrypt data under more than one master key?
Yes. You can encrypt the data key with additional master keys to add redundancy in case a master
key is in a different region or is unavailable for decryption.
80
AWS Encryption SDK Developer Guide
To encrypt data under multiple master keys, create a master key provider with multiple master keys.
You can see examples of this pattern in the example code for Java (p. 17) and Python (p. 24).
When you encrypt data by using a master key provider that returns multiple master keys, the AWS
Encryption SDK encrypts the data that you pass to the encryption methods with a data key and
encrypts that data key with the same master key. Then, it encrypts the data with the other master
keys that the master key provider returned. The resulting message includes the encrypted data and
one encrypted data key for each master key. The resulting message can be decrypted by using any
one of the master keys used in the encryption operation.
Which data types can I encrypt with the AWS Encryption SDK?
The AWS Encryption SDK can encrypt raw bytes (byte arrays), I/O streams (byte streams), and
strings. We provide example code for each of the supported programming languages (p. 12).
How does the AWS Encryption SDK encrypt and decrypt input/output (I/O) streams?
The AWS Encryption SDK creates an encrypting or decrypting stream that wraps an underlying I/O
stream. The encrypting or decrypting stream performs a cryptographic operation on a read or write
call. For example, it can read plaintext data on the underlying stream and encrypt it before returning
the result. Or it can read ciphertext from an underlying stream and decrypt it before returning the
result. We provide example code for encrypting and decrypting streams for each of the supported
programming languages (p. 12).
81
AWS Encryption SDK Developer Guide
Message Format Reference
The information on this page is a reference for building your own encryption library that is compatible
with the AWS Encryption SDK. If you are not building your own compatible encryption library, you
likely do not need this information.
To use the AWS Encryption SDK in one of the supported programming languages, see Programming
Languages (p. 12).
The AWS Encryption SDK uses the supported algorithms (p. 10) to return a single data structure or
message that contains encrypted data and the corresponding encrypted data keys. The following topics
explain the algorithms and the data structure. Use this information to build libraries that can read and
write ciphertexts that are compatible with this SDK.
Topics
• AWS Encryption SDK Message Format Reference (p. 82)
• Body Additional Authenticated Data (AAD) Reference for the AWS Encryption SDK (p. 90)
• AWS Encryption SDK Message Format Examples (p. 91)
• AWS Encryption SDK Algorithms Reference (p. 96)
• AWS Encryption SDK Initialization Vector Reference (p. 98)
To use the AWS Encryption SDK in one of the supported programming languages, see Programming
Languages (p. 12).
The encryption operations in the AWS Encryption SDK return a single data structure or message that
contains the encrypted data (ciphertext) and all encrypted data keys. To understand this data structure,
or to build libraries that read and write it, you need to understand the message format.
The message format consists of at least two parts: a header and a body. In some cases, the message
format consists of a third part, a footer. The message format defines an ordered sequence of bytes in
network byte order, also called big-endian format. The message format begins with the header, followed
by the body, followed by the footer (when there is one).
Topics
• Header Structure (p. 83)
• Body Structure (p. 87)
82
AWS Encryption SDK Developer Guide
Header Structure
Header Structure
The message header contains the encrypted data key and information about how the message body is
formed. The following table describes the fields that form the header. The bytes are appended in the
order shown.
Header Structure
Encrypted Data Key(s) (p. 85) Variable. Determined by the number of encrypted
data keys and the length of each.
Header Authentication (p. 86) Variable. Determined by the algorithm (p. 96)
that generated the message.
Version
The version of this message format. The current version is 1.0, encoded as the byte 01 in
hexadecimal notation.
Type
The type of this message format. The type indicates the kind of structure. The only supported type
is described as customer authenticated encrypted data. Its type value is 128, encoded as byte 80 in
hexadecimal notation.
Algorithm ID
An identifier for the algorithm used. It is a 2-byte value interpreted as a 16-bit unsigned integer. For
more information about the algorithms, see AWS Encryption SDK Algorithms Reference (p. 96).
Message ID
A randomly generated 128-bit value that identifies the message. The Message ID:
83
AWS Encryption SDK Developer Guide
Header Structure
The length of the additional authenticated data (AAD). It is a 2-byte value interpreted as a 16-bit
unsigned integer that specifies the number of bytes that contain the AAD.
AAD
The additional authenticated data. The AAD is an encoding of the encryption context, an array of
key-value pairs where each key and value is a string of UTF-8 encoded characters. The encryption
context is converted to a sequence of bytes and used for the AAD value.
When the algorithms with signing (p. 96) are used, the encryption context must contain the
key-value pair {'aws-crypto-public-key', Qtxt}. Qtxt represents the elliptic curve point Q
compressed according to SEC 1 version 2.0 and then base64-encoded. The encryption context can
contain additional values, but the maximum length of the constructed AAD is 2^16 - 1 bytes.
The following table describes the fields that form the AAD. Key-value pairs are sorted, by key, in
ascending order according to UTF-8 character code. The bytes are appended in the order shown.
AAD Structure
The number of key-value pairs in the AAD. It is a 2-byte value interpreted as a 16-bit unsigned
integer that specifies the number of key-value pairs in the AAD. The maximum number of key-
value pairs in the AAD is 2^16 - 1.
Key Length
The length of the key for the key-value pair. It is a 2-byte value interpreted as a 16-bit unsigned
integer that specifies the number of bytes that contain the key.
Key
The key for the key-value pair. It is a sequence of UTF-8 encoded bytes.
Value Length
The length of the value for the key-value pair. It is a 2-byte value interpreted as a 16-bit
unsigned integer that specifies the number of bytes that contain the value.
84
AWS Encryption SDK Developer Guide
Header Structure
Value
The value for the key-value pair. It is a sequence of UTF-8 encoded bytes.
Encrypted Data Key Count
The number of encrypted data keys. It is a 2-byte value interpreted as a 16-bit unsigned integer that
specifies the number of encrypted data keys.
Encrypted Data Key(s)
A sequence of encrypted data keys. The length of the sequence is determined by the number of
encrypted data keys and the length of each. The sequence contains at least one encrypted data key.
The following table describes the fields that form each encrypted data key. The bytes are appended
in the order shown.
Key Provider ID (p. 85) Variable. Equal to the value specified in the
previous 2 bytes (Key Provider ID Length).
Key Provider Information (p. 85) Variable. Equal to the value specified in the
previous 2 bytes (Key Provider Information
Length).
Encrypted Data Key (p. 86) Variable. Equal to the value specified in the
previous 2 bytes (Encrypted Data Key Length).
The length of the key provider identifier. It is a 2-byte value interpreted as a 16-bit unsigned
integer that specifies the number of bytes that contain the key provider ID.
Key Provider ID
The key provider identifier. It is used to indicate the provider of the encrypted data key and
intended to be extensible.
Key Provider Information Length
The length of the key provider information. It is a 2-byte value interpreted as a 16-bit unsigned
integer that specifies the number of bytes that contain the key provider information.
Key Provider Information
When AWS KMS is the key provider, the following are true:
• This value contains the Amazon Resource Name (ARN) of the AWS KMS customer master key
(CMK).
• This value is always the full CMK ARN, regardless of which key identifier (key ID, alias, etc.) was
specified when calling the master key provider.
85
AWS Encryption SDK Developer Guide
Header Structure
The length of the encrypted data key. It is a 2-byte value interpreted as a 16-bit unsigned
integer that specifies the number of bytes that contain the encrypted data key.
Encrypted Data Key
The encrypted data key. It is the data encryption key encrypted by the key provider.
Content Type
Non-framed content is not broken into parts; it is a single encrypted blob. Non-framed content is
type 1, encoded as the byte 01 in hexadecimal notation.
Framed content is broken into equal-length parts; each part is encrypted separately. Framed content
is type 2, encoded as the byte 02 in hexadecimal notation.
Reserved
The length of the initialization vector (IV). It is a 1-byte value interpreted as an 8-bit unsigned
integer that specifies the number of bytes that contain the IV. This value is determined by the IV
bytes value of the algorithm (p. 96) that generated the message.
Frame Length
The length of each frame of framed content. It is a 4-byte value interpreted as a 32-bit unsigned
integer that specifies the number of bytes that form each frame. When the content is non-framed—
that is, when the value of the content type field is 1—this value must be 0.
Header Authentication
The header authentication is determined by the algorithm (p. 96) that generated the message.
The header authentication is calculated over the entire header. It consists of an IV and an
authentication tag. The bytes are appended in the order shown.
IV
The initialization vector (IV) used to calculate the header authentication tag.
Authentication Tag
The authentication value for the header. It is used to authenticate the entire contents of the
header.
86
AWS Encryption SDK Developer Guide
Body Structure
Body Structure
The message body contains the encrypted data, called the ciphertext. The structure of the body depends
on the content type (non-framed or framed). The following sections describe the format of the message
body for each content type.
Topics
• Non-Framed Data (p. 87)
• Framed Data (p. 87)
Non-Framed Data
Non-framed data is encrypted in a single blob with a unique IV and body AAD (p. 90). The following
table describes the fields that form non-framed data. The bytes are appended in the order shown.
Encrypted Content (p. 87) Variable. Equal to the value specified in the
previous 8 bytes (Encrypted Content Length).
IV
The initialization vector (IV) to use with the encryption algorithm (p. 96).
Encrypted Content Length
The length of the encrypted content, or ciphertext. It is an 8-byte value interpreted as a 64-bit
unsigned integer that specifies the number of bytes that contain the encrypted content.
Technically, the maximum allowed value is 2^63 - 1, or 8 exbibytes (8 EiB). However, in practice
the maximum value is 2^36 - 32, or 64 gibibytes (64 GiB), due to restrictions imposed by the
implemented algorithms (p. 96).
Note
The Java implementation of this SDK further restricts this value to 2^31 - 1, or 2 gibibytes
(2 GiB), due to restrictions in the language.
Encrypted Content
The encrypted content (ciphertext) as returned by the encryption algorithm (p. 96).
Authentication Tag
The authentication value for the body. It is used to authenticate the message body.
Framed Data
Framed data is divided into equal-length parts, except for the last part. Each frame is encrypted
separately with a unique IV and body AAD (p. 90).
87
AWS Encryption SDK Developer Guide
Body Structure
There are two kinds of frames: regular and final. A final frame is always used. When the length of the
data is an exact multiple of the frame length, the final frame contains no data—that is, it has a content
length of 0. When the length of the data is less than the frame length, only a final frame is written.
The following tables describe the fields that form the frames. The bytes are appended in the order
shown.
Encrypted Content (p. 88) Variable. Equal to the value specified in the Frame
Length (p. 86) of the header.
Sequence Number
The frame sequence number. It is an incremental counter number for the frame. It is a 4-byte value
interpreted as a 32-bit unsigned integer.
Framed data must start at sequence number 1. Subsequent frames must be in order and must
contain an increment of 1 of the previous frame. Otherwise, the decryption process stops and
reports an error.
IV
The initialization vector (IV) for the frame. The SDK uses a deterministic method to construct a
different IV for each frame in the message. Its length is specified by the algorithm suite (p. 96)
used.
Encrypted Content
The encrypted content (ciphertext) for the frame, as returned by the encryption algorithm (p. 96).
Authentication Tag
The authentication value for the frame. It is used to authenticate the entire frame.
Encrypted Content (p. 89) Variable. Equal to the value specified in the
previous 4 bytes (Encrypted Content Length).
88
AWS Encryption SDK Developer Guide
Footer Structure
An indicator for the final frame. The value is encoded as the 4 bytes FF FF FF FF in hexadecimal
notation.
Sequence Number
The frame sequence number. It is an incremental counter number for the frame. It is a 4-byte value
interpreted as a 32-bit unsigned integer.
Framed data must start at sequence number 1. Subsequent frames must be in order and must
contain an increment of 1 of the previous frame. Otherwise, the decryption process stops and
reports an error.
IV
The initialization vector (IV) for the frame. The SDK uses a deterministic method to construct a
different IV for each frame in the message. The length of the IV length is specified by the algorithm
suite (p. 96).
Encrypted Content Length
The length of the encrypted content. It is a 4-byte value interpreted as a 32-bit unsigned integer
that specifies the number of bytes that contain the encrypted content for the frame.
Encrypted Content
The encrypted content (ciphertext) for the frame, as returned by the encryption algorithm (p. 96).
Authentication Tag
The authentication value for the frame. It is used to authenticate the entire frame.
Footer Structure
When the algorithms with signing (p. 96) are used, the message format contains a footer. The
message footer contains a signature calculated over the message header and body. The following table
describes the fields that form the footer. The bytes are appended in the order shown.
Footer Structure
Signature Length
The length of the signature. It is a 2-byte value interpreted as a 16-bit unsigned integer that
specifies the number of bytes that contain the signature.
89
AWS Encryption SDK Developer Guide
Body AAD Reference
Signature
The signature. It is used to authenticate the header and body of the message.
To use the AWS Encryption SDK in one of the supported programming languages, see Programming
Languages (p. 12).
Regardless of which type of body data (p. 87) is used to form the message body (non-framed or
framed), you must provide additional authenticated data (AAD) to the AES-GCM algorithm (p. 96) for
each cryptographic operation. For more information about AAD, see the definition section in the Galois/
Counter Mode of Operation (GCM) specification.
The following table describes the fields that form the body AAD. The bytes are appended in the order
shown.
Body AAD Content (p. 90) Variable. See Body AAD Content in the following
list.
Message ID
The same Message ID (p. 83) value set in the message header.
Body AAD Content
For non-framed data (p. 87), use the value AWSKMSEncryptionClient Single Block.
For regular frames in framed data (p. 87), use the value AWSKMSEncryptionClient Frame.
For the final frame in framed data (p. 87), use the value AWSKMSEncryptionClient Final
Frame.
Sequence Number
For framed data (p. 87), this is the frame sequence number.
90
AWS Encryption SDK Developer Guide
Message Format Examples
For non-framed data (p. 87), use the value 1, encoded as the 4 bytes 00 00 00 01 in
hexadecimal notation.
Content Length
The length, in bytes, of the plaintext data provided to the algorithm for encryption. It is an 8-byte
value interpreted as a 64-bit unsigned integer.
To use the AWS Encryption SDK in one of the supported programming languages, see Programming
Languages (p. 12).
The following topics show examples of the AWS Encryption SDK message format. Each example shows
the raw bytes, in hexadecimal notation, followed by a description of what those bytes represent.
Topics
• Non-Framed Data (p. 91)
• Framed Data (p. 93)
Non-Framed Data
The following example shows the message format for non-framed data.
+--------+
| Header |
+--------+
01 Version (1.0)
80 Type (128, customer authenticated encrypted
data)
0378 Algorithm ID (see Algorithms Reference)
B8929B01 753D4A45 C0217F39 404F70FF Message ID (random 128-bit value)
008E AAD Length (142)
0004 AAD Key-Value Pair Count (4)
0005 AAD Key-Value Pair 1, Key Length (5)
30746869 73 AAD Key-Value Pair 1, Key ("0This")
0002 AAD Key-Value Pair 1, Value Length (2)
6973 AAD Key-Value Pair 1, Value ("is")
0003 AAD Key-Value Pair 2, Key Length (3)
31616E AAD Key-Value Pair 2, Key ("1an")
000A AAD Key-Value Pair 2, Value Length (10)
656E6372 79774690 6F6E AAD Key-Value Pair 2, Value ("encryption")
0008 AAD Key-Value Pair 3, Key Length (8)
32636F6E 74657874 AAD Key-Value Pair 3, Key ("2context")
0007 AAD Key-Value Pair 3, Value Length (7)
6578616D 706C65 AAD Key-Value Pair 3, Value ("example")
0015 AAD Key-Value Pair 4, Key Length (21)
6177732D 63727970 746F2D70 75626C69 AAD Key-Value Pair 4, Key ("aws-crypto-public-
key")
632D6B65 79
0044 AAD Key-Value Pair 4, Value Length (68)
41734738 67473949 6E4C5075 3136594B AAD Key-Value Pair 4, Value
("AsG8gG9InLPu16YKlqXTOD+nykG8YqHAhqecj8aXfD2e5B4gtVE73dZkyClA+rAMOQ==")
91
AWS Encryption SDK Developer Guide
Non-Framed Data
92
AWS Encryption SDK Developer Guide
Framed Data
Framed Data
The following example shows the message format for framed data.
+--------+
| Header |
+--------+
01 Version (1.0)
80 Type (128, customer authenticated encrypted
data)
0378 Algorithm ID (see Algorithms Reference)
93
AWS Encryption SDK Developer Guide
Framed Data
94
AWS Encryption SDK Developer Guide
Framed Data
95
AWS Encryption SDK Developer Guide
Algorithms Reference
To use the AWS Encryption SDK in one of the supported programming languages, see Programming
Languages (p. 12).
To build your own library that can read and write ciphertexts that are compatible with the AWS
Encryption SDK, you should understand how the SDK implements the supported algorithms to encrypt
raw data. The SDK supports nine algorithm suites. An implementation specifies the encryption algorithm
and mode, encryption key length, key derivation algorithm (if one applies), and signature algorithm (if
one applies). The following table contains an overview of each implementation. By default, the SDK uses
the first implementation in the following table. The list that follows the table provides more information.
96
AWS Encryption SDK Developer Guide
Algorithms Reference
Algorithm ID
A 2-byte value that uniquely identifies an algorithm's implementation. This value is stored in the
ciphertext's message header (p. 83).
Algorithm Name
The encryption algorithm used. For all algorithm suites, the SDK uses the Advanced Encryption
Standard (AES) encryption algorithm.
Data Key Length
The length of the data key. The SDK supports 256-bit, 192-bit, and 128-bit keys. The data key is
generated by a master key. For some implementations, this data key is used as input to an HMAC-
based extract-and-expand key derivation function (HKDF). The output of the HKDF is used as
the data encryption key in the encryption algorithm. For more information, see Key Derivation
Algorithm in this list.
Algorithm Mode
The mode used with the encryption algorithm. For all algorithm suites, the SDK uses Galois/Counter
Mode (GCM).
IV Length
The HMAC-based extract-and-expand key derivation function (HKDF) used to derive the data
encryption key. The SDK uses the HKDF defined in RFC 5869, with the following specifics:
• The hash function used is either SHA-384 or SHA-256, as specified by the algorithm ID.
• For the extract step:
• No salt is used. Per the RFC, this means that the salt is set to a string of zeros. The string length
is equal to the length of the hash function output; that is, 48 bytes for SHA-384 and 32 bytes
for SHA-256.
• The input keying material is the data key received from the master key provider.
97
AWS Encryption SDK Developer Guide
Initialization Vector Reference
The signature algorithm used to generate a signature over the ciphertext header and body. The SDK
uses the Elliptic Curve Digital Signature Algorithm (ECDSA) with the following specifics:
• The elliptic curve used is either the P-384 or P-256 curve, as specified by the algorithm ID. These
curves are defined in Digital Signature Standard (DSS) (FIPS PUB 186-4).
• The hash function used is SHA-384 (with the P-384 curve) or SHA-256 (with the P-256 curve).
To use the AWS Encryption SDK in one of the supported programming languages, see Programming
Languages (p. 12).
The AWS Encryption SDK supplies the initialization vectors (IVs) that are required by all supported
algorithm suites (p. 96). The SDK uses frame sequence numbers to construct an IV so that no two
frames in the same message can have the same IV.
Each IV is constructed from two big-endian byte arrays concatenated in the following order:
Before the introduction of data key caching (p. 55), the AWS Encryption SDK always used a new
data key to encrypt each message, and it generated all IVs randomly. Randomly generated IVs were
cryptographically safe because data keys were never reused. When the SDK introduced data key caching,
which intentionally reuses data keys, we changed the way the SDK generates IVs.
Using deterministic IVs that cannot repeat within a message significantly increases the number of
invocations that can safely be executed under a single data key. In addition, data keys that are cached
always use an algorithm suite with a key derivation function. Using a deterministic IV with a pseudo-
random key derivation function to derive encryption keys from a data key allows the AWS Encryption
SDK to encrypt 2^32 messages without exceeding cryptographic bounds.
98
AWS Encryption SDK Developer Guide
99