0% found this document useful (0 votes)

55 views

The Functional API

The document provides an introduction to the Keras functional API, which allows for more flexible model building compared to the sequential API. It describes how to build a basic multi-layer perceptron model using the functional API by creating input, hidden, and output layers. Models can then be trained, evaluated, and used for inference just like models built with the sequential API. The functional API represents neural networks as directed acyclic graphs (DAGs) of layers, providing more control over model topology.

Uploaded by

Shubham Raj

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views

The Functional API

Uploaded by

Shubham Raj

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

8/1/2020 The Functional API

Search Keras documentation...

» Developer guides / The Functional API

About Keras

Getting started The Functional API

Developer guides Author: fchollet
Date created: 2019/03/01
Keras API reference Last modi ed: 2020/04/12
Description: Complete guide to the functional API.
Code examples
View in Colab • GitHub source
Why choose Keras?

Community & governance

Contributing to Keras
Setup
import numpy as np
import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers

Introduction
The Keras functional API is a way to create models that is more exible than the
tf.keras.Sequential API. The functional API can handle models with non-linear topology, models
with shared layers, and models with multiple inputs or outputs.

The main idea that a deep learning model is usually a directed acyclic graph (DAG) of layers. So the
functional API is a way to build graphs of layers.

Consider the following model:

(input: 784-dimensional vectors)

↧
[Dense (64 units, relu activation)]
↧
[Dense (64 units, relu activation)]
↧
[Dense (10 units, softmax activation)]
↧
(output: logits of a probability distribution over 10 classes)

This is a basic graph with three layers. To build this model using the functional API, start by creating
an input node:

inputs = keras.Input(shape=(784,))

The shape of the data is set as a 784-dimensional vector. The batch size is always omitted since
only the shape of each sample is speci ed.

If, for example, you have an image input with a shape of (32, 32, 3), you would use:

# Just for demonstration purposes.

img_inputs = keras.Input(shape=(32, 32, 3))

The inputs that is returned contains information about the shape and dtype of the input data that
you feed to your model. Here's the shape:

inputs.shape

https://keras.io/guides/functional_api/ 1/18
8/1/2020 The Functional API

TensorShape([None, 784])

Here's the dtype:

inputs.dtype

tf.float32

You create a new node in the graph of layers by calling a layer on this inputs object:

dense = layers.Dense(64, activation="relu")

x = dense(inputs)

The "layer call" action is like drawing an arrow from "inputs" to this layer you created. You're
"passing" the inputs to the dense layer, and out you get x.

Let's add a few more layers to the graph of layers:

x = layers.Dense(64, activation="relu")(x)
outputs = layers.Dense(10)(x)

At this point, you can create a Model by specifying its inputs and outputs in the graph of layers:

model = keras.Model(inputs=inputs, outputs=outputs, name="mnist_model")

Let's check out what the model summary looks like:

model.summary()

Model: "mnist_model"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
input_1 (InputLayer) [(None, 784)] 0
_________________________________________________________________
dense (Dense) (None, 64) 50240
_________________________________________________________________
dense_1 (Dense) (None, 64) 4160
_________________________________________________________________
dense_2 (Dense) (None, 10) 650
=================================================================
Total params: 55,050
Trainable params: 55,050
Non-trainable params: 0
_________________________________________________________________

You can also plot the model as a graph:

keras.utils.plot_model(model, "my_first_model.png")

https://keras.io/guides/functional_api/ 2/18
8/1/2020 The Functional API

And, optionally, display the input and output shapes of each layer in the plotted graph:

keras.utils.plot_model(model, "my_first_model_with_shape_info.png", show_shapes=True)

This gure and the code are almost identical. In the code version, the connection arrows are
replaced by the call operation.

A "graph of layers" is an intuitive mental image for a deep learning model, and the functional API is
a way to create models that closely mirror this.

Training, evaluation, and inference

Training, evaluation, and inference work exactly in the same way for models built using the
functional API as for Sequential models.

Here, load the MNIST image data, reshape it into vectors, t the model on the data (while
monitoring performance on a validation split), then evaluate the model on the test data:

https://keras.io/guides/functional_api/ 3/18
8/1/2020 The Functional API

(x_train, y_train), (x_test, y_test) = keras.datasets.mnist.load_data()

x_train = x_train.reshape(60000, 784).astype("float32") / 255

x_test = x_test.reshape(10000, 784).astype("float32") / 255

model.compile(
loss=keras.losses.SparseCategoricalCrossentropy(from_logits=True),
optimizer=keras.optimizers.RMSprop(),
metrics=["accuracy"],
)

history = model.fit(x_train, y_train, batch_size=64, epochs=2, validation_split=0.2)

test_scores = model.evaluate(x_test, y_test, verbose=2)

print("Test loss:", test_scores[0])
print("Test accuracy:", test_scores[1])

Epoch 1/2
750/750 [==============================] - 1s 1ms/step - loss: 0.3528 - accuracy: 0.9005 -
val_loss: 0.1883 - val_accuracy: 0.9457
Epoch 2/2
750/750 [==============================] - 1s 1ms/step - loss: 0.1684 - accuracy: 0.9505 -
val_loss: 0.1385 - val_accuracy: 0.9597
313/313 - 0s - loss: 0.1361 - accuracy: 0.9605
Test loss: 0.13611680269241333
Test accuracy: 0.9605000019073486

For further reading, see the training and evaluation guide.

Save and serialize

Saving the model and serialization work the same way for models built using the functional API as
they do for Sequential models. To standard way to save a functional model is to call model.save() to
save the entire model as a single le. You can later recreate the same model from this le, even if
the code that built the model is no longer available.

This saved le includes the: - model architecture - model weight values (that were learned during
training) - model training con g, if any (as passed to compile) - optimizer and its state, if any (to
restart training where you left o )

model.save("path_to_my_model")
del model
# Recreate the exact same model purely from the file:
model = keras.models.load_model("path_to_my_model")

For details, read the model serialization & saving guide.

Use the same graph of layers to deﬁne multiple models

In the functional API, models are created by specifying their inputs and outputs in a graph of layers.
That means that a single graph of layers can be used to generate multiple models.

In the example below, you use the same stack of layers to instantiate two models: an encoder model
that turns image inputs into 16-dimensional vectors, and an end-to-end autoencoder model for
training.

https://keras.io/guides/functional_api/ 4/18
8/1/2020 The Functional API

encoder_input = keras.Input(shape=(28, 28, 1), name="img")

x = layers.Conv2D(16, 3, activation="relu")(encoder_input)
x = layers.Conv2D(32, 3, activation="relu")(x)
x = layers.MaxPooling2D(3)(x)
x = layers.Conv2D(32, 3, activation="relu")(x)
x = layers.Conv2D(16, 3, activation="relu")(x)
encoder_output = layers.GlobalMaxPooling2D()(x)

encoder = keras.Model(encoder_input, encoder_output, name="encoder")

encoder.summary()

x = layers.Reshape((4, 4, 1))(encoder_output)
x = layers.Conv2DTranspose(16, 3, activation="relu")(x)
x = layers.Conv2DTranspose(32, 3, activation="relu")(x)
x = layers.UpSampling2D(3)(x)
x = layers.Conv2DTranspose(16, 3, activation="relu")(x)
decoder_output = layers.Conv2DTranspose(1, 3, activation="relu")(x)

autoencoder = keras.Model(encoder_input, decoder_output, name="autoencoder")

autoencoder.summary()

Model: "encoder"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
img (InputLayer) [(None, 28, 28, 1)] 0
_________________________________________________________________
conv2d (Conv2D) (None, 26, 26, 16) 160
_________________________________________________________________
conv2d_1 (Conv2D) (None, 24, 24, 32) 4640
_________________________________________________________________
max_pooling2d (MaxPooling2D) (None, 8, 8, 32) 0
_________________________________________________________________
conv2d_2 (Conv2D) (None, 6, 6, 32) 9248
_________________________________________________________________
conv2d_3 (Conv2D) (None, 4, 4, 16) 4624
_________________________________________________________________
global_max_pooling2d (Global (None, 16) 0
=================================================================
Total params: 18,672
Trainable params: 18,672
Non-trainable params: 0
_________________________________________________________________
Model: "autoencoder"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
img (InputLayer) [(None, 28, 28, 1)] 0
_________________________________________________________________
conv2d (Conv2D) (None, 26, 26, 16) 160
_________________________________________________________________
conv2d_1 (Conv2D) (None, 24, 24, 32) 4640
_________________________________________________________________
max_pooling2d (MaxPooling2D) (None, 8, 8, 32) 0
_________________________________________________________________
conv2d_2 (Conv2D) (None, 6, 6, 32) 9248
_________________________________________________________________
conv2d_3 (Conv2D) (None, 4, 4, 16) 4624
_________________________________________________________________
global_max_pooling2d (Global (None, 16) 0
_________________________________________________________________
reshape (Reshape) (None, 4, 4, 1) 0
_________________________________________________________________
conv2d_transpose (Conv2DTran (None, 6, 6, 16) 160
_________________________________________________________________
conv2d_transpose_1 (Conv2DTr (None, 8, 8, 32) 4640
_________________________________________________________________
up_sampling2d (UpSampling2D) (None, 24, 24, 32) 0
_________________________________________________________________
conv2d_transpose_2 (Conv2DTr (None, 26, 26, 16) 4624
_________________________________________________________________
conv2d_transpose_3 (Conv2DTr (None, 28, 28, 1) 145
=================================================================
Total params: 28,241
Trainable params: 28,241
Non-trainable params: 0
_________________________________________________________________

https://keras.io/guides/functional_api/ 5/18
8/1/2020 The Functional API

Here, the decoding architecture is strictly symmetrical to the encoding architecture, so the output
shape is the same as the input shape (28, 28, 1).

The reverse of a Conv2D layer is a Conv2DTranspose layer, and the reverse of a MaxPooling2D layer is an
UpSampling2D layer.

All models are callable, just like layers

You can treat any model as if it were a layer by invoking it on an Input or on the output of another
layer. By calling a model you aren't just reusing the architecture of the model, you're also reusing
its weights.

To see this in action, here's a di erent take on the autoencoder example that creates an encoder
model, a decoder model, and chain them in two calls to obtain the autoencoder model:

encoder_input = keras.Input(shape=(28, 28, 1), name="original_img")

encoder = keras.Model(encoder_input, encoder_output, name="encoder")

encoder.summary()

decoder_input = keras.Input(shape=(16,), name="encoded_img")

x = layers.Reshape((4, 4, 1))(decoder_input)
x = layers.Conv2DTranspose(16, 3, activation="relu")(x)
x = layers.Conv2DTranspose(32, 3, activation="relu")(x)
x = layers.UpSampling2D(3)(x)
x = layers.Conv2DTranspose(16, 3, activation="relu")(x)
decoder_output = layers.Conv2DTranspose(1, 3, activation="relu")(x)

decoder = keras.Model(decoder_input, decoder_output, name="decoder")

decoder.summary()

autoencoder_input = keras.Input(shape=(28, 28, 1), name="img")

encoded_img = encoder(autoencoder_input)
decoded_img = decoder(encoded_img)
autoencoder = keras.Model(autoencoder_input, decoded_img, name="autoencoder")
autoencoder.summary()

https://keras.io/guides/functional_api/ 6/18
8/1/2020 The Functional API

Model: "encoder"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
original_img (InputLayer) [(None, 28, 28, 1)] 0
_________________________________________________________________
conv2d_4 (Conv2D) (None, 26, 26, 16) 160
_________________________________________________________________
conv2d_5 (Conv2D) (None, 24, 24, 32) 4640
_________________________________________________________________
max_pooling2d_1 (MaxPooling2 (None, 8, 8, 32) 0
_________________________________________________________________
conv2d_6 (Conv2D) (None, 6, 6, 32) 9248
_________________________________________________________________
conv2d_7 (Conv2D) (None, 4, 4, 16) 4624
_________________________________________________________________
global_max_pooling2d_1 (Glob (None, 16) 0
=================================================================
Total params: 18,672
Trainable params: 18,672
Non-trainable params: 0
_________________________________________________________________
Model: "decoder"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
encoded_img (InputLayer) [(None, 16)] 0
_________________________________________________________________
reshape_1 (Reshape) (None, 4, 4, 1) 0
_________________________________________________________________
conv2d_transpose_4 (Conv2DTr (None, 6, 6, 16) 160
_________________________________________________________________
conv2d_transpose_5 (Conv2DTr (None, 8, 8, 32) 4640
_________________________________________________________________
up_sampling2d_1 (UpSampling2 (None, 24, 24, 32) 0
_________________________________________________________________
conv2d_transpose_6 (Conv2DTr (None, 26, 26, 16) 4624
_________________________________________________________________
conv2d_transpose_7 (Conv2DTr (None, 28, 28, 1) 145
=================================================================
Total params: 9,569
Trainable params: 9,569
Non-trainable params: 0
_________________________________________________________________
Model: "autoencoder"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
img (InputLayer) [(None, 28, 28, 1)] 0
_________________________________________________________________
encoder (Functional) (None, 16) 18672
_________________________________________________________________
decoder (Functional) (None, 28, 28, 1) 9569
=================================================================
Total params: 28,241
Trainable params: 28,241
Non-trainable params: 0
_________________________________________________________________

As you can see, the model can be nested: a model can contain sub-models (since a model is just
like a layer). A common use case for model nesting is ensembling. For example, here's how to
ensemble a set of models into a single model that averages their predictions:

https://keras.io/guides/functional_api/ 7/18
8/1/2020 The Functional API

def get_model():
inputs = keras.Input(shape=(128,))
outputs = layers.Dense(1)(inputs)
return keras.Model(inputs, outputs)

model1 = get_model()
model2 = get_model()
model3 = get_model()

inputs = keras.Input(shape=(128,))
y1 = model1(inputs)
y2 = model2(inputs)
y3 = model3(inputs)
outputs = layers.average([y1, y2, y3])
ensemble_model = keras.Model(inputs=inputs, outputs=outputs)

Manipulate complex graph topologies

Models with multiple inputs and outputs
The functional API makes it easy to manipulate multiple inputs and outputs. This cannot be
handled with the Sequential API.

For example, if you're building a system for ranking custom issue tickets by priority and routing
them to the correct department, then the model will have three inputs:

the title of the ticket (text input),

the text body of the ticket (text input), and
any tags added by the user (categorical input)

This model will have two outputs:

the priority score between 0 and 1 (scalar sigmoid output), and

the department that should handle the ticket (softmax output over the set of departments).

You can build this model in a few lines with the functional API:

num_tags = 12 # Number of unique issue tags

num_words = 10000 # Size of vocabulary obtained when preprocessing text data
num_departments = 4 # Number of departments for predictions

title_input = keras.Input(
shape=(None,), name="title"
) # Variable-length sequence of ints
body_input = keras.Input(shape=(None,), name="body") # Variable-length sequence of ints
tags_input = keras.Input(
shape=(num_tags,), name="tags"
) # Binary vectors of size `num_tags`

# Embed each word in the title into a 64-dimensional vector

title_features = layers.Embedding(num_words, 64)(title_input)
# Embed each word in the text into a 64-dimensional vector
body_features = layers.Embedding(num_words, 64)(body_input)

# Reduce sequence of embedded words in the title into a single 128-dimensional vector
title_features = layers.LSTM(128)(title_features)
# Reduce sequence of embedded words in the body into a single 32-dimensional vector
body_features = layers.LSTM(32)(body_features)

# Merge all available features into a single large vector via concatenation
x = layers.concatenate([title_features, body_features, tags_input])

# Stick a logistic regression for priority prediction on top of the features

priority_pred = layers.Dense(1, name="priority")(x)
# Stick a department classifier on top of the features
department_pred = layers.Dense(num_departments, name="department")(x)

# Instantiate an end-to-end model predicting both priority and department

model = keras.Model(
inputs=[title_input, body_input, tags_input],
outputs=[priority_pred, department_pred],
)

https://keras.io/guides/functional_api/ 8/18
8/1/2020 The Functional API

Now plot the model:

keras.utils.plot_model(model, "multi_input_and_output_model.png", show_shapes=True)

When compiling this model, you can assign di erent losses to each output. You can even assign
di erent weights to each loss -- to modulate their contribution to the total training loss.

model.compile(
optimizer=keras.optimizers.RMSprop(1e-3),
loss=[
keras.losses.BinaryCrossentropy(from_logits=True),
keras.losses.CategoricalCrossentropy(from_logits=True),
],
loss_weights=[1.0, 0.2],
)

Since the output layers have di erent names, you could also specify the loss like this:

model.compile(
optimizer=keras.optimizers.RMSprop(1e-3),
loss={
"priority": keras.losses.BinaryCrossentropy(from_logits=True),
"department": keras.losses.CategoricalCrossentropy(from_logits=True),
},
loss_weights=[1.0, 0.2],
)

Train the model by passing lists of NumPy arrays of inputs and targets:

# Dummy input data

title_data = np.random.randint(num_words, size=(1280, 10))
body_data = np.random.randint(num_words, size=(1280, 100))
tags_data = np.random.randint(2, size=(1280, num_tags)).astype("float32")

# Dummy target data

priority_targets = np.random.random(size=(1280, 1))
dept_targets = np.random.randint(2, size=(1280, num_departments))

model.fit(
{"title": title_data, "body": body_data, "tags": tags_data},
{"priority": priority_targets, "department": dept_targets},
epochs=2,
batch_size=32,
)

Epoch 1/2
40/40 [==============================] - 1s 27ms/step - loss: 1.3097 - priority_loss:
0.6958 - department_loss: 3.0697
Epoch 2/2
40/40 [==============================] - 1s 27ms/step - loss: 1.2982 - priority_loss:
0.6946 - department_loss: 3.0178

<tensorflow.python.keras.callbacks.History at 0x154d75c50>

https://keras.io/guides/functional_api/ 9/18
8/1/2020 The Functional API

When calling t with a Dataset object, it should yield either a tuple of lists like ([title_data,
body_data, tags_data], [priority_targets, dept_targets]) or a tuple of dictionaries like ({'title':
title_data, 'body': body_data, 'tags': tags_data}, {'priority': priority_targets, 'department':
dept_targets}).

For more detailed explanation, refer to the training and evaluation guide.

A toy ResNet model

In addition to models with multiple inputs and outputs, the functional API makes it easy to
manipulate non-linear connectivity topologies -- these are models with layers that are not
connected sequentially. Something the Sequential API can not handle.

A common use case for this is residual connections. Let's build a toy ResNet model for CIFAR10 to
demonstrate this:

inputs = keras.Input(shape=(32, 32, 3), name="img")

x = layers.Conv2D(32, 3, activation="relu")(inputs)
x = layers.Conv2D(64, 3, activation="relu")(x)
block_1_output = layers.MaxPooling2D(3)(x)

x = layers.Conv2D(64, 3, activation="relu", padding="same")(block_1_output)

x = layers.Conv2D(64, 3, activation="relu", padding="same")(x)
block_2_output = layers.add([x, block_1_output])

x = layers.Conv2D(64, 3, activation="relu", padding="same")(block_2_output)

x = layers.Conv2D(64, 3, activation="relu", padding="same")(x)
block_3_output = layers.add([x, block_2_output])

x = layers.Conv2D(64, 3, activation="relu")(block_3_output)
x = layers.GlobalAveragePooling2D()(x)
x = layers.Dense(256, activation="relu")(x)
x = layers.Dropout(0.5)(x)
outputs = layers.Dense(10)(x)

model = keras.Model(inputs, outputs, name="toy_resnet")

model.summary()

https://keras.io/guides/functional_api/ 10/18
8/1/2020 The Functional API

Model: "toy_resnet"
____________________________________________________________________________________________

Layer (type) Output Shape Param # Connected to

============================================================================================

img (InputLayer) [(None, 32, 32, 3)] 0

____________________________________________________________________________________________

conv2d_8 (Conv2D) (None, 30, 30, 32) 896 img[0][0]

____________________________________________________________________________________________

conv2d_9 (Conv2D) (None, 28, 28, 64) 18496 conv2d_8[0][0]

____________________________________________________________________________________________

max_pooling2d_2 (MaxPooling2D) (None, 9, 9, 64) 0 conv2d_9[0][0]

____________________________________________________________________________________________

conv2d_10 (Conv2D) (None, 9, 9, 64) 36928 max_pooling2d_2[0][0]

____________________________________________________________________________________________

conv2d_11 (Conv2D) (None, 9, 9, 64) 36928 conv2d_10[0][0]

____________________________________________________________________________________________

add (Add) (None, 9, 9, 64) 0 conv2d_11[0][0]

max_pooling2d_2[0][0]
____________________________________________________________________________________________

conv2d_12 (Conv2D) (None, 9, 9, 64) 36928 add[0][0]

____________________________________________________________________________________________

conv2d_13 (Conv2D) (None, 9, 9, 64) 36928 conv2d_12[0][0]

____________________________________________________________________________________________

add_1 (Add) (None, 9, 9, 64) 0 conv2d_13[0][0]

add[0][0]
____________________________________________________________________________________________

conv2d_14 (Conv2D) (None, 7, 7, 64) 36928 add_1[0][0]

____________________________________________________________________________________________

global_average_pooling2d (Globa (None, 64) 0 conv2d_14[0][0]

____________________________________________________________________________________________

dense_6 (Dense) (None, 256) 16640

global_average_pooling2d[0][0]
____________________________________________________________________________________________

dropout (Dropout) (None, 256) 0 dense_6[0][0]

____________________________________________________________________________________________

dense_7 (Dense) (None, 10) 2570 dropout[0][0]

============================================================================================

Total params: 223,242

Trainable params: 223,242
Non-trainable params: 0
____________________________________________________________________________________________

Plot the model:

keras.utils.plot_model(model, "mini_resnet.png", show_shapes=True)

https://keras.io/guides/functional_api/ 11/18
8/1/2020 The Functional API

Now train the model:

https://keras.io/guides/functional_api/ 12/18
8/1/2020 The Functional API

(x_train, y_train), (x_test, y_test) = keras.datasets.cifar10.load_data()

x_train = x_train.astype("float32") / 255.0

x_test = x_test.astype("float32") / 255.0
y_train = keras.utils.to_categorical(y_train, 10)
y_test = keras.utils.to_categorical(y_test, 10)

model.compile(
optimizer=keras.optimizers.RMSprop(1e-3),
loss=keras.losses.CategoricalCrossentropy(from_logits=True),
metrics=["acc"],
)
# We restrict the data to the first 1000 samples so as to limit execution time
# on Colab. Try to train on the entire dataset until convergence!
model.fit(x_train[:1000], y_train[:1000], batch_size=64, epochs=1, validation_split=0.2)

13/13 [==============================] - 1s 96ms/step - loss: 2.3007 - acc: 0.0950 -

val_loss: 2.2903 - val_acc: 0.1150

<tensorflow.python.keras.callbacks.History at 0x1559a2050>

Shared layers
Another good use for the functional API are for models that use shared layers. Shared layers are
layer instances that are reused multiple times in a same model -- they learn features that
correspond to multiple paths in the graph-of-layers.

Shared layers are often used to encode inputs from similar spaces (say, two di erent pieces of text
that feature similar vocabulary). They enable sharing of information across these di erent inputs,
and they make it possible to train such a model on less data. If a given word is seen in one of the
inputs, that will bene t the processing of all inputs that pass through the shared layer.

To share a layer in the functional API, call the same layer instance multiple times. For instance,
here's an Embedding layer shared across two di erent text inputs:

# Embedding for 1000 unique words mapped to 128-dimensional vectors

shared_embedding = layers.Embedding(1000, 128)

# Variable-length sequence of integers

text_input_a = keras.Input(shape=(None,), dtype="int32")

# Variable-length sequence of integers

text_input_b = keras.Input(shape=(None,), dtype="int32")

# Reuse the same layer to encode both inputs

encoded_input_a = shared_embedding(text_input_a)
encoded_input_b = shared_embedding(text_input_b)

Extract and reuse nodes in the graph of layers

Because the graph of layers you are manipulating is a static data structure, it can be accessed and
inspected. And this is how you are able to plot functional models as images.

This also means that you can access the activations of intermediate layers ("nodes" in the graph)
and reuse them elsewhere -- which is very useful for something like feature extraction.

Let's look at an example. This is a VGG19 model with weights pretrained on ImageNet:

vgg19 = tf.keras.applications.VGG19()

And these are the intermediate activations of the model, obtained by querying the graph data
structure:

features_list = [layer.output for layer in vgg19.layers]

https://keras.io/guides/functional_api/ 13/18
8/1/2020 The Functional API

Use these features to create a new feature-extraction model that returns the values of the
intermediate layer activations:

feat_extraction_model = keras.Model(inputs=vgg19.input, outputs=features_list)

img = np.random.random((1, 224, 224, 3)).astype("float32")

extracted_features = feat_extraction_model(img)

This comes in handy for tasks like neural style transfer, among other things.

Extend the API using custom layers

tf.keras includes a wide range of built-in layers, for example:

Convolutional layers: Conv1D, Conv2D, Conv3D, Conv2DTranspose

Pooling layers: MaxPooling1D, MaxPooling2D, MaxPooling3D, AveragePooling1D
RNN layers: GRU, LSTM, ConvLSTM2D
BatchNormalization, Dropout, Embedding, etc.

But if you don't nd what you need, it's easy to extend the API by creating your own layers. All
layers subclass the Layer class and implement:

call method, that speci es the computation done by the layer.

build method, that creates the weights of the layer (this is just a style convention since you
can create weights in __init__, as well).

To learn more about creating layers from scratch, read custom layers and models guide.

The following is a basic implementation of tf.keras.layers.Dense:

class CustomDense(layers.Layer):
def __init__(self, units=32):
super(CustomDense, self).__init__()
self.units = units

def build(self, input_shape):

self.w = self.add_weight(
shape=(input_shape[-1], self.units),
initializer="random_normal",
trainable=True,
)
self.b = self.add_weight(
shape=(self.units,), initializer="random_normal", trainable=True
)

def call(self, inputs):

return tf.matmul(inputs, self.w) + self.b

inputs = keras.Input((4,))
outputs = CustomDense(10)(inputs)

model = keras.Model(inputs, outputs)

For serialization support in your custom layer, de ne a get_config method that returns the
constructor arguments of the layer instance:

https://keras.io/guides/functional_api/ 14/18
8/1/2020 The Functional API

class CustomDense(layers.Layer):
def __init__(self, units=32):
super(CustomDense, self).__init__()
self.units = units

def build(self, input_shape):

def call(self, inputs):

return tf.matmul(inputs, self.w) + self.b

def get_config(self):
return {"units": self.units}

inputs = keras.Input((4,))
outputs = CustomDense(10)(inputs)

model = keras.Model(inputs, outputs)

config = model.get_config()

new_model = keras.Model.from_config(config, custom_objects={"CustomDense": CustomDense})

Optionally, implement the classmethod from_config(cls, config) which is used when recreating a
layer instance given its con g dictionary. The default implementation of from_config is:

def from_config(cls, config):

return cls(**config)

When to use the functional API

When should you use the Keras functional API to create a new model, or just subclass the Model
class directly? In general, the functional API is higher-level, easier and safer, and has a number of
features that subclassed models do not support.

However, model subclassing provides greater exibility when building models that are not easily
expressible as directed acyclic graphs of layers. For example, you could not implement a Tree-RNN
with the functional API and would have to subclass Model directly.

For in-depth look at the di erences between the functional API and model subclassing, read What
are Symbolic and Imperative APIs in TensorFlow 2.0?.

Functional API strengths:

The following properties are also true for Sequential models (which are also data structures), but
are not true for subclassed models (which are Python bytecode, not data structures).

Less verbose
There is no super(MyClass, self).__init__(...), no def call(self, ...):, etc.

Compare:

inputs = keras.Input(shape=(32,))
x = layers.Dense(64, activation='relu')(inputs)
outputs = layers.Dense(10)(x)
mlp = keras.Model(inputs, outputs)

With the subclassed version:

https://keras.io/guides/functional_api/ 15/18
8/1/2020 The Functional API

class MLP(keras.Model):

def init(self, **kwargs):

super(MLP, self).__init__(**kwargs)
self.dense_1 = layers.Dense(64, activation='relu')
self.dense_2 = layers.Dense(10)

def call(self, inputs):

x = self.dense_1(inputs)
return self.dense_2(x)

# Instantiate the model.

mlp = MLP()
# Necessary to create the model's state.
# The model doesn't have a state until it's called at least once.
_ = mlp(tf.zeros((1, 32)))

Model validation while deﬁning its connectivity graph

In the functional API, the input speci cation (shape and dtype) is created in advance (using Input).
Every time you call a layer, the layer checks that the speci cation passed to it matches its
assumptions, and it will raise a helpful error message if not.

This guarantees that any model you can build with the functional API will run. All debugging -- other
than convergence-related debugging -- happens statically during the model construction and not at
execution time. This is similar to type checking in a compiler.

A functional model is plottable and inspectable

You can plot the model as a graph, and you can easily access intermediate nodes in this graph. For
example, to extract and reuse the activations of intermediate layers (as seen in a previous
example):

features_list = [layer.output for layer in vgg19.layers]

feat_extraction_model = keras.Model(inputs=vgg19.input, outputs=features_list)

A functional model can be serialized or cloned

Because a functional model is a data structure rather than a piece of code, it is safely serializable
and can be saved as a single le that allows you to recreate the exact same model without having
access to any of the original code. See the serialization & saving guide.

To serialize a subclassed model, it is necessary for the implementer to specify a get_config() and
from_config() method at the model level.

Functional API weakness:

It does not support dynamic architectures
The functional API treats models as DAGs of layers. This is true for most deep learning
architectures, but not all -- for example, recursive networks or Tree RNNs do not follow this
assumption and cannot be implemented in the functional API.

Mix-and-match API styles

Choosing between the functional API or Model subclassing isn't a binary decision that restricts you
into one category of models. All models in the tf.keras API can interact with each other, whether
they're Sequential models, functional models, or subclassed models that are written from scratch.

You can always use a functional model or Sequential model as part of a subclassed model or layer:

https://keras.io/guides/functional_api/ 16/18
8/1/2020 The Functional API

units = 32
timesteps = 10
input_dim = 5

# Define a Functional model

inputs = keras.Input((None, units))
x = layers.GlobalAveragePooling1D()(inputs)
outputs = layers.Dense(1)(x)
model = keras.Model(inputs, outputs)

class CustomRNN(layers.Layer):
def __init__(self):
super(CustomRNN, self).__init__()
self.units = units
self.projection_1 = layers.Dense(units=units, activation="tanh")
self.projection_2 = layers.Dense(units=units, activation="tanh")
# Our previously-defined Functional model
self.classifier = model

def call(self, inputs):

outputs = []
state = tf.zeros(shape=(inputs.shape[0], self.units))
for t in range(inputs.shape[1]):
x = inputs[:, t, :]
h = self.projection_1(x)
y = h + self.projection_2(state)
state = y
outputs.append(y)
features = tf.stack(outputs, axis=1)
print(features.shape)
return self.classifier(features)

rnn_model = CustomRNN()
_ = rnn_model(tf.zeros((1, timesteps, input_dim)))

(1, 10, 32)

You can use any subclassed layer or model in the functional API as long as it implements a call
method that follows one of the following patterns:

call(self, inputs, **kwargs) -- Where inputs is a tensor or a nested structure of tensors (e.g.
a list of tensors), and where **kwargs are non-tensor arguments (non-inputs).
call(self, inputs, training=None, **kwargs) -- Where training is a boolean indicating
whether the layer should behave in training mode and inference mode.
call(self, inputs, mask=None, **kwargs) -- Where mask is a boolean mask tensor (useful for
RNNs, for instance).
call(self, inputs, training=None, mask=None, **kwargs) -- Of course, you can have both
masking and training-speci c behavior at the same time.

Additionally, if you implement the get_config method on your custom Layer or model, the
functional models you create will still be serializable and cloneable.

Here's a quick example of a custom RNN, written from scratch, being used in a functional model:

https://keras.io/guides/functional_api/ 17/18
8/1/2020 The Functional API

units = 32
timesteps = 10
input_dim = 5
batch_size = 16

def call(self, inputs):

# Note that you specify a static batch size for the inputs with the `batch_shape`
# arg, because the inner computation of `CustomRNN` requires a static batch size
# (when you create the `state` zeros tensor).
inputs = keras.Input(batch_shape=(batch_size, timesteps, input_dim))
x = layers.Conv1D(32, 3)(inputs)
outputs = CustomRNN()(x)

model = keras.Model(inputs, outputs)

rnn_model = CustomRNN()
_ = rnn_model(tf.zeros((1, 10, 5)))

https://keras.io/guides/functional_api/ 18/18

CAPE Math Formula Booklet REVISED 2022
No ratings yet
CAPE Math Formula Booklet REVISED 2022
11 pages
Keras - Functional API
No ratings yet
Keras - Functional API
18 pages
KT 01 Intro2Keras
No ratings yet
KT 01 Intro2Keras
24 pages
01 Neural Network Regression With Tensorflow
No ratings yet
01 Neural Network Regression With Tensorflow
20 pages
Build The Model
No ratings yet
Build The Model
24 pages
Aurell Khanza Geizka_10521281_4PA41_M4
No ratings yet
Aurell Khanza Geizka_10521281_4PA41_M4
9 pages
Introduction To Deep Learning-Session3: Ravi Shukla
No ratings yet
Introduction To Deep Learning-Session3: Ravi Shukla
21 pages
Laporan M4
No ratings yet
Laporan M4
9 pages
Module 4
No ratings yet
Module 4
54 pages
Keras - Layers
No ratings yet
Keras - Layers
17 pages
Lecture - 4 Arrays in Java
No ratings yet
Lecture - 4 Arrays in Java
24 pages
Unit 2
No ratings yet
Unit 2
10 pages
Program 10: Objective
No ratings yet
Program 10: Objective
44 pages
CNN Case Studies With Dropout Layer
No ratings yet
CNN Case Studies With Dropout Layer
2 pages
DL7 3
No ratings yet
DL7 3
16 pages
09-Neural Networks
No ratings yet
09-Neural Networks
17 pages
A First Look On Nueral Network
No ratings yet
A First Look On Nueral Network
8 pages
Keras1 - 1.4 Advanced Model Architectures
No ratings yet
Keras1 - 1.4 Advanced Model Architectures
11 pages
_ Databricks & PySpark learning day-10
No ratings yet
_ Databricks & PySpark learning day-10
4 pages
Unit 5
No ratings yet
Unit 5
12 pages
CNN TF Keras
No ratings yet
CNN TF Keras
6 pages
Document 1
No ratings yet
Document 1
12 pages
Assignment 2
No ratings yet
Assignment 2
8 pages
Synopsys Part2
No ratings yet
Synopsys Part2
3 pages
Tensorflow and Keras (Draft) : KH Wong
No ratings yet
Tensorflow and Keras (Draft) : KH Wong
24 pages
Bca-Hc-4036: Object Oriented Programminf in C++
No ratings yet
Bca-Hc-4036: Object Oriented Programminf in C++
4 pages
Iris - Copy1 - Jupyter Notebook
No ratings yet
Iris - Copy1 - Jupyter Notebook
8 pages
TensorFlow With R
No ratings yet
TensorFlow With R
46 pages
17 Examples Using Structs
No ratings yet
17 Examples Using Structs
30 pages
Intro To Deep Learning With TensorFlow - Introduction To TensorFlow Cheatsheet - Codecademy
No ratings yet
Intro To Deep Learning With TensorFlow - Introduction To TensorFlow Cheatsheet - Codecademy
8 pages
07 Stack Implementation
No ratings yet
07 Stack Implementation
23 pages
CNN and RNN code
No ratings yet
CNN and RNN code
10 pages
Confusion Matrix
No ratings yet
Confusion Matrix
6 pages
Image Classification Using Convolutional Neural Network With Python
No ratings yet
Image Classification Using Convolutional Neural Network With Python
8 pages
Java Script (Functions, Arrays & Objects)
No ratings yet
Java Script (Functions, Arrays & Objects)
31 pages
Dinosaurus Island - Character-Level Language Model - (Final) - Learners - Ipynb
No ratings yet
Dinosaurus Island - Character-Level Language Model - (Final) - Learners - Ipynb
10 pages
JavaScript Interview Questions
No ratings yet
JavaScript Interview Questions
149 pages
Feature Extraction in TorchVision Using Torch FX - PyTorch
No ratings yet
Feature Extraction in TorchVision Using Torch FX - PyTorch
9 pages
Adobe Scan Jan 05, 2024
No ratings yet
Adobe Scan Jan 05, 2024
8 pages
1 An Introduction To Machine Learning With Scikit Learn
No ratings yet
1 An Introduction To Machine Learning With Scikit Learn
2 pages
Keras
No ratings yet
Keras
3 pages
Keras-tensorflow-IT Haarlem 2023
No ratings yet
Keras-tensorflow-IT Haarlem 2023
35 pages
FJP Unit 3
No ratings yet
FJP Unit 3
29 pages
Introduction To Core Java (At A Glance)
No ratings yet
Introduction To Core Java (At A Glance)
72 pages
String in CPP
No ratings yet
String in CPP
9 pages
Deep Learning Lab Mannual
No ratings yet
Deep Learning Lab Mannual
6 pages
Introduction To Keras
No ratings yet
Introduction To Keras
14 pages
L7 - Functional API
No ratings yet
L7 - Functional API
14 pages
DL_0801CS223D04_Assignment5.ipynb - Colab
No ratings yet
DL_0801CS223D04_Assignment5.ipynb - Colab
15 pages
2.3 Section Wrap-Up - 2. Handling Two-Dimensional Data - MATLAB Essentials - Edx
No ratings yet
2.3 Section Wrap-Up - 2. Handling Two-Dimensional Data - MATLAB Essentials - Edx
3 pages
Lab_Manual_OOP
No ratings yet
Lab_Manual_OOP
40 pages
Dimensionality - Reduction - Principal - Component - Analysis - Ipynb at Master Llsourcell - Dimensionality - Reduction GitHub
No ratings yet
Dimensionality - Reduction - Principal - Component - Analysis - Ipynb at Master Llsourcell - Dimensionality - Reduction GitHub
14 pages
A Neural Network Model Using Python
No ratings yet
A Neural Network Model Using Python
10 pages
Computer Notes
No ratings yet
Computer Notes
17 pages
ScalaMulti
No ratings yet
ScalaMulti
48 pages
Assignment 4x
No ratings yet
Assignment 4x
19 pages
Hendra Bayu - 12419795 - Robot M3
No ratings yet
Hendra Bayu - 12419795 - Robot M3
5 pages
python Function and string
No ratings yet
python Function and string
31 pages
First Model Exam 2021-22 - Semester 2
No ratings yet
First Model Exam 2021-22 - Semester 2
7 pages
Adobe Scan 10-Oct-2024
No ratings yet
Adobe Scan 10-Oct-2024
4 pages
120 Advanced JavaScript Interview Questions
From Everand
120 Advanced JavaScript Interview Questions
Hernando Abella
No ratings yet
Choosing Correct Statistical Tests
0% (1)
Choosing Correct Statistical Tests
3 pages
FX Broker Buster Manual v1.0
No ratings yet
FX Broker Buster Manual v1.0
9 pages
Possible Questions For The Final Exam - CEE 3500 - Fall 2005 Part 2 - Fluid Dynamics, Similitude, Pipe Flow
No ratings yet
Possible Questions For The Final Exam - CEE 3500 - Fall 2005 Part 2 - Fluid Dynamics, Similitude, Pipe Flow
5 pages
Integrated Chemistry Problems Answers
No ratings yet
Integrated Chemistry Problems Answers
3 pages
Female Pelvis-Clinical Anatomy
No ratings yet
Female Pelvis-Clinical Anatomy
24 pages
3.2 - Mixtures & Alligations - Qns Only
No ratings yet
3.2 - Mixtures & Alligations - Qns Only
7 pages
Context of Cryptography: Confidentiality
No ratings yet
Context of Cryptography: Confidentiality
13 pages
Trigo Bridging Final
No ratings yet
Trigo Bridging Final
9 pages
Chapter 11 - Section B - Non-Numerical Solutions: T T T P P P
No ratings yet
Chapter 11 - Section B - Non-Numerical Solutions: T T T P P P
8 pages
EC40EB Cabinet Product Description
100% (1)
EC40EB Cabinet Product Description
28 pages
Pressure Switch MFD
No ratings yet
Pressure Switch MFD
2 pages
6.Haloalkane and Haloareans RBSE PYQ Edition 2025
No ratings yet
6.Haloalkane and Haloareans RBSE PYQ Edition 2025
7 pages
Sketchuptolayout2015contents PDF
No ratings yet
Sketchuptolayout2015contents PDF
4 pages
0607_s12_qp_6
No ratings yet
0607_s12_qp_6
12 pages
Respiratory Examination
No ratings yet
Respiratory Examination
18 pages
NAITIK PROJECT
No ratings yet
NAITIK PROJECT
19 pages
Schematic Diagram
No ratings yet
Schematic Diagram
19 pages
Water Dispersed Polymers For Textile Conservation: A Molecular, Thermal, Structural, Mechanical and Optical Characterisation
No ratings yet
Water Dispersed Polymers For Textile Conservation: A Molecular, Thermal, Structural, Mechanical and Optical Characterisation
8 pages
Log
No ratings yet
Log
25 pages
Quantum Dots in Waste Water Treatment - Review Article
No ratings yet
Quantum Dots in Waste Water Treatment - Review Article
7 pages
ABS Pipes and Fittings
No ratings yet
ABS Pipes and Fittings
10 pages
Advanced Keylogger For Ethical Hacking
No ratings yet
Advanced Keylogger For Ethical Hacking
6 pages
Philips TL 25
No ratings yet
Philips TL 25
6 pages
Intro To LV in 3 Hours For Control and Sim
100% (5)
Intro To LV in 3 Hours For Control and Sim
81 pages
Tissue Management
No ratings yet
Tissue Management
35 pages
Ettercap PDF
No ratings yet
Ettercap PDF
13 pages
Backend Development Test 1
No ratings yet
Backend Development Test 1
49 pages
ASUS A6Rp Schematic Diagrams
No ratings yet
ASUS A6Rp Schematic Diagrams
50 pages
CLP-5xx TechMaterial Seftraining
No ratings yet
CLP-5xx TechMaterial Seftraining
70 pages