Transfer Learning
So far, we have trained accurate models on large datasets, and also downloaded a pre-trained model that we used with no training necessary. But what if we cannot find a pre-trained model that does exactly what we need, and what if we do not have a sufficiently large dataset to train a model from scratch? In this case, there is a very helpful technique we can use called transfer learning (https://blogs.nvidia.com/blog/2019/02/07/what-is-transfer-learning/).
With transfer learning, we take a pre-trained model and retrain it on a task that has some overlap with the
original training task. A good analogy for this is an artist who is skilled in one medium, such as painting, who
wants to learn to practice in another medium, such as charcoal drawing. We can imagine that the skills they
learned while painting would be very valuable in learning how to draw with charcoal.
As an example in deep learning, say we have a pre-trained model that is very good at recognizing different types of cars, and we want to train a model to recognize types of motorcycles. Much of what the car model learned would likely be very useful, for instance the ability to recognize headlights and wheels.
Transfer learning is especially powerful when we do not have a large and varied dataset. In this case, a model trained from scratch would likely memorize the training data quickly, yet fail to generalize well to new data. With transfer learning, we can increase our chances of training an accurate and robust model on a small dataset.
Objectives
In our last exercise, we used a pre-trained ImageNet (http://www.image-net.org/) model to let in all dogs, but
keep out other animals. In this exercise, we would like to create a doggy door that only lets in a particular dog.
In this case, we will make an automatic doggy door for a dog named Bo, the United States First Dog between
2009 and 2017. There are more pictures of Bo in the data/presidential_doggy_door folder.
The challenge is that the pre-trained model was not trained to recognize this specific dog, and we only have 30 pictures of Bo. If we tried to train a model from scratch using those 30 pictures, we would experience overfitting and poor generalization. However, if we start with a pre-trained model that is adept at detecting dogs, we can leverage that learning to gain a generalized understanding of Bo using our smaller dataset. We can use transfer learning to solve this challenge.
Let us start by downloading the pre-trained model. Again, this is available directly from the Keras library. This time, however, there is an important difference. The last layer of an ImageNet model is a dense layer (https://developers.google.com/machine-learning/glossary#dense-layer) of 1000 units, representing the 1000 possible classes in the dataset. In our case, we want the model to make a different classification: is this Bo or not? Because we want the classification to be different, we are going to remove the last layer of the model. We can do this by setting the flag include_top=False when downloading the model. After removing this top layer, we can add new layers that will yield the type of classification that we want:
from tensorflow import keras

base_model = keras.applications.VGG16(
    weights='imagenet',  # Load weights pre-trained on ImageNet.
    input_shape=(224, 224, 3),
    include_top=False)   # Leave off the 1000-unit classification layer.
In [2]: base_model.summary()
Model: "vgg16"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
input_1 (InputLayer) [(None, 224, 224, 3)] 0
_________________________________________________________________
block1_conv1 (Conv2D) (None, 224, 224, 64) 1792
_________________________________________________________________
block1_conv2 (Conv2D) (None, 224, 224, 64) 36928
_________________________________________________________________
block1_pool (MaxPooling2D) (None, 112, 112, 64) 0
_________________________________________________________________
block2_conv1 (Conv2D) (None, 112, 112, 128) 73856
_________________________________________________________________
block2_conv2 (Conv2D) (None, 112, 112, 128) 147584
_________________________________________________________________
block2_pool (MaxPooling2D) (None, 56, 56, 128) 0
_________________________________________________________________
block3_conv1 (Conv2D) (None, 56, 56, 256) 295168
_________________________________________________________________
block3_conv2 (Conv2D) (None, 56, 56, 256) 590080
_________________________________________________________________
block3_conv3 (Conv2D) (None, 56, 56, 256) 590080
_________________________________________________________________
block3_pool (MaxPooling2D) (None, 28, 28, 256) 0
_________________________________________________________________
block4_conv1 (Conv2D) (None, 28, 28, 512) 1180160
_________________________________________________________________
block4_conv2 (Conv2D) (None, 28, 28, 512) 2359808
_________________________________________________________________
block4_conv3 (Conv2D) (None, 28, 28, 512) 2359808
_________________________________________________________________
block4_pool (MaxPooling2D) (None, 14, 14, 512) 0
_________________________________________________________________
block5_conv1 (Conv2D) (None, 14, 14, 512) 2359808
_________________________________________________________________
block5_conv2 (Conv2D) (None, 14, 14, 512) 2359808
_________________________________________________________________
block5_conv3 (Conv2D) (None, 14, 14, 512) 2359808
_________________________________________________________________
block5_pool (MaxPooling2D) (None, 7, 7, 512) 0
=================================================================
Total params: 14,714,688
Trainable params: 14,714,688
Non-trainable params: 0
_________________________________________________________________
Freezing the base layers is as simple as setting trainable on the model to False.
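For the base_model we downloaded above, that is a single line:

base_model.trainable = False  # Freeze all of VGG16's pre-trained weights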
We can now add the new trainable layers to the pre-trained model. They will take the features from the pre-trained layers and turn them into predictions on the new dataset. We will add two layers to the model. First will be a pooling layer like we saw in our earlier convolutional neural network (https://developers.google.com/machine-learning/glossary#convolutional_layer). (If you want a more thorough understanding of the role of pooling layers in CNNs, please read this detailed blog post (https://machinelearningmastery.com/pooling-layers-for-convolutional-neural-networks/#:~:text=A%20pooling%20layer%20is%20a,Convolutional%20Layer).) We then need to add our final layer, which will classify Bo or not Bo. This will be a densely connected layer with one output.
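One way to assemble these layers with the Keras functional API, consistent with the model summary shown below, is sketched here:

inputs = keras.Input(shape=(224, 224, 3))
x = base_model(inputs, training=False)        # Run the frozen VGG16 base in inference mode
x = keras.layers.GlobalAveragePooling2D()(x)  # Pool the 7x7x512 feature maps to a 512-length vector
outputs = keras.layers.Dense(1)(x)            # One output: Bo or not Bo (no activation, so this is a logit)
model = keras.Model(inputs, outputs)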
Let us take a look at the model, now that we have combined the pre-trained model with the new layers.
In [5]: model.summary()
Model: "model"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
input_2 (InputLayer) [(None, 224, 224, 3)] 0
_________________________________________________________________
vgg16 (Model) (None, 7, 7, 512) 14714688
_________________________________________________________________
global_average_pooling2d (Gl (None, 512) 0
_________________________________________________________________
dense (Dense) (None, 1) 513
=================================================================
Total params: 14,715,201
Trainable params: 513
Non-trainable params: 14,714,688
_________________________________________________________________
Keras gives us a nice summary here, as it shows the vgg16 pre-trained model as one unit, rather than
showing all of the internal layers. It is also worth noting that we have many non-trainable parameters as we
have frozen the pre-trained model.
As with our previous exercises, we need to compile the model with loss and metrics options. We have to make some different choices here. In previous cases we had many categories in our classification problem. As a result, we picked categorical crossentropy for the calculation of our loss. In this case we only have a binary classification problem (Bo or not Bo), and so we will use binary crossentropy (https://www.tensorflow.org/api_docs/python/tf/keras/losses/BinaryCrossentropy). Further detail about the differences between the two can be found here (https://gombru.github.io/2018/05/23/cross_entropy_loss/). We will also use binary accuracy instead of traditional accuracy.
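A minimal sketch of that compile call; we pass from_logits=True because our final dense layer has no activation, and we leave the optimizer at its default:

model.compile(loss=keras.losses.BinaryCrossentropy(from_logits=True),
              metrics=[keras.metrics.BinaryAccuracy()])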
Since we are dealing with a very small dataset, it is especially important that we augment our data. As before, we will make small modifications to the existing images, which will allow the model to see a wider variety of images to learn from. This will help it learn to recognize new pictures of Bo instead of just memorizing the pictures it trains on.
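A sketch of such an augmentation setup with Keras' ImageDataGenerator; the specific transformation ranges here are illustrative choices rather than values fixed by the exercise:

from tensorflow.keras.preprocessing.image import ImageDataGenerator

datagen = ImageDataGenerator(
    samplewise_center=True,   # Normalize each image to zero mean
    rotation_range=10,        # Randomly rotate up to 10 degrees
    zoom_range=0.1,           # Randomly zoom in up to 10%
    width_shift_range=0.1,    # Randomly shift horizontally up to 10%
    height_shift_range=0.1,   # Randomly shift vertically up to 10%
    horizontal_flip=True,     # Randomly flip images left-right
    vertical_flip=False)      # Upside-down dogs are unlikely in practice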
We have seen datasets in a couple of different formats so far. In the MNIST exercise, we were able to download the dataset directly from within the Keras library. For the sign language dataset, the data was in CSV files. For this exercise, we are going to load images directly from folders using Keras' flow_from_directory (https://keras.io/api/preprocessing/image/) function. We have set up our directories to help this process go smoothly, as our labels are inferred from the folder names. In the data/presidential_doggy_door directory, we have train and valid directories, each of which has folders for images of Bo and not Bo. In the not_bo directories, we have pictures of other dogs and cats, to teach our model to keep out other pets. Feel free to explore the images to get a sense of our dataset.
Note that flow_from_directory (https://keras.io/api/preprocessing/image/) will also allow us to size our images to match the model: 224x224 pixels with 3 channels.
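A sketch of those calls, assuming the datagen from above and the directory layout just described (the batch size is an illustrative choice):

train_it = datagen.flow_from_directory(
    'data/presidential_doggy_door/train/',
    target_size=(224, 224),  # Resize images to match the model's input shape
    color_mode='rgb',
    class_mode='binary',     # Two classes: bo and not_bo
    batch_size=8)

valid_it = datagen.flow_from_directory(
    'data/presidential_doggy_door/valid/',
    target_size=(224, 224),
    color_mode='rgb',
    class_mode='binary',
    batch_size=8)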
Time to train our model and see how it does. Recall that when using a data generator, we have to explicitly set the number of steps_per_epoch.
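A sketch of that training call, using the generators from above; steps_per_epoch=12 matches the 12 steps per epoch visible in the output below, while validation_steps is an illustrative choice:

model.fit(train_it,
          validation_data=valid_it,
          steps_per_epoch=12,
          validation_steps=4,
          epochs=20)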
Epoch 1/20
12/12 [==============================] - 5s 441ms/step - loss: 1.4728 - binary_accuracy: 0.6979 - val_loss: 0.9218 - val_binary_accuracy: 0.8000
Epoch 2/20
12/12 [==============================] - 2s 187ms/step - loss: 0.4864 - binary_accuracy: 0.8242 - val_loss: 0.7172 - val_binary_accuracy: 0.8000
Epoch 3/20
12/12 [==============================] - 2s 144ms/step - loss: 0.4520 - binary_accuracy: 0.8681 - val_loss: 0.7320 - val_binary_accuracy: 0.8667
Epoch 4/20
12/12 [==============================] - 2s 155ms/step - loss: 0.3451 - binary_accuracy: 0.9062 - val_loss: 0.0649 - val_binary_accuracy: 1.0000
Epoch 5/20
12/12 [==============================] - 2s 138ms/step - loss: 0.1165 - binary_accuracy: 0.9121 - val_loss: 0.3346 - val_binary_accuracy: 0.9333
Epoch 6/20
12/12 [==============================] - 2s 139ms/step - loss: 0.0585 - binary_accuracy: 0.9670 - val_loss: 0.2352 - val_binary_accuracy: 0.9667
Epoch 7/20
12/12 [==============================] - 2s 155ms/step - loss: 0.0383 - binary_accuracy: 0.9896 - val_loss: 0.1609 - val_binary_accuracy: 0.9667
Epoch 8/20
12/12 [==============================] - 2s 141ms/step - loss: 0.0645 - binary_accuracy: 0.9780 - val_loss: 0.1469 - val_binary_accuracy: 0.9333
Epoch 9/20
12/12 [==============================] - 2s 131ms/step - loss: 0.0416 - binary_accuracy: 0.9780 - val_loss: 0.2644 - val_binary_accuracy: 0.9333
Epoch 10/20
12/12 [==============================] - 2s 149ms/step - loss: 0.0304 - binary_accuracy: 0.9896 - val_loss: 0.0225 - val_binary_accuracy: 1.0000
Epoch 11/20
12/12 [==============================] - 2s 151ms/step - loss: 0.0384 - binary_accuracy: 0.9780 - val_loss: 0.0927 - val_binary_accuracy: 0.9667
Epoch 12/20
12/12 [==============================] - 1s 124ms/step - loss: 0.0338 - binary_accuracy: 0.9780 - val_loss: 0.0147 - val_binary_accuracy: 1.0000
Epoch 13/20
12/12 [==============================] - 2s 140ms/step - loss: 0.0035 - binary_accuracy: 1.0000 - val_loss: 0.1087 - val_binary_accuracy: 0.9667
Epoch 14/20
12/12 [==============================] - 2s 151ms/step - loss: 0.0029 - binary_accuracy: 1.0000 - val_loss: 0.1541 - val_binary_accuracy: 0.9667
Epoch 15/20
12/12 [==============================] - 2s 138ms/step - loss: 0.0040 - binary_accuracy: 1.0000 - val_loss: 0.0363 - val_binary_accuracy: 1.0000
Epoch 16/20
12/12 [==============================] - 2s 151ms/step - loss: 7.8249e-04 - binary_accuracy: 1.0000 - val_loss: 0.0541 - val_binary_accuracy: 0.9667
Epoch 17/20
12/12 [==============================] - 2s 132ms/step - loss: 0.0027 - binary_accuracy: 1.0000 - val_loss: 0.0649 - val_binary_accuracy: 0.9667
Epoch 18/20
12/12 [==============================] - 2s 142ms/step - loss: 0.0051 - binary_accuracy: 1.0000 - val_loss: 0.0456 - val_binary_accuracy: 0.9667
Epoch 19/20
12/12 [==============================] - 2s 150ms/step - loss: 4.5949e-04 - binary_accuracy: 1.0000 - val_loss: 0.1243 - val_binary_accuracy: 0.9667
Epoch 20/20
12/12 [==============================] - 2s 143ms/step - loss: 0.0042 - binary_accuracy: 1.0000 - val_loss: 0.0217 - val_binary_accuracy: 1.0000
Out[9]: <tensorflow.python.keras.callbacks.History at 0x7f25782c05c0>
Discussion of Results
Both the training and validation accuracy should be quite high. This is a pretty awesome result! We were able
to train on a small dataset, but because of the knowledge transferred from the ImageNet model, it was able to
achieve high accuracy and generalize well. This means it has a very good sense of Bo and pets who are not
Bo.
If you saw some fluctuation in the validation accuracy, that is okay too. We have a technique for improving our
model in the next section.
Now that the new layers of the model are trained, we have the option to apply a final trick to improve the model, called fine-tuning (https://developers.google.com/machine-learning/glossary#f). To do this, we unfreeze the entire model and train it again with a very small learning rate (https://developers.google.com/machine-learning/glossary#learning-rate). This will cause the base pre-trained layers to take very small steps and adjust slightly, improving the model by a small amount.
Note that it is important to only do this step after the model with frozen layers has been fully trained. The
untrained pooling and classification layers that we added to the model earlier were randomly initialized. This
means they needed to be updated quite a lot to correctly classify the images. Through the process of
backpropagation (https://developers.google.com/machine-learning/glossary#backpropagation), large initial
updates in the last layers would have caused potentially large updates in the pre-trained layers as well. These
updates would have destroyed those important pre-trained features. However, now that those final layers are
trained and have converged, any updates to the model as a whole will be much smaller (especially with a very
small learning rate) and will not destroy the features of the earlier layers.
Let's try unfreezing the pre-trained layers, and then fine-tuning the model:

# Unfreeze the base model
base_model.trainable = True

# It's important to recompile your model after you make any changes
# to the `trainable` attribute of any inner layer, so that your changes
# are taken into account
model.compile(optimizer=keras.optimizers.RMSprop(learning_rate=0.00001),  # Very low learning rate
              loss=keras.losses.BinaryCrossentropy(from_logits=True),
              metrics=[keras.metrics.BinaryAccuracy()])
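Then we fit the model again for a few more epochs; a sketch, with validation_steps again an illustrative choice:

model.fit(train_it,
          validation_data=valid_it,
          steps_per_epoch=12,
          validation_steps=4,
          epochs=10)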
Epoch 1/10
12/12 [==============================] - 8s 682ms/step - loss: 0.0767 - binary_accuracy: 0.9890 - val_loss: 0.0057 - val_binary_accuracy: 1.0000
Epoch 2/10
12/12 [==============================] - 2s 176ms/step - loss: 1.0105e-04 - binary_accuracy: 1.0000 - val_loss: 0.0217 - val_binary_accuracy: 1.0000
Epoch 3/10
12/12 [==============================] - 2s 184ms/step - loss: 3.8384e-05 - binary_accuracy: 1.0000 - val_loss: 0.0427 - val_binary_accuracy: 0.9667
Epoch 4/10
12/12 [==============================] - 2s 180ms/step - loss: 0.0064 - binary_accuracy: 1.0000 - val_loss: 0.1059 - val_binary_accuracy: 0.9333
Epoch 5/10
12/12 [==============================] - 2s 171ms/step - loss: 3.4135e-04 - binary_accuracy: 1.0000 - val_loss: 0.0040 - val_binary_accuracy: 1.0000
Epoch 6/10
12/12 [==============================] - 2s 189ms/step - loss: 1.5092e-05 - binary_accuracy: 1.0000 - val_loss: 0.0222 - val_binary_accuracy: 1.0000
Epoch 7/10
12/12 [==============================] - 2s 178ms/step - loss: 1.0870e-04 - binary_accuracy: 1.0000 - val_loss: 0.0164 - val_binary_accuracy: 1.0000
Epoch 8/10
12/12 [==============================] - 2s 171ms/step - loss: 9.2534e-06 - binary_accuracy: 1.0000 - val_loss: 0.0019 - val_binary_accuracy: 1.0000
Epoch 9/10
12/12 [==============================] - 2s 168ms/step - loss: 5.3043e-06 - binary_accuracy: 1.0000 - val_loss: 0.0011 - val_binary_accuracy: 1.0000
Epoch 10/10
12/12 [==============================] - 2s 185ms/step - loss: 1.3720e-05 - binary_accuracy: 1.0000 - val_loss: 0.0048 - val_binary_accuracy: 1.0000
Now that we have a well-trained model, it is time to create our doggy door for Bo! We can start by looking at
the predictions that come from the model. We will preprocess the image in the same way we did for our last
doggy door.
import matplotlib.image as mpimg
import matplotlib.pyplot as plt
from tensorflow.keras.preprocessing import image as image_utils
from tensorflow.keras.applications.vgg16 import preprocess_input

def show_image(image_path):
    image = mpimg.imread(image_path)
    plt.imshow(image)

def make_predictions(image_path):
    show_image(image_path)
    # Load and resize the image to match the model's expected input
    image = image_utils.load_img(image_path, target_size=(224, 224))
    image = image_utils.img_to_array(image)
    # Add a batch dimension: the model expects (batch, height, width, channels)
    image = image.reshape(1, 224, 224, 3)
    # Apply VGG16's standard input preprocessing
    image = preprocess_input(image)
    preds = model.predict(image)
    return preds
In [13]: make_predictions('data/presidential_doggy_door/valid/bo/bo_20.jpg')
In [14]: make_predictions('data/presidential_doggy_door/valid/not_bo/121.jpg')
It looks like a negative prediction means that it is Bo, and a positive prediction means it is something else. We can use this information to have our doggy door only let Bo in!
Solution
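A sketch of the doggy door function, using the sign convention we just observed (the printed messages are illustrative):

def presidential_doggy_door(image_path):
    preds = make_predictions(image_path)
    if preds[0] < 0:  # Negative logit: the model thinks this is Bo
        print("It's Bo! Let him in!")
    else:             # Positive logit: some other animal
        print("That's not Bo! Stay out!")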
In [17]: presidential_doggy_door('data/presidential_doggy_door/valid/not_bo/131.jpg')
In [18]: presidential_doggy_door('data/presidential_doggy_door/valid/bo/bo_29.jpg')
Summary
Great work! With transfer learning, you have built a highly accurate model using a very small dataset. This can be an extremely powerful technique, and it can be the difference between a successful project and one that cannot get off the ground. We hope these techniques can help you out in similar situations in the future!
There is a wealth of helpful resources for transfer learning in the NVIDIA Transfer Learning Toolkit
(https://developer.nvidia.com/tlt-getting-started).
Next
So far, the focus of this workshop has primarily been on image classification. In the next section, in service of
giving you a more well-rounded introduction to deep learning, we are going to switch gears and address
working with sequential data, which requires a different approach.