
Brief introduction of MobileNetV1 / V2 / V3 lightweight networks

developpaper.com/brief-introduction-of-mobilenetv1-v2-v3-lightweight-network

July 31, 2020

The MobileNet series, from Google, is a very important family of lightweight networks.
MobileNetV1 uses depthwise separable convolutions to build a lightweight network.
MobileNetV2 proposes the innovative inverted residual with linear bottleneck unit;
although it has more layers, both the accuracy and the speed of the overall network are
improved. MobileNetV3 uses AutoML techniques and manual fine-tuning to build an even
more lightweight network.

  Source: Xiaofei's Algorithm Engineering Notes (WeChat official account)

MobileNetV1

Paper: MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

Paper address: http://arxiv.org/pdf/1704.04861.pdf

Introduction

MobileNet constructs a very lightweight, low-latency model based on depthwise separable
convolutions, and the model size can be further controlled through two hyperparameters.
The model can be deployed on mobile and embedded devices, which gives it great practical
significance.

Depthwise Separable Convolution

Suppose the input of a standard convolution is a $D_F \times D_F \times M$ feature map
$\mathbb{F}$ and the output is a $D_F \times D_F \times N$ feature map $\mathbb{G}$, with
a convolution kernel $\mathbb{K}$ of size $D_K \times D_K \times M \times N$. The output
feature map is then computed as:

$$\mathbb{G}_{k,l,n} = \sum_{i,j,m} \mathbb{K}_{i,j,m,n} \cdot \mathbb{F}_{k+i-1,\,l+j-1,\,m}$$

The computation cost is:

$$D_K \cdot D_K \cdot M \cdot N \cdot D_F \cdot D_F$$

The cost depends multiplicatively on the input dimension $M$, the output dimension $N$,
the kernel size $D_K$ and the feature map size $D_F$.

MobileNet reduces this cost through the depthwise separable convolution, which factorizes
the standard convolution into a depthwise convolution and a $1 \times 1$ pointwise
convolution, each followed by BN and ReLU. In the depthwise convolution, each input
channel has its own convolution kernel. For the same input, the output feature map of
the depthwise convolution is computed as:

$$\hat{\mathbb{G}}_{k,l,m} = \sum_{i,j} \hat{\mathbb{K}}_{i,j,m} \cdot \mathbb{F}_{k+i-1,\,l+j-1,\,m}$$

Here $\hat{\mathbb{K}}$ is the $D_K \times D_K \times M$ depthwise convolution kernel;
the $m_{th}$ filter of $\hat{\mathbb{K}}$ is applied to the $m_{th}$ channel of the input
$\mathbb{F}$ to produce the $m_{th}$ channel of the output $\hat{\mathbb{G}}$. The
computation cost of the depthwise convolution is:

$$D_K \cdot D_K \cdot M \cdot D_F \cdot D_F$$

Although the depthwise convolution is efficient, it does not combine information across
input channels, so an additional layer is needed to linearly combine its outputs: a
$1 \times 1$ pointwise convolution generates the new feature map. Together the two form
the depthwise separable convolution, whose computation cost is:

$$D_K \cdot D_K \cdot M \cdot D_F \cdot D_F + M \cdot N \cdot D_F \cdot D_F$$

The ratio of the depthwise separable cost to the standard convolution cost is:

$$\frac{D_K \cdot D_K \cdot M \cdot D_F \cdot D_F + M \cdot N \cdot D_F \cdot D_F}{D_K \cdot D_K \cdot M \cdot N \cdot D_F \cdot D_F} = \frac{1}{N} + \frac{1}{D_K^2}$$

MobileNet uses $3 \times 3$ depthwise separable convolutions, so the computation cost is
reduced by 8-9 times compared with standard convolutions, with only a slight drop in
accuracy.
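As a sanity check on the formulas above, a small helper (illustrative only, not from the paper's code) can compare the two costs for a typical MobileNetV1 layer:

```python
# FLOP estimates for standard vs. depthwise separable convolution.
# D_F: feature map side, D_K: kernel side, M: input channels, N: output channels.

def standard_conv_flops(d_f, d_k, m, n):
    # D_K * D_K * M * N * D_F * D_F
    return d_k * d_k * m * n * d_f * d_f

def separable_conv_flops(d_f, d_k, m, n):
    depthwise = d_k * d_k * m * d_f * d_f  # one D_K x D_K filter per input channel
    pointwise = m * n * d_f * d_f          # 1x1 convolution combining channels
    return depthwise + pointwise

# A typical layer: 14x14 feature map, 3x3 kernel, 512 -> 512 channels.
std = standard_conv_flops(14, 3, 512, 512)
sep = separable_conv_flops(14, 3, 512, 512)
print(std / sep)  # about 8.8, i.e. the inverse of 1/N + 1/D_K^2
```

With $N = 512$ and $D_K = 3$ the ratio is $1/(1/512 + 1/9) \approx 8.8$, which is where the "8-9 times" figure comes from.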

Network Structure and Training

The structure of MobileNet is shown in Table 1. Except for the first layer, all layers are
depthwise separable convolutions, and every layer except the final fully connected layer
is followed by BN and ReLU, for a total of 28 layers.

The paper notes that the efficiency of a network cannot be judged by the computation cost
alone; it also depends on how the operations are implemented. As Table 2 shows, most of
MobileNet's computation and parameters lie in the pointwise convolutions, which have
efficient implementations on both CPU and GPU devices. The paper also describes the
training settings in some detail; interested readers can consult the original text.

Width Multiplier: Thinner Models

Although MobileNet is already very lightweight, the width multiplier $\alpha$ can shrink
it further. The input and output dimensions of each layer become $\alpha M$ and
$\alpha N$, and the scaled computation cost is:

$$D_K \cdot D_K \cdot \alpha M \cdot D_F \cdot D_F + \alpha M \cdot \alpha N \cdot D_F \cdot D_F$$

For $\alpha \in (0,1]$, the computation cost scales roughly by $\alpha^2$, letting users
trade off accuracy against speed for the task at hand.

Resolution Multiplier: Reduced Representation

MobileNet can also scale the model through the resolution multiplier $\rho$. Combined with
the width multiplier $\alpha$, the scaled computation cost is:

$$D_K \cdot D_K \cdot \alpha M \cdot \rho D_F \cdot \rho D_F + \alpha M \cdot \alpha N \cdot \rho D_F \cdot \rho D_F$$

For $\rho \in (0,1]$, the computation cost scales by roughly $\rho^2$.
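The effect of both multipliers can be seen directly in a small cost calculator (helper names are ours, not the paper's):

```python
# Depthwise separable FLOPs with width multiplier alpha and resolution multiplier rho.

def separable_flops(d_f, d_k, m, n, alpha=1.0, rho=1.0):
    m, n = int(alpha * m), int(alpha * n)  # thin every layer by alpha
    d_f = int(rho * d_f)                   # shrink the feature map by rho
    return d_k * d_k * m * d_f * d_f + m * n * d_f * d_f

base       = separable_flops(14, 3, 512, 512)
half_width = separable_flops(14, 3, 512, 512, alpha=0.5)
half_res   = separable_flops(14, 3, 512, 512, rho=0.5)
print(half_width / base)  # close to 0.25 = alpha^2 (the pointwise term dominates)
print(half_res / base)    # exactly 0.25 = rho^2
```

Resolution scaling gives exactly $\rho^2$ because both terms contain $D_F^2$, while width scaling is only approximately $\alpha^2$ because the depthwise term scales linearly in $\alpha$.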

The paper also compares the effects of the depthwise separable convolution, the width
multiplier and the resolution multiplier.

Experiments
The experiments in the MobileNet paper are very thorough, with performance comparisons on
a variety of tasks; only part of the results are listed here, and the details can be
found in the original text. Unfortunately, there are no inference-time comparisons.

The full-convolution version of MobileNet is compared against the depthwise separable
version.

The effect of width scaling is compared against directly removing the last five
$14 \times 14 \times 512$ depthwise separable layers.

The effect of different width multipliers is compared.

The effect of different resolution multipliers is compared.

Conclusion

MobileNet uses depthwise separable convolutions to construct a lightweight network,
reducing the parameter count and computation cost by roughly 8 times without a
significant drop in accuracy, which is of great practical significance.

MobileNetV2

Paper: MobileNetV2: Inverted Residuals and Linear Bottlenecks

Paper address: http://arxiv.org/pdf/1801.04381.pdf

Introduction

MobileNetV2 proposes a new layer unit, the inverted residual with linear bottleneck. The
structure resembles a residual network unit and contains a shortcut, but its input and
output dimensions are small: internally, a pointwise convolution first expands the
dimension, a depthwise convolution then extracts features, and a linear mapping finally
reduces the dimension back down. This preserves network performance while making the
network lighter.

Linear Bottlenecks

The key information in the high-dimensional features of a neural network is distributed
sparsely and can be represented by compact low-dimensional features, so in theory the
dimension of the operating space can be reduced by reducing the dimension of the layer
outputs. However, nonlinear activations in the layer can break this theory, so the
nonlinear operation on the low-dimensional features is removed.

From the properties of ReLU, wherever the output is non-zero the layer acts as a linear
transformation of the input space: part of the input space undergoes a linear change, and
the network only processes these non-zero outputs. Since the key information of a feature
is usually non-zero after ReLU, ReLU can be regarded as a linear operation on the key
(low-dimensional) information.

In the paper, a two-dimensional input is linearly mapped up to $d$ dimensions by a matrix
$T$, passed through a ReLU nonlinearity, and then mapped back to two dimensions by
$T^{-1}$. The visualization shows that the lower the dimension, the more information ReLU
loses. This suggests that if the input features of a nonlinear operation are to be
compressible into lower-dimensional features, the dimension of the input features must be
large enough to preserve the complete information through the nonlinearity.

Assuming the key information output by a layer can be represented by low-dimensional
features, a linear bottleneck can be used to extract it, as shown in Fig. 2c: a pointwise
convolution after the depthwise convolution reduces the dimension, but no nonlinear
activation is used after the reduction; only the high-dimensional features are activated
nonlinearly. Fig. 2d is structure c shifted by one stage; the two together form a
complete MobileNetV2 bottleneck: first a pointwise convolution increases the dimension,
then a depthwise convolution extracts features, and finally a pointwise convolution
reduces the dimension. The ratio of the dimension increase is called the expansion ratio.
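The expand-depthwise-project layout can be sized with a back-of-the-envelope parameter count (a sketch of ours that ignores BN parameters; not the official implementation):

```python
# Parameter count of one inverted residual block:
# 1x1 expansion -> 3x3 depthwise -> 1x1 linear projection.

def inverted_residual_params(c_in, c_out, expansion=6, k=3):
    hidden = c_in * expansion
    expand = c_in * hidden      # 1x1 pointwise, raises the dimension
    depthwise = k * k * hidden  # one k x k filter per hidden channel
    project = hidden * c_out    # 1x1 pointwise, linear bottleneck (no ReLU after)
    return expand + depthwise + project

# e.g. a stride-1 block with 24 input/output channels and expansion ratio 6
print(inverted_residual_params(24, 24))
```

Note that almost all parameters sit in the two pointwise convolutions; the depthwise layer is nearly free, which is why expanding 6x inside the block stays affordable.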

Inverted residuals

MobileNetV2's residual block is similar to ResNet's; the point is better gradient
back-propagation and feature reuse. The difference is that MobileNetV2's shortcut
connects the bottleneck features, i.e. the features with the smaller dimension. As
described above, the low-dimensional features already contain all the necessary
information, while the expansion layer is merely a means of realizing the nonlinear
transformation.

The operations and input/output sizes of the residual block are shown in Table 1.
Although there is one more pointwise convolution than in MobileNetV1, this structure
allows much smaller input and output dimensions, and the comparison in Table 3 shows that
MobileNetV2 uses less memory. The expansion ratio also admits several structural
variants: a ratio of 1 makes the expansion an identity mapping, while a ratio smaller
than 1 turns the unit into a classical ResNet-style residual block.

Model Architecture

The MobileNetV2 unit comes in two types: stride = 1 and stride = 2.

The overall structure of MobileNetV2 is shown in Table 2. It is built by stacking the
structure of Fig. 4d, with an ordinary convolution as the first layer. In addition, the
width multiplier and resolution multiplier can be used to trade off accuracy against
latency.

Experiments

The paper compares the performance of MobileNetV2 with other networks on image
classification.

The paper compares the performance of MobileNetV2 with other networks on object
detection.

The paper compares the performance of MobileNetV2 with other networks on semantic
segmentation.

In addition, the paper verifies the improvement brought by the inverted residual with
linear bottleneck.

Conclusions

MobileNetV2 builds its lightweight network on the inverted residual with linear
bottleneck. The overall structure, including the inverted residual and the expansion
layer, is quite innovative, and the analysis of linear bottlenecks is also very
enlightening. To this day, many on-device algorithms still use MobileNetV2 as their
backbone network.

MobileNetV3

Paper: Searching for MobileNetV3

Paper address: http://arxiv.org/pdf/1905.02244.pdf

Introduction

MobileNetV3 is built with AutoML and then optimized by manual fine-tuning. Platform-aware
NAS and NetAdapt are used for global search and local (layer-wise) search respectively,
while manual tuning adjusts the structure of the network's first and last layers, adds an
SE module to the bottleneck, and proposes the computationally efficient h-swish nonlinear
activation.

Network Search

MobileNetV3 first uses MnasNet's platform-aware NAS to search the structure of each
block, largely following MnasNet's settings. Platform-aware NAS uses the weighted product
of accuracy and measured latency, $ACC(m) \times [LAT(m)/TAR]^w$, as the optimization
objective to approach Pareto optimality (accuracy and latency cannot both be improved at
the same time). In practice, it was found that for small models accuracy changes much
more sharply with latency, so $w = -0.07$ was changed to $w = -0.15$ to increase the
penalty on latency increases.
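The multi-objective reward above is simple enough to write out directly; the accuracy and latency values below are made up for illustration:

```python
# Platform-aware NAS objective: ACC(m) * [LAT(m) / TAR] ** w.
# TAR is the target latency; w < 0 penalizes models slower than the target.

def nas_objective(acc, lat, tar, w=-0.07):
    return acc * (lat / tar) ** w

# At exactly the target latency, the reward equals the raw accuracy.
print(nas_objective(0.75, lat=80.0, tar=80.0))  # 0.75
# A more negative w (the small-model setting) penalizes slow models harder.
print(nas_objective(0.75, lat=120.0, tar=80.0, w=-0.07))
print(nas_objective(0.75, lat=120.0, tar=80.0, w=-0.15))
```

Since `lat / tar > 1` for a slow model and `w` is negative, the reward drops below the raw accuracy, and dropping `w` from -0.07 to -0.15 makes that drop steeper.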

  After the preliminary network search, the paper uses NetAdapt to adjust the network
layer by layer, complementing MnasNet's search method. The specific steps of NetAdapt are
as follows:

1. Start from the seed network found by the MnasNet-style search.
2. Generate a new set of proposals, each representing a modification of the seed network
that must reduce latency by at least $\delta = 0.01$ times the previous latency.
3. For each proposal, initialize the parameters from the trained model of the previous
step, randomly initialize any missing parameters, and finetune for $T = 10000$ steps to
get an approximate accuracy.
4. Select the best proposal according to the chosen metric.
5. Repeat steps 2-4 until the target latency is reached.

The original NetAdapt uses latency as the metric in step 4; the paper changes it to the
ratio of accuracy change to latency change, $\frac{\Delta Acc}{|\Delta Latency|}$, which
achieves a better trade-off. The proposals must still satisfy the latency constraint of
step 2. In addition to NetAdapt's original modification of convolution kernels, the
proposals include the following two types:

Reduce the size of any expansion layer


Reduce the size of all bottlenecks of the same size
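The selection rule in step 4 can be sketched as a toy loop body; the proposal names and numbers below are invented for illustration:

```python
# NetAdapt's modified step 4: among proposals that each reduce latency,
# pick the one maximizing delta_acc / |delta_latency|.

def best_proposal(proposals):
    # each proposal: (name, delta_acc, delta_latency) with delta_latency < 0
    return max(proposals, key=lambda p: p[1] / abs(p[2]))

proposals = [
    ("shrink expansion layer 7",        -0.002, -3.0),
    ("shrink bottlenecks of size 80",   -0.001, -1.0),
    ("shrink expansion layer 12",       -0.004, -5.0),
]
print(best_proposal(proposals)[0])  # shrink expansion layer 7
```

All three toy proposals lose some accuracy, but the first loses the least accuracy per millisecond of latency saved, so the ratio metric selects it.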

Redesigning Expensive Layers

After obtaining the search results, the paper finds that the first and last layers of the
network are relatively expensive, so these layers are modified specifically.

  The modification of the last few layers is shown in Fig. 5: the average pooling is
moved earlier, so that the subsequent projections to high dimensions operate on a
$1 \times 1$ feature map instead of a $7 \times 7$ one, saving a lot of time. Since
moving the average pooling forward already saves considerable computation, the depthwise
and pointwise convolutions of the previous bottleneck (the step that produced the
320-dimensional features between the 160-dimensional bottleneck and the 1280-dimensional
layer in order to reduce computation) are no longer needed and are removed outright,
saving further computation. This improvement brings a speedup of 7 milliseconds (11%).

  For the first layers, typical networks use a 32-filter $3 \times 3$ convolution. The
paper argues these filters are redundant; experiments show that reducing them to 16
filters does not affect accuracy and brings a 2-millisecond speedup. The h-swish
nonlinearity proposed in the paper is used for the activation, performing no worse than
the other functions.

Nonlinearities

Swish, a substitute for ReLU, can significantly improve accuracy. It is defined as:

$$\text{swish}(x) = x \cdot \sigma(x)$$

Because swish contains the sigmoid function, it is not well optimized on mobile devices,
so the sigmoid is replaced with the piecewise-linear approximation
$\frac{\text{ReLU6}(x+3)}{6}$, giving:

$$\text{h-swish}(x) = x \, \frac{\text{ReLU6}(x+3)}{6}$$
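Both activations are easy to write in pure Python, which makes the approximation concrete:

```python
# swish and its piecewise-linear approximation h-swish, as defined above.
import math

def swish(x):
    return x / (1.0 + math.exp(-x))  # x * sigmoid(x)

def h_swish(x):
    relu6 = min(max(x + 3.0, 0.0), 6.0)  # ReLU6(x + 3)
    return x * relu6 / 6.0

# h-swish is exactly 0 for x <= -3 and exactly x for x >= 3;
# in between it tracks swish closely.
print(h_swish(-4.0), h_swish(4.0))
print(swish(1.0), h_swish(1.0))
```

Since `min`, `max` and a multiply are all cheap on mobile hardware, h-swish avoids the `exp` in the sigmoid while staying close to swish everywhere.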

The visualization in Fig. 6 shows that the curves of swish and h-swish are very close.
The deeper the network, the cheaper the nonlinear operations become (the feature map size
keeps halving), so h-swish is only used in the second half of the network.

Large squeeze-and-excite

MobileNetV3's bottleneck adds an SE module to the V2 bottleneck, with the SE ratio fixed
at 0.25. The paper mentions that this differs from MnasNet, where the ratio is fixed at
1/4 of the expansion layer; to me, however, there seems to be no difference — please let
me know if you see one.

MobileNetV3 Definitions

MobileNetV3 comes in two versions: MobileNetV3-Large and MobileNetV3-Small.

Experiments

The experiments in the paper are very thorough; only the main results for some tasks are
shown here, and the rest can be found in the original text.

The paper compares the performance of MobileNetV3 with other networks on image
classification.

The paper compares the performance of MobileNetV3 with other networks on object
detection.

Conclusion
MobileNetV3 first uses AutoML to obtain an optimal network structure and then reaches its
final accuracy through partial manual modifications. Although the network is not obtained
purely through search, the experimental results hold up, and its improvements are well
worth studying and borrowing from.

Conclusion

The MobileNet family is a very important family of lightweight networks. MobileNetV1 uses
depthwise separable convolutions to construct a lightweight network. MobileNetV2 proposes
the innovative inverted residual with linear bottleneck unit; although it has more
layers, both the accuracy and the speed of the overall network are improved. MobileNetV3
uses AutoML techniques and manual fine-tuning to build an even more lightweight network.

   

If this article helped you, please give it a like or a read. For more content, please
follow the WeChat official account.
