0% found this document useful (0 votes)

3 views17 pages

XLA Final Report

Uploaded by

lekhongbaominh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views17 pages

XLA Final Report

Uploaded by

lekhongbaominh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 17

HO CHI MINH CITY UNIVERSITY OF TECHNOLOGY AND EDUCATION

FACULTY OF INTERNATIONAL EDUCATION

Image Processing in Industrial

FINAL PROJECT REPORT

APPLICATION OF CONVOLUTIONAL NEURAL NETWORK

ALEXNET IN OBJECT RECOGNITION WITH HISTOGRAM
CHECKING AND EQUALIZATION
Lecturer: Lê Mỹ Hà Ph.D

Students’ names:
Huỳnh Chí Nguyên – 22151032
Lê Khổng Bảo Minh – 22151255

Ho Chi Minh city, December 19th, 2024

1
Table of contents
CHAPTER 1. OVERVIEW.........................................................................................................................2
1.1 Problem statement.............................................................................................................................2
1.2 Objectives..........................................................................................................................................2
CHAPTER 2. METHODOLOGY AND CALCULATIONS.......................................................................3
2.1 AlexNet convolutional neural network (CNN)..................................................................................3
2.1.1 Introduction to AlexNet..............................................................................................................3
2.1.2 Structure of AlexNet Network....................................................................................................3
2.1.3 AlexNet Architecture..................................................................................................................5
2.1.4 AlexNet Applications:.................................................................................................................5
2.2 Theoretical foundation of histogram..................................................................................................7
2.2.1 Definition of Histogram..............................................................................................................7
2.2.2 Histogram balance......................................................................................................................7
2.2.3 Histogram equalization...............................................................................................................8
2.3 Program Workflow............................................................................................................................8
2.3.1 Block diagram for program.........................................................................................................8
2.3.2 Functions of each block..............................................................................................................9
2.3.3 Matlab program..........................................................................................................................9
CHAPTER 3. RESULTS...........................................................................................................................11
3.1 The original image is balanced in histogram....................................................................................11
3.2 The original image is too dark.........................................................................................................11
3.3 The original image is too bright.......................................................................................................12
CHAPTER 4. CONCLUSION..................................................................................................................13
REFERENCES..........................................................................................................................................14

2
CHAPTER 1. OVERVIEW

1.1 Problem statement

Nowadays, the development of artificial intelligence and neural networks has become
more rapid than ever. Among its applications, object recognition in images has
become one of the most prominent ones, serving various purposes such as
surveillance, traffic control, and life assistance. However, the quality of input images
greatly affects the accuracy of the recognition process. This necessitates
preprocessing steps, such as histogram equalization, to enhance image quality and
recognition results.

The project "APPLICATION OF ALEXNET IN OBJECT RECOGNITION WITH

HISTOGRAM CHECKING AND EQUALIZATION" focuses on solving the
problem of object recognition in images using the AlexNet neural network while
integrating histogram checking and equalization to ensure the quality of the input
images.

1.2 Objectives
The project aims to develop a program to recognize simple objects in input images
using the AlexNet neural network, along integrating a feature to check and equalize
the histogram of all three RGB channels to ensure image quality. Then, the program
will display both the original and equalized images (if needed) in a single frame and
announce the name of the recognized object for enhancing program interactivity.

3
CHAPTER 2. METHODOLOGY AND CALCULATIONS

2.1 AlexNet convolutional neural network (CNN)

2.1.1 Introduction to AlexNet
AlexNet, first introduced by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton in
2012, is one of the most influential convolutional neural networks (CNNs) in the history
of deep learning. It had 60 million parameters, 650,000 neurons, a training set of 1.2
million images, which revolutionized computer vision by significantly improving
performance on image classification tasks, particularly in the Image Net Large Scale
Visual Recognition Challenge (ILSVRC).
The architecture design provides training that is almost black box, plus the ability to self-
learn features through hidden layers.

2.1.2 Structure of AlexNet Network

a) Input Layer:
 Input: RGB images of size 227x227x3.
 Images are preprocessed (normalized and resized) to fit this input size.

b) Layer 1: Convolutional Layer (Conv1)

 Filters: 96
 Kernel Size: 11x11
 Stride: 4
 Activation: ReLU
 Output: Feature maps of size 55x55x96
 Additional Step: Max pooling with a 3x3 filter and stride 2 reduces the size to
27x27x96.

c) Layer 2: Convolutional Layer (Conv2)

 Filters: 256
 Kernel Size: 5x5
 Padding: 2 (same padding)
 Activation: ReLU
 Output: Feature maps of size 27x27x256
 Additional Step: Max pooling reduces the size to 13x13x256.

4
d) Layer 3: Convolutional Layer (Conv3)
 Filters: 384
 Kernel Size: 3x3
 Padding: 1 (same padding)
 Activation: ReLU
 Output: Feature maps of size 13x13x384.

e) Layer 4: Convolutional Layer (Conv4)

 Filters: 384
 Kernel Size: 3x3
 Padding: 1
 Activation: ReLU
 Output: Feature maps of size 13x13x384.

f) Layer 5: Convolutional Layer (Conv5)

 Filters: 256
 Kernel Size: 3x3
 Padding: 1
 Activation: ReLU
 Output: Feature maps of size 13x13x256.
 Additional Step: Max pooling reduces the size to 6x6x256.
 Flattening Layer:
 The 3D tensor (6x6x256) is flattened into a 1D vector with 9216 units.

g) Layer 6: Fully Connected Layer (FC1)

 Nodes: 4096
 Activation: ReLU
 Dropout: Applied to prevent overfitting.

h) Layer 7: Fully Connected Layer (FC2)

 Nodes: 4096
 Activation: ReLU
 Dropout: Applied again for regularization.

i) Layer 8: Output Layer (FC3)

 Nodes: 1000 (corresponding to the 1000 classes in the ImageNet dataset).
 Activation: Softmax, which outputs probabilities for each class.

5
2.1.3 AlexNet Architecture

Overlap
ping Max Pooling:

Max Pooling layer is often used to reduce the width and length of a tensor but keep the
depth the same. Overlapping Max Pool layer is similar to Max Pool layer, except that one
window of this step will have a part overlapping the window of the next step. We use
pooling of size 3x3 and a step of 2 between pooling. That means between this pooling
and another pooling will overlap with each other by 1 pixel.

2.1.4 AlexNet Applications:

a) Image Classification:

 AlexNet was originally designed for image classification tasks, particularly on large-
scale datasets like ImageNet.
 It can categorize images into thousands of classes, making it ideal for object
recognition and categorization.

b) Object Detection:

 AlexNet serves as a backbone for many object detection models like R-CNN.
 It helps detect and localize multiple objects within an image.
 Feature Extraction: The convolutional layers of AlexNet are used to extract high-
quality features from images. These features are applied to various downstream tasks
like transfer learning.

6
c) Medical Imaging:

 AlexNet is applied in medical fields for diagnosing diseases through imaging

techniques such as X-rays, CT scans, and MRIs.
 It helps identify abnormalities like tumors or organ damage.

d) Autonomous Vehicles:

 The network is used to process visual data from cameras in self-driving cars.
 It aids in recognizing road signs, obstacles, and pedestrians.

e) Facial Recognition: AlexNet is a foundational model for facial recognition systems.

 It helps in tasks such as identifying individuals, analyzing expressions, or detecting

faces in images.
 Agriculture: AlexNet assists in identifying plant diseases, analyzing crop health, and
classifying plant species from images.

f) Application experiments
Here are the eight ILSVRC-2010 test images and the five labels considered most likely in
the AlexNet model. What the network has learned can be qualitatively assessed by
calculating the top 5 estimates on the eight test images.

7
2.2 Theoretical foundation of histogram
2.2.1 Definition of Histogram
A histogram is a graphical representation of the frequency distribution of pixel intensity
values in an image. For digital images, each pixel value typically ranges between [0,255].
The histogram displays the brightness distribution of the image, from dark regions (low
pixel values) to bright regions (high pixel values).
Basically, we have three categories of images with different histogram. An image with
histogram concentrated in the low-value range is too dark. On the other hand, an image
with histogram concentrated in the high-value range is too bright. An image with a well-
distributed histogram tends to be better-looking.
2.2.2 Histogram balance
A balanced histogram is one in which the intensity values are distributed relatively evenly
across the range, avoiding concentrations in specific regions. This indicates the image has
good brightness and contrast and there is no overly dark or overly bright area. The
balance of a histogram can be assessed by calculating the deviation of pixel distributions
The balance of a histogram can be assessed by calculating the deviation of pixel
distributions.
Firstly, compute the relative frequency p(i) of each intensity level i:
h(i)
p(i) = N
h(i):the number of pixels with intensity i.
N: the total number of pixels in the image.
Secondly, compute the mean of the normalized histogram:
L−1
1
μ = L ∑ p (i)
i=0

L: the number of intensity levels

Calculate the maximum deviation from the mean

Dmax =max ⁡(| p ( i )−μ|)

If Dmax exceeds a predefined threshold, the histogram is considered unbalanced.

8
2.2.3 Histogram equalization
Histogram equalization is a technique used to improve its overall constract by adjusting
the intensity distribution. The goal is to redistribute pixel values such that the histogram
becomes more uniform.
For this program, we use the global histogram for discrete case because the desired
objectives are for regular images with simple objects. The formulas for the discrete of
histogram equalization are presented as:
L−1
n L−1
P L−1= where n=∑ nl
n l=0

k
gk =T [ f k ]=∑ Pi
i=0

K k k
nj L−1
sk = T(r k ) = (L−1) ∑ p r ( r j) =( L−1) ∑ = ∑n
j=0 j=0 MN MN j=0 j

for k = 0,1,…,L-1

9
2.3 Program Workflow
2.3.1 Block diagram for program

10
2.3.2 Functions of each block
 Start: marks the beginning of the program.
 Load image: loads the input image.
 Check histogram balance: checks whether the histogram of the image's R, G, and
B channels is balanced.
 Equalize histogram: performs histogram equalization for each R, G, and B channel
and combines the channels back into an RGB image.
 Resize image: resizes the image to 227x227 dimensions to meet AlexNet's input
size requirements.
 Object recognition: classifies the resized image to recognize the object.
 Display images: displays the original image and the equalized image (if needed)
side by side.
 Speak object name: announces the recognized object's name.
 End: marks the end of the program.

2.3.3 Matlab program

11
In order to use this program, the computer must be integrated with the Deep Learning
Toolbox Model for AlexNet Network support package, made by MathWorks Deep
Learning Toolbox Team.
The threshold value of 0.1 is chosen because this is the most common value of processing
image using Matlab, pixel levels can deviate by up to 10% from the mean, which is
generally considered acceptable in histogram processing.
The program uses discrete equation because it is based on fixed pixel intensity levels.
This is a common approach to regular digital image processing, where pixel intensity
levels are represented discretely. The same reason is for the usage of global histogram
equalization.

12
CHAPTER 3. RESULTS
3.1 The original image is balanced in histogram

Original Image tiger cat

3.2 The original image is too dark

Original Image mountain bike

13
3.3 The original image is too bright

Original Image snowmobile

14
CHAPTER 4. CONCLUSION

The project has completed the requirements, including recognizing common objects in
the simple input images, performing image preprocessing steps such as checking and
balancing the histogram to ensure the recognition process is effective, and pronouncing
the name of the recognized object. However, the program still has some limitations when
there are cases of mistakenly recognizing objects with similar shapes. This requires
further improvement in the input image normalization or preprocessing step, as well as
using another CNN network with larger and more diverse training data to increase the
ability to recognize and classify accurately. In the future, the program can be further
optimized to achieve higher performance in more complex recognition problems.

15
REFERENCES

[1]. Đặng Thị Hằng | Phạm Duy Tùng _ Tìm Hiểu Về Mạng Neural Network AlexNet _
2018 https://www.phamduytung.com/blog/2018-06-15-understanding-alexnet/

[2]. Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton, ImageNet Classification
with Deep Convolutional Neural Networks _ 2012

[3]. Đặng Thị Hằng | Phạm Duy Tùng _ Tìm Hiểu Mạng AlexNet, Mô Hình Giành Chiến
Thắng Tại Cuộc Thi ILSVRC 2012 _ 2019 https://www.phamduytung.com/blog/2019-
05-27-alexnet/

[4]. Nuruzzaman Faruqui _ Matlab Tutorial: Text To Speech _2018

[5]. Lê Mỹ Hà _ Lecture 3: Image enhancement

Complete Bundle Statistical Issus in Drug Development 3rd Edition Stephen S Senn HQ File
100% (1)
Complete Bundle Statistical Issus in Drug Development 3rd Edition Stephen S Senn HQ File
411 pages
AlexNet Algorithm Presentation ML AI Deep Learning
No ratings yet
AlexNet Algorithm Presentation ML AI Deep Learning
10 pages
CS60010: Deep Learning CNN - Part 3: Sudeshna Sarkar
No ratings yet
CS60010: Deep Learning CNN - Part 3: Sudeshna Sarkar
167 pages
Transfer Learning - CNN Architectures
No ratings yet
Transfer Learning - CNN Architectures
120 pages
Unit - 3 - Object Recognition
No ratings yet
Unit - 3 - Object Recognition
12 pages
Deeplearning - PPT - Unit 4 and 5
No ratings yet
Deeplearning - PPT - Unit 4 and 5
154 pages
465-Lecture 7
No ratings yet
465-Lecture 7
46 pages
Different Deep CNN Architectures - LeNet, AlexNet, VGG
No ratings yet
Different Deep CNN Architectures - LeNet, AlexNet, VGG
13 pages
An Analysis of Convolutional Neural Network Architectures
No ratings yet
An Analysis of Convolutional Neural Network Architectures
54 pages
7 Architectures
No ratings yet
7 Architectures
68 pages
7 CNN
No ratings yet
7 CNN
66 pages
Classic CNN
No ratings yet
Classic CNN
39 pages
Architecture Handbook
No ratings yet
Architecture Handbook
19 pages
Unit V
No ratings yet
Unit V
84 pages
Intel Final Report Edit Lan 4
No ratings yet
Intel Final Report Edit Lan 4
64 pages
Convolutional Networks
No ratings yet
Convolutional Networks
211 pages
XCXC
No ratings yet
XCXC
16 pages
Image Processing With Deep Learning
No ratings yet
Image Processing With Deep Learning
39 pages
5b Dana
No ratings yet
5b Dana
67 pages
Military AI-Week 05-AI in Computer Vision
No ratings yet
Military AI-Week 05-AI in Computer Vision
65 pages
Convolutional Neural Network Ilsvrc Alexnet (2012) Zfnet (2013) Vggnet (2014) Googlenet 2014) Resnet (2015) Conclusion
No ratings yet
Convolutional Neural Network Ilsvrc Alexnet (2012) Zfnet (2013) Vggnet (2014) Googlenet 2014) Resnet (2015) Conclusion
82 pages
CNN Architectures - Transfer Learning
No ratings yet
CNN Architectures - Transfer Learning
64 pages
Notes
No ratings yet
Notes
15 pages
DLP&P Notes Faculty: Ms. Meenakshi Chaudhary: What Is A Convolutional Neural Network (CNN) ?
No ratings yet
DLP&P Notes Faculty: Ms. Meenakshi Chaudhary: What Is A Convolutional Neural Network (CNN) ?
50 pages
VGG (Simonyan and Zisserman)
No ratings yet
VGG (Simonyan and Zisserman)
14 pages
BEFA
No ratings yet
BEFA
23 pages
Neural Network Project Report.
No ratings yet
Neural Network Project Report.
12 pages
Aidl 2023s DL 08 CNN Architectures
No ratings yet
Aidl 2023s DL 08 CNN Architectures
51 pages
4 March 23 - DL
No ratings yet
4 March 23 - DL
79 pages
Harley MSC Thesis Menos Especializadpo
No ratings yet
Harley MSC Thesis Menos Especializadpo
71 pages
Mổ xẻ cái AlexNet network
No ratings yet
Mổ xẻ cái AlexNet network
5 pages
ML Lec 15 Alexnet CNN
No ratings yet
ML Lec 15 Alexnet CNN
8 pages
Unit 5a - Machine Vision
No ratings yet
Unit 5a - Machine Vision
55 pages
Alex Net
No ratings yet
Alex Net
26 pages
Unit 3
No ratings yet
Unit 3
38 pages
138 B Pretrained Networks Classification Complete
No ratings yet
138 B Pretrained Networks Classification Complete
47 pages
Unit 3
No ratings yet
Unit 3
37 pages
DL - Unit IV
No ratings yet
DL - Unit IV
36 pages
25-Deep Convolutional Models - ResNet, AlexNet, InceptionNet and Others-12!09!2024
No ratings yet
25-Deep Convolutional Models - ResNet, AlexNet, InceptionNet and Others-12!09!2024
4 pages
Difference of LeNet and AlexNet
No ratings yet
Difference of LeNet and AlexNet
11 pages
Untitled Document
No ratings yet
Untitled Document
15 pages
Alexnet Tugce Kyunghee
No ratings yet
Alexnet Tugce Kyunghee
35 pages
COMP3220 Lect 11 - Introduction To Convolutional Neural Networks
No ratings yet
COMP3220 Lect 11 - Introduction To Convolutional Neural Networks
13 pages
Modern CNN Architectures
No ratings yet
Modern CNN Architectures
32 pages
Unit 2 CNN
No ratings yet
Unit 2 CNN
15 pages
Alex Net
No ratings yet
Alex Net
2 pages
Identify Web Cam Images Using Neural Networks
No ratings yet
Identify Web Cam Images Using Neural Networks
17 pages
Exercise 8
No ratings yet
Exercise 8
6 pages
Deep Learning Assign 2
No ratings yet
Deep Learning Assign 2
5 pages
DoMinhQuan 521H0290
No ratings yet
DoMinhQuan 521H0290
4 pages
ML II - Unit IV
No ratings yet
ML II - Unit IV
20 pages
Res Net 4
No ratings yet
Res Net 4
23 pages
Robotics: Presenter: Dr. Duc Thien, Tran
No ratings yet
Robotics: Presenter: Dr. Duc Thien, Tran
112 pages
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
No ratings yet
Deep Convolutional Neural Networks For Image Classification: Many Slides From Rob Fergus (NYU and Facebook)
55 pages
Trustworthy - Final Essay
No ratings yet
Trustworthy - Final Essay
21 pages
Convolutional Neural Networks: Computer Vision CS 543 / ECE 549 University of Illinois Jia-Bin Huang
No ratings yet
Convolutional Neural Networks: Computer Vision CS 543 / ECE 549 University of Illinois Jia-Bin Huang
76 pages
DL Ass 742
No ratings yet
DL Ass 742
14 pages
Seeya SY103WAM01 Specification v1-0 20210415
No ratings yet
Seeya SY103WAM01 Specification v1-0 20210415
54 pages
Classify Webcam Images Using Deep Learning
No ratings yet
Classify Webcam Images Using Deep Learning
17 pages
Questions For Final Test - EN
No ratings yet
Questions For Final Test - EN
7 pages
Local Control Panel Mgs Local Dse 8610 Mkii
No ratings yet
Local Control Panel Mgs Local Dse 8610 Mkii
6 pages
Alexnet: The Architecture That Challenged Cnns
No ratings yet
Alexnet: The Architecture That Challenged Cnns
6 pages
Le y Yang - Tiny ImageNet Visual Recognition Challenge
No ratings yet
Le y Yang - Tiny ImageNet Visual Recognition Challenge
6 pages
Hcxpsktool
No ratings yet
Hcxpsktool
59 pages
Computer Revision Test 4
No ratings yet
Computer Revision Test 4
5 pages
Fuel System - Repair Procedures
No ratings yet
Fuel System - Repair Procedures
48 pages
Week3 Chapter5 Session2 Lab
No ratings yet
Week3 Chapter5 Session2 Lab
41 pages
BOOKLET Inglés II - Com 7-8 - Nivel B-Wednesday9 PM
No ratings yet
BOOKLET Inglés II - Com 7-8 - Nivel B-Wednesday9 PM
12 pages
bài báo cáo assignmnet của Lê Khổng Bảo Minh 22151255
No ratings yet
bài báo cáo assignmnet của Lê Khổng Bảo Minh 22151255
9 pages
Understanding AlexNet
No ratings yet
Understanding AlexNet
8 pages
Lab 7
No ratings yet
Lab 7
6 pages
Lab 4
No ratings yet
Lab 4
4 pages
Spare Part List - 231-9100019-TFM For The Governate Building and Mosques Contract
No ratings yet
Spare Part List - 231-9100019-TFM For The Governate Building and Mosques Contract
12 pages
Open B3 D
No ratings yet
Open B3 D
57 pages
Part 1 Report of Le KIhong Bao Minh
No ratings yet
Part 1 Report of Le KIhong Bao Minh
4 pages
2 Restricting and Sorting Data
No ratings yet
2 Restricting and Sorting Data
28 pages
Tutorial Letter 101/3/2024: Safety Management Systems
No ratings yet
Tutorial Letter 101/3/2024: Safety Management Systems
12 pages
Connection Diagram Betewwn PLC and Inverter 2
No ratings yet
Connection Diagram Betewwn PLC and Inverter 2
1 page
Giải thích cách đưa ra được hàm truyền
No ratings yet
Giải thích cách đưa ra được hàm truyền
1 page
MEMO Oncall
No ratings yet
MEMO Oncall
5 pages
Bai Listening Cua Le Khong Bao Minh
No ratings yet
Bai Listening Cua Le Khong Bao Minh
1 page
Instruction Manual - PMC Mixing Machine
No ratings yet
Instruction Manual - PMC Mixing Machine
9 pages
Bss Fds360 Service Manual
No ratings yet
Bss Fds360 Service Manual
8 pages
Mahdi Afrad (English Job Resume)
No ratings yet
Mahdi Afrad (English Job Resume)
4 pages
Upere Intermediate Tests Answer - Key
No ratings yet
Upere Intermediate Tests Answer - Key
6 pages
KUKA Ethernet/IP 2.0: Controller Option
No ratings yet
KUKA Ethernet/IP 2.0: Controller Option
51 pages
Solutions For 4D Cadastre - With A Case Study On Utility Networks
No ratings yet
Solutions For 4D Cadastre - With A Case Study On Utility Networks
20 pages
Unrecoded Liability
No ratings yet
Unrecoded Liability
3 pages
Millyards Wheel Loaders: Load Capacity Curves 966M
No ratings yet
Millyards Wheel Loaders: Load Capacity Curves 966M
4 pages
Computer System Diagnosis and Maintenance Prelab Questions 4
No ratings yet
Computer System Diagnosis and Maintenance Prelab Questions 4
2 pages
Brochure TEQIP STC IITG
No ratings yet
Brochure TEQIP STC IITG
6 pages
Tesla Script
No ratings yet
Tesla Script
3 pages
Cluster Setup
No ratings yet
Cluster Setup
4 pages
ASTM D2247 - Presto
No ratings yet
ASTM D2247 - Presto
2 pages
SG285 - 320 - 333 - 350HX EMC Certificate 64.772.21.80078.03
No ratings yet
SG285 - 320 - 333 - 350HX EMC Certificate 64.772.21.80078.03
2 pages
In Pneumatic Systems The Medium Used Is
No ratings yet
In Pneumatic Systems The Medium Used Is
8 pages
CS360 ML Syllabus - 12102022
No ratings yet
CS360 ML Syllabus - 12102022
5 pages
Liquid Drainers
No ratings yet
Liquid Drainers
5 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Digital Image Processing: Fundamentals and Applications
From Everand
Digital Image Processing: Fundamentals and Applications
Fouad Sabry
No ratings yet

XLA Final Report

Uploaded by

XLA Final Report

Uploaded by

HO CHI MINH CITY UNIVERSITY OF TECHNOLOGY AND EDUCATION

FACULTY OF INTERNATIONAL EDUCATION

FINAL PROJECT REPORT

APPLICATION OF CONVOLUTIONAL NEURAL NETWORK

Ho Chi Minh city, December 19th, 2024

1.1 Problem statement

The project "APPLICATION OF ALEXNET IN OBJECT RECOGNITION WITH

2.1 AlexNet convolutional neural network (CNN)

2.1.2 Structure of AlexNet Network

b) Layer 1: Convolutional Layer (Conv1)

c) Layer 2: Convolutional Layer (Conv2)

e) Layer 4: Convolutional Layer (Conv4)

f) Layer 5: Convolutional Layer (Conv5)

g) Layer 6: Fully Connected Layer (FC1)

h) Layer 7: Fully Connected Layer (FC2)

i) Layer 8: Output Layer (FC3)

2.1.4 AlexNet Applications:

 AlexNet is applied in medical fields for diagnosing diseases through imaging

e) Facial Recognition: AlexNet is a foundational model for facial recognition systems.

 It helps in tasks such as identifying individuals, analyzing expressions, or detecting

L: the number of intensity levels

Dmax =max ⁡(| p ( i )−μ|)

If Dmax exceeds a predefined threshold, the histogram is considered unbalanced.

2.3.3 Matlab program

Original Image tiger cat

3.2 The original image is too dark

Original Image mountain bike

Original Image snowmobile

[4]. Nuruzzaman Faruqui _ Matlab Tutorial: Text To Speech _2018

[5]. Lê Mỹ Hà _ Lecture 3: Image enhancement

You might also like