1) Explain Better Weight Initialization Methods
Weight initialization is crucial for training deep neural networks effectively. Poor initialization can lead to
slow convergence, poor performance, or vanishing/exploding gradients. Here are some better weight
initialization methods:
Zero Initialization: All weights are set to zero. This fails because every neuron in a layer then receives
identical gradients (the symmetry is never broken), so all neurons learn the same features and the model is ineffective.
Random Initialization: Initialize weights randomly, usually with small values. While this breaks
symmetry, it gives no control over the variance of activations and gradients, so deep networks can still
suffer from vanishing or exploding gradients during backpropagation.
Xavier Initialization (Glorot Initialization):
o Designed for activation functions like tanh and sigmoid.
o Initializes weights by drawing from a distribution with variance $2 / (n_{\text{in}} + n_{\text{out}})$, where $n_{\text{in}}$ and $n_{\text{out}}$ are the numbers of input and output units of the layer.
o Formula (uniform variant): $W \sim \mathcal{U}\!\left(-\sqrt{6 / (n_{\text{in}} + n_{\text{out}})},\ \sqrt{6 / (n_{\text{in}} + n_{\text{out}})}\right)$
o It helps avoid exploding or vanishing gradients in shallow to moderately deep networks.
He Initialization:
o Designed for ReLU activation functions.
o Initializes weights with a variance of $2 / n_{\text{in}}$, where $n_{\text{in}}$ is the number of input units.
o Formula: $W \sim \mathcal{N}\!\left(0,\ \frac{2}{n_{\text{in}}}\right)$
o This method is particularly effective when using ReLU activations, helping prevent the dying
ReLU problem.
LeCun Initialization:
o Suited to sigmoid/tanh activations and is the standard initialization for SELU.
o Similar to He initialization but with a variance of $1 / n_{\text{in}}$, which keeps the activation variance roughly constant across layers for these activations.
Orthogonal Initialization:
o Initializes the weight matrix as an orthogonal matrix; because orthogonal matrices preserve vector norms, activation and gradient magnitudes stay stable through the layers. This approach can work well in deeper networks and helps avoid vanishing/exploding gradients.
Sparse Initialization:
o Weights are initialized to be sparse, with only a few non-zero values. This is useful in certain
cases where the network can benefit from sparse representations, such as in sparse
autoencoders.
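As a concrete illustration of the schemes above, here is a minimal NumPy sketch of Xavier (uniform), He (normal), and LeCun (normal) initialization. The layer sizes, random seed, and function names are illustrative assumptions, not part of any particular library's API.

import numpy as np

rng = np.random.default_rng(0)  # fixed seed only for reproducibility of the example

def xavier_uniform(n_in, n_out):
    # Glorot/Xavier uniform: variance 2 / (n_in + n_out), suited to tanh/sigmoid
    limit = np.sqrt(6.0 / (n_in + n_out))
    return rng.uniform(-limit, limit, size=(n_in, n_out))

def he_normal(n_in, n_out):
    # He/Kaiming normal: variance 2 / n_in, suited to ReLU
    return rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out))

def lecun_normal(n_in, n_out):
    # LeCun normal: variance 1 / n_in, suited to sigmoid/tanh and SELU
    return rng.normal(0.0, np.sqrt(1.0 / n_in), size=(n_in, n_out))

W = he_normal(784, 256)   # e.g. first layer of a hypothetical MNIST MLP
print(W.std())            # should be close to sqrt(2 / 784) ≈ 0.0505

Deep learning frameworks ship these schemes built in, so in practice they are selected rather than hand-coded; the sketch only makes the variance choices explicit.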
2) Explain ResNet and LeNet in detail
ResNet (Residual Networks)
Introduction: ResNet, introduced by Kaiming He et al. in 2015, is a deep convolutional network
architecture that introduced residual learning, a technique where layers learn residual mappings (the
difference between the output and input), rather than direct mappings.
Key Concept: The idea is to allow the gradient to flow more easily through deeper layers by using skip
connections or identity shortcuts, which help mitigate the vanishing gradient problem.
Architecture:
o Composed of blocks of layers that have skip connections.
o Each block performs a series of convolutions, batch normalizations, and activation functions.
The output of the block is the sum of the input (via the skip connection) and the result of the
convolutional layers.
o Formula: $\text{output} = F(x, \{W_i\}) + x$, where $F(x, \{W_i\})$ is the output of the convolutional layers and $x$ is the input.
Advantages:
o Deep ResNets (e.g., ResNet-50, ResNet-101) have been shown to achieve excellent results in
image classification tasks by addressing the degradation problem of very deep networks.
o Easy to scale to deeper architectures without a significant drop in performance.
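The residual formula above is easiest to see in code. Below is a hedged PyTorch sketch of a basic residual block (two 3x3 convolutions with batch normalization and an identity shortcut); the channel count and class name are illustrative assumptions, and this is not the bottleneck block used in ResNet-50/101.

import torch
import torch.nn as nn

class BasicResidualBlock(nn.Module):
    # Implements output = F(x, {W_i}) + x with F = conv-BN-ReLU-conv-BN.
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        identity = x                              # skip connection carries x forward
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        out = out + identity                      # residual addition
        return self.relu(out)

block = BasicResidualBlock(64)
y = block(torch.randn(1, 64, 32, 32))             # output shape is preserved: (1, 64, 32, 32)

Because each block only has to model the residual F(x), stacking many such blocks keeps gradients flowing through the identity paths, which is what allows very deep ResNets to train.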
LeNet (LeNet-5)
Introduction: LeNet, introduced by Yann LeCun in 1998, is one of the earliest CNN architectures
designed for handwritten digit recognition (e.g., the MNIST dataset).
Architecture: LeNet-5 consists of:
1. Convolutional Layer 1: Applies 6 filters with a kernel size of 5x5 to input images of size 32x32.
2. Subsampling Layer 1 (Pooling): Average-pooling (subsampling) layer that downsamples the feature maps to 14x14.
3. Convolutional Layer 2: Applies 16 filters of size 5x5 to the pooled feature maps.
4. Subsampling Layer 2 (Pooling): Another average-pooling layer, giving 16 feature maps of size 5x5.
5. Fully Connected Layers: Fully connected layers of 120 and 84 units, followed by a 10-way output layer for classification.
Advantages:
o A simple and early architecture that set the foundation for modern deep learning models.
o Uses weight sharing (same weights for different patches) and local receptive fields, which
helps reduce computational costs.
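A PyTorch sketch of the LeNet-5 layout just described is given below; the tanh activations and average-pooling subsampling follow the original design, while padding MNIST digits to 32x32 inputs is an assumption of this example.

import torch
import torch.nn as nn

class LeNet5(nn.Module):
    def __init__(self, num_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, kernel_size=5),   # 1x32x32 -> 6x28x28
            nn.Tanh(),
            nn.AvgPool2d(2),                  # subsampling -> 6x14x14
            nn.Conv2d(6, 16, kernel_size=5),  # -> 16x10x10
            nn.Tanh(),
            nn.AvgPool2d(2),                  # -> 16x5x5
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * 5 * 5, 120), nn.Tanh(),
            nn.Linear(120, 84), nn.Tanh(),
            nn.Linear(84, num_classes),       # logits; softmax is applied in the loss
        )

    def forward(self, x):                     # x: (N, 1, 32, 32)
        return self.classifier(self.features(x))

logits = LeNet5()(torch.randn(4, 1, 32, 32))  # shape (4, 10)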
3) Explain the Q Function and Q-Learning Algorithm
Q Function (Action-Value Function): The Q-function $Q(s, a)$ represents the expected cumulative reward an agent can achieve from state $s$ by taking action $a$ and then following the optimal policy thereafter.
Q-Learning Algorithm:
o A model-free reinforcement learning algorithm that seeks to learn the optimal action-value
function Q∗(s,a)Q^*(s, a)Q∗(s,a).
o Update Rule:
$Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \left[ R(s_t, a_t) + \gamma \max_{a'} Q(s_{t+1}, a') - Q(s_t, a_t) \right]$
where:
$\alpha$ is the learning rate,
$\gamma$ is the discount factor,
$R(s_t, a_t)$ is the immediate reward,
$\max_{a'} Q(s_{t+1}, a')$ is the maximum predicted Q-value for the next state.
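To ground the update rule, here is a small tabular Q-learning sketch in NumPy. The state/action counts, hyperparameters, and the dummy random transitions are illustrative assumptions; in practice the next state, reward, and done flag come from a real environment.

import numpy as np

n_states, n_actions = 16, 4                # illustrative sizes
alpha, gamma, epsilon = 0.1, 0.99, 0.1     # learning rate, discount factor, exploration rate
Q = np.zeros((n_states, n_actions))
rng = np.random.default_rng(0)

def q_update(s, a, r, s_next, done):
    # Q(s,a) <- Q(s,a) + alpha * [ r + gamma * max_a' Q(s',a') - Q(s,a) ]
    target = r if done else r + gamma * Q[s_next].max()
    Q[s, a] += alpha * (target - Q[s, a])

def epsilon_greedy(s):
    if rng.random() < epsilon:
        return int(rng.integers(n_actions))    # explore
    return int(Q[s].argmax())                  # exploit the best-known action

s = 0
for _ in range(100):                           # dummy loop just to exercise the update
    a = epsilon_greedy(s)
    s_next = int(rng.integers(n_states))       # placeholder for env.step(a)
    r = float(rng.random() < 0.1)
    q_update(s, a, r, s_next, done=False)
    s = s_next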
4) Explain the Policy Gradient Algorithm for Full RL
Policy Gradient Algorithm for Full RL
Policy Gradient methods aim to directly optimize the policy by updating the parameters $\theta$ of a parameterized policy function $\pi_\theta(a \mid s)$.
Objective: Maximize the expected return (reward) by optimizing the following objective function:
$J(\theta) = \mathbb{E}_{\pi_\theta} \left[ \sum_{t=0}^{T} \gamma^t R(s_t, a_t) \right]$
Gradient of Objective:
$\nabla_\theta J(\theta) = \mathbb{E}_{\pi_\theta} \left[ \nabla_\theta \log \pi_\theta(a_t \mid s_t) \cdot R_t \right]$
In practice this expectation is estimated from sampled trajectories using Monte Carlo methods, as in the REINFORCE algorithm, or with more sophisticated actor-critic variants.
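A minimal sketch of the REINFORCE estimator for this gradient with a tabular softmax policy follows; the problem sizes, learning rate, and the dummy episode are illustrative assumptions.

import numpy as np

n_states, n_actions, gamma, lr = 4, 2, 0.99, 0.05
theta = np.zeros((n_states, n_actions))        # tabular softmax policy parameters

def policy(s):
    z = theta[s] - theta[s].max()              # numerically stable softmax over actions in state s
    p = np.exp(z)
    return p / p.sum()

def grad_log_pi(s, a):
    # d/d theta of log pi(a|s) for a tabular softmax: row s gets one_hot(a) - pi(.|s)
    g = np.zeros_like(theta)
    g[s] = -policy(s)
    g[s, a] += 1.0
    return g

def reinforce_update(episode):
    # episode is a list of (state, action, reward) tuples from one rollout
    G = 0.0
    for s, a, r in reversed(episode):
        G = r + gamma * G                      # return R_t from time t onward
        theta[:] += lr * G * grad_log_pi(s, a) # theta += lr * R_t * grad log pi(a_t|s_t)

reinforce_update([(0, 1, 0.0), (2, 0, 1.0)])   # dummy episode; a real agent samples it from the environment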
5) Explain Value Iteration, Policy Iteration, and Temporal Difference Learning
Value Iteration: A dynamic programming algorithm to compute the optimal value function. It updates
the value function iteratively using:
$V(s) = \max_{a} \left[ R(s, a) + \gamma \sum_{s'} P(s' \mid s, a) V(s') \right]$
Policy Iteration: Alternates between two steps:
1. Policy Evaluation: Compute the value function for the current policy.
2. Policy Improvement: Update the policy by choosing the action that maximizes the expected
value.
Temporal Difference (TD) Learning: A combination of Monte Carlo ideas and dynamic programming. It updates value estimates from other learned estimates (bootstrapping) after every step, without needing a model of the environment.
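Since value iteration assumes a known model (P and R), it can be written in a few lines; the sketch below applies the update above to a small synthetic MDP, which is an illustrative assumption.

import numpy as np

rng = np.random.default_rng(0)
n_states, n_actions, gamma = 5, 3, 0.9

# Synthetic known model: P[s, a, s'] transition probabilities and R[s, a] rewards.
P = rng.random((n_states, n_actions, n_states))
P /= P.sum(axis=2, keepdims=True)
R = rng.random((n_states, n_actions))

V = np.zeros(n_states)
for _ in range(1000):
    # V(s) = max_a [ R(s,a) + gamma * sum_s' P(s'|s,a) V(s') ]
    Q = R + gamma * (P @ V)                    # shape (n_states, n_actions)
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-8:       # stop once the Bellman update has converged
        break
    V = V_new

greedy_policy = Q.argmax(axis=1)               # optimal policy extracted from the final Q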
6) Discuss Bandit Algorithms in Detail
Bandit Algorithms
Multi-Armed Bandit Problem: A classic problem in reinforcement learning where an agent must
choose between multiple options (arms) to maximize the cumulative reward. Each arm has an
unknown reward distribution.
Algorithms:
1. Epsilon-Greedy: Selects the best-known action with probability $1 - \epsilon$, and randomly selects an action with probability $\epsilon$.
2. UCB (Upper Confidence Bound): Chooses actions that balance between exploration (trying
unknown actions) and exploitation (choosing the best-known action).
3. Thompson Sampling: A Bayesian approach that samples from the posterior distribution of
each arm’s reward and chooses the arm with the highest sample.
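A short sketch of the epsilon-greedy strategy from the list above on a synthetic Bernoulli bandit is shown below; the arm probabilities, epsilon, and number of rounds are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(0)
true_probs = np.array([0.2, 0.5, 0.7])      # hidden Bernoulli reward rate of each arm
epsilon, n_rounds = 0.1, 5000

counts = np.zeros(len(true_probs))          # number of pulls per arm
values = np.zeros(len(true_probs))          # running mean reward per arm

for _ in range(n_rounds):
    if rng.random() < epsilon:
        arm = int(rng.integers(len(true_probs)))        # explore a random arm
    else:
        arm = int(values.argmax())                      # exploit the best-known arm
    reward = float(rng.random() < true_probs[arm])      # Bernoulli reward
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm] # incremental mean update

print(values.round(2), counts)   # value estimates should approach [0.2, 0.5, 0.7]

UCB and Thompson Sampling replace the epsilon rule with a confidence bonus or a posterior sample, respectively, but keep the same pull/observe/update loop.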
7) Explain Recent Trends in Deep Learning Architectures
Transformer Networks: Initially developed for NLP tasks, transformers use self-attention
mechanisms and have revolutionized deep learning for tasks like language modeling, image
generation, and more.
Attention Mechanisms: Help networks focus on important parts of the input, widely used in NLP,
vision, and reinforcement learning.
Self-Supervised Learning: Uses unlabeled data and generates labels from the data itself to pre-train
models.
Meta-Learning: Models that can learn how to learn, typically used in few-shot learning.
8) Explain Inverse Reinforcement Learning and Maximum Entropy Deep
Inverse Reinforcement Learning
Inverse Reinforcement Learning (IRL):
Definition: Inverse Reinforcement Learning (IRL) is a type of machine learning where the goal is to
infer the reward function given the observed behavior of an agent. Instead of specifying a reward
function in advance (as in traditional reinforcement learning), the agent learns a reward function by
observing the expert's actions in a given environment.
Process:
1. Demonstration: The agent observes an expert performing tasks in a particular environment.
2. Learning: The agent tries to deduce the reward function based on the expert's actions. It aims to figure out the reward function that would make the expert's behavior appear optimal under a reinforcement learning framework.
3. Application: Once the reward function is inferred, the agent can use it to make decisions in future tasks.
Applications:
o Autonomous driving (learning from human drivers),
o Robotics (teaching robots tasks by demonstration),
o Game AI (learning from human gameplay).
Maximum Entropy Deep Inverse Reinforcement Learning:
Definition: An extension of traditional IRL that incorporates the maximum entropy principle: among the reward functions and policies consistent with the expert demonstrations, it prefers the one whose policy has the highest entropy, which resolves the ambiguity inherent in IRL and helps avoid overfitting to the expert's behavior.
Concept:
o Instead of simply recovering a deterministic policy from expert demonstrations, the
maximum entropy approach adds a regularization term that encourages more diverse
behaviors, thus balancing exploration and exploitation.
o The idea is to learn a policy that maximizes both the expected reward and the entropy of
the policy, ensuring a more robust model.
Key Formula (entropy-regularized objective, maximized with respect to $\theta$):
$\mathcal{L}(\theta) = \mathbb{E}_{\pi_\theta} \left[ \log \pi_{\theta}(a \mid s) \cdot R(s, a) \right] + \lambda\, \mathbb{H}(\pi_{\theta}(a \mid s))$
where $\mathbb{H}(\pi_{\theta})$ is the entropy of the policy and $\lambda$ is a hyperparameter that controls the trade-off between reward maximization and entropy.
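A tiny NumPy sketch of just this entropy-regularized objective for a single-state softmax policy is given below; it is not a full MaxEnt IRL pipeline, and the rewards, lambda, and action count are illustrative assumptions.

import numpy as np

def softmax(z):
    z = z - z.max()                    # numerically stable softmax
    p = np.exp(z)
    return p / p.sum()

def entropy_regularized_objective(theta, rewards, lam=0.1):
    # J(theta) = E_pi[R] + lambda * H(pi) for a single-state softmax policy
    pi = softmax(theta)                               # action probabilities
    expected_reward = float(pi @ rewards)             # E_pi[R(a)]
    entropy = float(-(pi * np.log(pi + 1e-12)).sum()) # H(pi)
    return expected_reward + lam * entropy            # higher entropy is rewarded

theta = np.zeros(3)                                   # uniform policy over 3 actions
print(entropy_regularized_objective(theta, np.array([1.0, 0.5, 0.0])))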
Applications:
o Robotics (learning from human demonstration with less overfitting),
o Autonomous systems that need more generalized behaviors,
o Learning more robust policies in uncertain environments.
9) Explain Guided Back Propagation
Definition: Guided Backpropagation is a visualization technique for neural networks, particularly
convolutional neural networks (CNNs). It is an enhancement over traditional backpropagation,
designed to help visualize and understand which parts of an input image (or data) contribute most
significantly to a network's prediction.
How It Works:
o In standard backpropagation, gradients are propagated backward through the network to
update weights, and all activations are considered during the backward pass.
o In guided backpropagation, the backward pass through each ReLU is modified: the gradient is propagated only where both the forward ReLU activation and the incoming gradient are positive (negative values are set to zero). This modification leads to a more focused and interpretable visualization, highlighting areas of the input that strongly influence the network's predictions.
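The rule above can be implemented in PyTorch by attaching backward hooks to every ReLU; the toy untrained CNN, input size, and target class below are illustrative assumptions (in practice the model would be trained).

import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
    nn.Conv2d(8, 8, 3, padding=1), nn.ReLU(),
    nn.Flatten(), nn.Linear(8 * 32 * 32, 10),
)

def guided_relu_hook(module, grad_input, grad_output):
    # Autograd already zeroes the gradient where the forward ReLU input was negative;
    # additionally clamping negative incoming gradients to zero gives the guided rule.
    return (torch.clamp(grad_input[0], min=0.0),)

for m in model.modules():
    if isinstance(m, nn.ReLU):
        m.register_full_backward_hook(guided_relu_hook)

x = torch.randn(1, 3, 32, 32, requires_grad=True)
score = model(x)[0, 5]           # score of an arbitrary target class
score.backward()
guided_grads = x.grad            # (1, 3, 32, 32) attribution map over the input pixels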
10) Explain Visualizing Convolutional Neural Networks (CNNs)
Introduction: CNNs are often used in image recognition tasks, but they are considered "black box"
models, making it difficult to understand why certain predictions are made. Visualizing CNNs helps in
interpreting the internal workings of these models and understanding how they make decisions.
Methods of Visualizing CNNs:
1. Filter Visualization:
CNNs learn to recognize features through convolutional filters. Visualizing these filters
can give insight into what types of features (edges, textures, shapes, etc.) the model
is focusing on at different layers.
How: You can visualize the weights of the convolution filters directly by plotting them.
2. Feature Map Visualization:
After applying filters, the input image is passed through different layers, producing
feature maps. Visualizing feature maps helps understand how different regions of an
image activate in response to certain filters.
How: You can visualize the activations at each layer to see how features become
more abstract as they progress through the network.
3. Class Activation Mapping (CAM):
CAM helps visualize the parts of an image that are most important for a given class
prediction. It uses the global average pooling layer before the final softmax layer to
produce a heatmap that shows which areas of the image are crucial for the final
prediction.
Method: The heatmap is generated by weighting the feature maps of the last
convolutional layer by the importance of each channel for the class of interest.
4. Saliency Maps:
Saliency maps show the regions of an image that most affect the prediction by
highlighting pixels that have a high gradient with respect to the output.
How: This can be done by computing the derivative of the output class score with respect to the input image (a short code sketch is given at the end of this answer).
5. Guided Backpropagation and DeconvNet:
As mentioned earlier, guided backpropagation and DeconvNet (deconvolutional
networks) are techniques to visualize which input features are responsible for a
model's decision.
DeconvNet: Unlike guided backpropagation, deconvolution networks attempt to
reverse the convolution process, reconstructing the input from feature maps to
identify what part of the input contributed to certain activations.
Applications:
o Model Interpretability: Helps in understanding how CNNs make predictions.
o Debugging Models: Allows practitioners to verify if the model is learning the right
features.
o Improving Trust: By understanding which features are being focused on, CNNs can be
trusted more for applications in sensitive fields like healthcare, finance, etc.
Visualizing CNNs in this manner provides insights into the decision-making process of deep neural
networks, which is essential for improving models and ensuring they are functioning as intended.
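To make the saliency-map idea (method 4 above) concrete, here is a minimal PyTorch sketch that takes the gradient of a class score with respect to the input pixels; the helper name, toy model, and input size are illustrative assumptions, and any trained image classifier could be passed in instead.

import torch
import torch.nn as nn

def saliency_map(model, image, target_class=None):
    # Gradient of the (predicted) class score w.r.t. the input pixels.
    model.eval()
    image = image.clone().requires_grad_(True)      # (1, C, H, W)
    scores = model(image)
    if target_class is None:
        target_class = scores.argmax(dim=1).item()  # explain the predicted class
    scores[0, target_class].backward()
    # Pixel importance: max of |gradient| over colour channels -> (H, W) heatmap.
    return image.grad.abs().max(dim=1)[0].squeeze(0)

toy_model = nn.Sequential(nn.Conv2d(3, 4, 3, padding=1), nn.ReLU(),
                          nn.Flatten(), nn.Linear(4 * 64 * 64, 5))
heatmap = saliency_map(toy_model, torch.randn(1, 3, 64, 64))   # (64, 64) heatmap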