Exercise Problems

Uploaded by

Sangamesh


Consider the data given below and fit a linear regression line y = ax + b using gradient descent.

x = [0, 0.4, 0.6, 1], y = [0, 1, 0.48, 0.95]

Initialize the weights a and b to 0.8 and 0.2, respectively. Update the weights using gradient
descent so that the error is minimized. Use the sum of squared errors E = Σ(y - ŷ)^2 as the error
function, where ŷ is the predicted value and y is the actual given value. Plot the linear regression
line after updating the values of a and b for two iterations.

Here's the Python code to fit a linear regression line using gradient descent:

import numpy as np
import matplotlib.pyplot as plt

# Given data
x = np.array([0, 0.4, 0.6, 1])
y = np.array([0, 1, 0.48, 0.95])

# Initialize weights
a = 0.8
b = 0.2

# Learning rate
alpha = 0.1

# Number of iterations
n_iterations = 2

# Gradient descent
for i in range(n_iterations):
    # Predicted values
    y_pred = a * x + b

    # Error (predicted minus actual)
    error = y_pred - y

    # Sum of squared errors
    E = np.sum(error ** 2)

    # Print error
    print(f"Iteration {i + 1}, Error: {E}")

    # Update weights; both updates use the error computed above.
    # The factor 2 from differentiating the squared error is absorbed into alpha.
    a -= alpha * np.sum(error * x)
    b -= alpha * np.sum(error)

# Print final weights
print(f"Final weights: a = {a}, b = {b}")

# Predicted values
y_pred = a * x + b

# Plot linear regression line
plt.scatter(x, y, label="Data")
plt.plot(x, y_pred, label="Linear Regression Line", color="red")
plt.legend()
plt.show()

This code performs gradient descent for two iterations to update the weights a and b. The sum of squared
errors is used as the error function. Finally, the linear regression line is plotted using the updated weights.
Note that the learning rate alpha controls how quickly the weights are updated. A higher value of alpha can
lead to faster convergence but may also cause oscillations. A lower value of alpha can lead to more stable
convergence but may require more iterations.
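To make the effect of the learning rate concrete, the sketch below reruns the same update rule on the data above with two learning rates. The helper `run_gd`, the second rate of 0.5, and the iteration counts are illustrative assumptions, not part of the original exercise; `np.polyfit` is used only as a closed-form reference for the best-fit line.

```python
import numpy as np

x = np.array([0, 0.4, 0.6, 1])
y = np.array([0, 1, 0.48, 0.95])


def run_gd(alpha, n_iterations, a=0.8, b=0.2):
    """Apply the update rule from the exercise; return final a, b and the SSE history."""
    sse_history = []
    for _ in range(n_iterations):
        error = (a * x + b) - y
        sse_history.append(float(np.sum(error ** 2)))
        # Compute both gradients from the same error vector before updating
        grad_a = np.sum(error * x)
        grad_b = np.sum(error)
        a -= alpha * grad_a
        b -= alpha * grad_b
    return a, b, sse_history


# Illustrative settings (assumptions): a small and a large learning rate
a_small, b_small, sse_small = run_gd(alpha=0.1, n_iterations=200)
a_large, b_large, sse_large = run_gd(alpha=0.5, n_iterations=50)

# Closed-form least-squares line for comparison (slope first, then intercept)
a_ref, b_ref = np.polyfit(x, y, deg=1)

print(f"alpha=0.1: a={a_small:.4f}, b={b_small:.4f}, SSE={sse_small[-1]:.4f}")
print(f"alpha=0.5: SSE after 50 steps = {sse_large[-1]:.3e}")
print(f"least squares: a={a_ref:.4f}, b={b_ref:.4f}")
```

On this data the smaller rate settles close to the least-squares line, while the larger rate makes the error grow from step to step instead of shrinking.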
———————————————————————————————————————————————-

We have recorded the weekly average conversion rate of the Dollar over 6 consecutive
weeks. y is the weekly average conversion rate and x is the week number.
Fit the best possible function f to establish the relationship between the week number
and the conversion rate by applying gradient descent, where
f(x) = y = a + bx, x = [1, 2, 3, 4, 5, 6] and y = [10, 14, 18, 22, 25, 33]. The initial values of a and b
are a = 4.9 and b = 4.401. The learning rate is 0.05. The error rate of a and b should be
less than 0.01. Perform 5 epochs.

Here's the Python code to fit the best possible function using gradient descent:

import numpy as np
import matplotlib.pyplot as plt

# Given data
x = np.array([1, 2, 3, 4, 5, 6])
y = np.array([10, 14, 18, 22, 25, 33])

# Initial values of a (intercept) and b (slope)
a = 4.9
b = 4.401

# Learning rate
alpha = 0.05

# Error threshold
error_threshold = 0.01

# Number of epochs
n_epochs = 5

# Gradient descent
for epoch in range(n_epochs):
    # Predicted values
    y_pred = a + b * x

    # Error (predicted minus actual)
    error = y_pred - y

    # Mean squared error
    mse = np.mean(error ** 2)

    # Print error
    print(f"Epoch {epoch + 1}, MSE: {mse}")

    # Update a and b with the gradient of the MSE.
    # Averaging over the samples keeps the step stable; summing over them
    # makes the updates diverge on this data at alpha = 0.05.
    a -= alpha * 2 * np.mean(error)
    b -= alpha * 2 * np.mean(error * x)

    # Check if error threshold is reached
    if mse < error_threshold:
        print(f"Error threshold reached at epoch {epoch + 1}")
        break

# Print final values of a and b
print(f"Final values: a = {a}, b = {b}")

# Predicted values
y_pred = a + b * x

# Plot data and predicted line
plt.scatter(x, y, label="Data")
plt.plot(x, y_pred, label="Predicted Line", color="red")
plt.legend()
plt.show()

This code performs gradient descent for 5 epochs to update the values of a and b. The mean
squared error (MSE) is used as the error function. If the MSE is less than the error threshold (0.01),
the loop breaks. Finally, the predicted line is plotted using the final values of a and b.
——————————————————————————————————————————-

Consider the function f(x) = x^2 + 4x + 5

1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 2. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 2 with a learning rate α = 0.1, calculate
the updated value of x.
4. Explain why gradient descent moves towards a minimum in this function.

Here are the answers:

Step 1: Derive the gradient (first derivative) of f(x)

The gradient of f(x) = x^2 + 4x + 5 is f'(x) = 2x + 4.

Step 2: Calculate the gradient at x = 2

f'(2) = 2(2) + 4 = 8.

Step 3: Calculate the updated value of x after one step of gradient descent
The update rule for gradient descent is x_new = x_old - α * f'(x_old), where α is the learning rate.

x_new = 2 - 0.1 * 8 = 2 - 0.8 = 1.2.

Step 4: Explain why gradient descent moves towards a minimum in this function
Gradient descent moves towards a minimum in this function because the gradient points in the
direction of steepest ascent, and by moving in the opposite direction (i.e., subtracting the
scaled gradient from the current point), we are effectively moving downhill towards a local minimum.

In this specific case, the function f(x) = x^2 + 4x + 5 is a convex quadratic function, which means
it has a single global minimum, at x = -2 where f'(x) = 0. With a sufficiently small learning rate
(here 0 < α < 1, since f''(x) = 2), gradient descent converges to this minimum regardless of the
starting point.

The final answer is: x_new = 1.2
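The calculation above can be checked with a few lines of Python. This is a minimal sketch: the function names `f` and `grad_f` and the extra 100-step loop are illustrative additions, not part of the original exercise.

```python
def f(x):
    return x**2 + 4*x + 5


def grad_f(x):
    # First derivative of f
    return 2*x + 4


alpha = 0.1   # learning rate from the exercise
x_old = 2.0   # current point from the exercise

# One step of gradient descent
x_new = x_old - alpha * grad_f(x_old)
print(f"After one step: x = {x_new:.4f}")

# Illustrative extension (assumption): keep stepping to see where x ends up
x = x_old
for _ in range(100):
    x = x - alpha * grad_f(x)
print(f"After 100 steps: x = {x:.6f}, f(x) = {f(x):.6f}")
```

After enough steps x approaches -2, the point where f'(x) = 0.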

Here's a comprehensive explanation of the underlying concepts and fundamentals:

Calculus and Optimization

Calculus is a branch of mathematics that deals with the study of continuous change. It consists of two main
branches: Differential Calculus and Integral Calculus.

Differential Calculus is concerned with the study of rates of change and slopes of curves. It helps us
understand how functions change as their input changes.

Integral Calculus, on the other hand, is concerned with the study of accumulation of quantities. It helps us
find the area under curves, volumes of solids, and other quantities.

Optimization is a technique used to find the maximum or minimum of a function. It's a crucial concept in
machine learning, physics, engineering, and economics.

Gradient Descent

Gradient Descent is an optimization algorithm used to find the minimum of a function. It's a first-order
optimization algorithm, which means it uses only the first derivative of the function to update the
parameters.

Here's how Gradient Descent works:

1. Initialize the parameters: We start with an initial guess for the parameters.
2. Compute the gradient: We compute the gradient of the function with respect to the parameters.
3. Update the parameters: We update the parameters using the gradient and a learning rate.
4. Repeat: We repeat steps 2-3 until convergence.
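The four steps above can be sketched as a small reusable function. This is one possible version for a single-variable function; the name `gradient_descent`, the stopping tolerance `tol`, the cap `max_iter`, and the example function are all assumptions made for illustration.

```python
def gradient_descent(grad, x0, alpha=0.1, tol=1e-8, max_iter=1000):
    """Minimize a one-variable function given its derivative `grad`."""
    x = x0                      # Step 1: initialize the parameter
    steps = 0
    while steps < max_iter:
        g = grad(x)             # Step 2: compute the gradient
        if abs(g) < tol:        # Step 4: stop once the gradient is almost zero
            break
        x = x - alpha * g       # Step 3: move against the gradient
        steps += 1
    return x, steps


# Hypothetical example: h(x) = (x - 3)^2, with derivative h'(x) = 2(x - 3)
x_min, n_steps = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(f"x_min = {x_min:.6f} after {n_steps} steps")
```

Here "convergence" is taken to mean that the gradient is close to zero; other stopping rules, such as a small change in x between steps, are equally common.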

Learning Rate

The learning rate is a hyperparameter that controls how quickly the parameters are updated. A high learning
rate can lead to rapid convergence, but it can also cause the algorithm to overshoot the minimum. A low
learning rate can lead to slow convergence, but it's more stable.

Convex Functions

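A convex function is one whose graph lies on or below the line segment joining any two of its points.
Any local minimum of a convex function is also a global minimum, which is why convexity matters in
optimization: gradient descent on a differentiable convex function cannot get stuck at a point that is
not a global minimum.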

Quadratic Functions

A quadratic function is a polynomial function of degree two, f(x) = ax^2 + bx + c. It is convex and has a
single minimum when a > 0 (and concave with a single maximum when a < 0). Quadratic functions are
commonly used in optimization problems because they're easy to analyze and optimize.

Gradient

The gradient is a vector that points in the direction of the steepest ascent. It's a crucial concept in
optimization because it helps us update the parameters in the right direction.

Partial Derivatives

Partial derivatives are used to compute the gradient of a function. They measure the rate of change of the
function with respect to one of its variables while keeping the other variables constant.
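As a small illustration, partial derivatives can be approximated numerically by nudging one variable at a time while holding the other fixed. The function `g`, the helper `numerical_gradient`, and the step size `h` below are hypothetical choices for this sketch.

```python
import numpy as np


def g(x, y):
    # Hypothetical example function: g(x, y) = x^2 + 3xy
    return x**2 + 3*x*y


def numerical_gradient(func, x, y, h=1e-6):
    """Approximate (dg/dx, dg/dy) with central differences."""
    dfdx = (func(x + h, y) - func(x - h, y)) / (2 * h)  # vary x, hold y constant
    dfdy = (func(x, y + h) - func(x, y - h)) / (2 * h)  # vary y, hold x constant
    return np.array([dfdx, dfdy])


numeric = numerical_gradient(g, 1.0, 2.0)

# Analytic partial derivatives: dg/dx = 2x + 3y, dg/dy = 3x
analytic = np.array([2 * 1.0 + 3 * 2.0, 3 * 1.0])

print("numeric :", numeric)
print("analytic:", analytic)
```

The two results should agree to several decimal places, which is a handy sanity check when deriving gradients by hand.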

Chain Rule

The chain rule is a fundamental rule in calculus that helps us compute the derivative of a composite
function. It's used extensively in machine learning and optimization.

Mathematical Notation

Here's a brief explanation of the mathematical notation used in this explanation:

- f(x) denotes a function of x

- ∇f(x) denotes the gradient of f(x)
- ∂f/∂x denotes the partial derivative of f with respect to x
- α denotes the learning rate
- x denotes the parameters of the function

I hope this explanation helps you understand the underlying concepts and fundamentals of calculus,
optimization, and gradient descent!
To check if a function is convex or concave, you can use the following methods:

# First Derivative Test

1. Find the first derivative of the function, f'(x).
2. Check if the first derivative is monotonically increasing (i.e., f'(x1) ≤ f'(x2) whenever x1 < x2) or
monotonically decreasing (i.e., f'(x1) ≥ f'(x2) whenever x1 < x2).
3. If the first derivative is monotonically increasing, the function is convex.
4. If the first derivative is monotonically decreasing, the function is concave.

# Second Derivative Test

1. Find the second derivative of the function, f''(x).
2. Check if the second derivative is nonnegative (i.e., f''(x) ≥ 0 for all x) or nonpositive (i.e., f''(x) ≤ 0 for all x).
3. If the second derivative is nonnegative everywhere, the function is convex (strictly convex if f''(x) > 0 for all x).
4. If the second derivative is nonpositive everywhere, the function is concave (strictly concave if f''(x) < 0 for all x).

# Graphical Test
1. Plot the graph of the function.
2. Check if the graph curves upward over its whole domain (bowl-shaped, like x^2) or curves downward
over its whole domain (dome-shaped, like -x^2).
3. If the graph is curved upward, the function is convex.
4. If the graph is curved downward, the function is concave.

# Hessian Matrix Test (for multivariable functions)

1. Compute the Hessian matrix of the function, H(x).
2. Check if the Hessian matrix is positive semidefinite (i.e., all eigenvalues are nonnegative) or negative
semidefinite (i.e., all eigenvalues are nonpositive) at every point x.
3. If the Hessian matrix is positive semidefinite everywhere, the function is convex (strictly convex if it is
positive definite everywhere).
4. If the Hessian matrix is negative semidefinite everywhere, the function is concave.

Note that these tests are not mutually exclusive, and you can use a combination of them to verify the
convexity or concavity of a function.
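For quadratic functions the Hessian is a constant matrix, so the Hessian test reduces to inspecting one set of eigenvalues. The sketch below assumes that setting; the helper `classify_by_hessian`, its tolerance, and the three example quadratics are hypothetical. For a general function the same check would have to hold at every point, not just one.

```python
import numpy as np


def classify_by_hessian(H, tol=1e-12):
    """Classify a function with constant, symmetric Hessian H."""
    eigenvalues = np.linalg.eigvalsh(np.asarray(H, dtype=float))
    if np.all(eigenvalues >= -tol):
        return "convex"      # all eigenvalues nonnegative
    if np.all(eigenvalues <= tol):
        return "concave"     # all eigenvalues nonpositive
    return "neither"         # mixed signs: a saddle shape


# Hypothetical quadratics and their constant Hessians
result_bowl = classify_by_hessian([[2, 1], [1, 2]])      # f(x, y) = x^2 + xy + y^2
result_dome = classify_by_hessian([[-2, 0], [0, -4]])    # f(x, y) = -x^2 - 2y^2
result_saddle = classify_by_hessian([[2, 0], [0, -2]])   # f(x, y) = x^2 - y^2

print(result_bowl, result_dome, result_saddle)
```

A zero Hessian (a linear function) is reported as "convex" by this helper, although such a function is both convex and concave.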

——————————————————————————————————————————-

A particle travels along the path given by the function f(x) = 2sin(x) + 3cos(x), where x
represents time in seconds.
1. Plot the path function of the particle.
2. Find the rate of change of the path of the particle:
3. Plot the rate of change of the path of the particle.
4. Find the rate of change of path of the particle at x = 60° (or π/3 radians):
5. Plot the rate of change of the path of the particle at x = 60°.

Here are the answers:

Step 1: Plot the path function of the particle

To plot the path function of the particle, we can use a graphing calculator or a programming
language like Python. Here is the Python code to plot the function:

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 4*np.pi, 1000)

y = 2*np.sin(x) + 3*np.cos(x)

plt.plot(x, y)
plt.xlabel('Time (s)')
plt.ylabel('Position')
plt.title('Path of the Particle')
plt.grid(True)
plt.show()

This will generate a plot of the path function of the particle.

Step 2: Find the rate of change of the path of the particle

To find the rate of change of the path of the particle, we need to find the derivative of the function
f(x) = 2sin(x) + 3cos(x). Differentiating term by term with the derivatives of sine and cosine, we get:
f'(x) = 2cos(x) - 3sin(x)

Step 3: Plot the rate of change of the path of the particle

To plot the rate of change of the path of the particle, we can use the same Python code as before,
but with the derivative function instead of the original function:

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 4*np.pi, 1000)

y = 2*np.cos(x) - 3*np.sin(x)

plt.plot(x, y)
plt.xlabel('Time (s)')
plt.ylabel('Rate of Change')
plt.title('Rate of Change of the Path of the Particle')
plt.grid(True)
plt.show()

This will generate a plot of the rate of change of the path of the particle.

Step 4: Find the rate of change of path of the particle at x = 60° (or π/3 radians)
To find the rate of change of path of the particle at x = 60° (or π/3 radians), we can plug this value
into the derivative function:
f'(π/3) = 2cos(π/3) - 3sin(π/3)
= 2(1/2) - 3(√3/2)
= 1 - (3√3)/2
≈ -1.598
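The value can be double-checked numerically. In this sketch the names `f` and `f_prime` and the step size `h` are illustrative; the central-difference estimate is only a sanity check on the analytic derivative. Note that NumPy's trigonometric functions expect radians, so 60° is passed as π/3.

```python
import numpy as np


def f(x):
    return 2 * np.sin(x) + 3 * np.cos(x)


def f_prime(x):
    # Analytic derivative from Step 2
    return 2 * np.cos(x) - 3 * np.sin(x)


x0 = np.pi / 3   # 60 degrees in radians
h = 1e-6         # small step for the numerical estimate (assumption)

analytic = f_prime(x0)
numeric = (f(x0 + h) - f(x0 - h)) / (2 * h)

print(f"analytic f'(pi/3) = {analytic:.3f}")
print(f"numeric  f'(pi/3) = {numeric:.3f}")
```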

Step 5: Plot the rate of change of the path of the particle at x = 60°
To plot the rate of change of the path of the particle at x = 60°, we can use the same Python code
as before, but with the x-value set to π/3 radians:

import numpy as np
import matplotlib.pyplot as plt

x = np.pi/3
y = 2*np.cos(x) - 3*np.sin(x)

plt.plot(x, y, 'ro')
plt.xlabel('Time (s)')
plt.ylabel('Rate of Change')
plt.title('Rate of Change of the Path of the Particle at x = 60°')
plt.grid(True)
plt.show()

This will generate a plot with a red dot at the point (π/3, f'(π/3)).

The final answer is: f'(π/3) ≈ -1.598

Concept of the above problem :

I'd be happy to explain the underlying concepts and fundamentals of the problems we've been working on.

The problems we've been working on involve mathematical modeling, specifically using functions to model
real-world phenomena. We've been using calculus, specifically derivatives, to analyze these models.

Here are some key concepts and fundamentals to understand:

1. Functions: A function is a mathematical relationship between two variables. In our problems, the function
f(x) represents the position of the particle at time x.
2. Derivatives: A derivative represents the rate of change of a function with respect to its input. In our
problems, the derivative f'(x) represents the velocity of the particle at time x.
3. Calculus: Calculus is a branch of mathematics that deals with the study of continuous change. It involves
the use of limits, derivatives, and integrals to analyze functions and models.
4. Mathematical Modeling: Mathematical modeling involves using mathematical concepts and techniques
to describe and analyze real-world phenomena. In our problems, we're using functions and derivatives to
model the motion of a particle.
5. Position, Velocity, and Acceleration: Position, velocity, and acceleration are all related to the motion of an
object. Position refers to the location of the object, velocity refers to the rate of change of position, and
acceleration refers to the rate of change of velocity.
6. Trigonometric Functions: Trigonometric functions, such as sine and cosine, are used to model periodic
phenomena, such as the motion of a particle in a circular path.
7. Graphical Analysis: Graphical analysis involves using graphs to visualize and analyze functions and
models. In our problems, we're using graphs to visualize the position and velocity of the particle.

Some key concepts and techniques to understand when working with these problems include:

- Finding derivatives: This involves using the rules of differentiation, such as the power rule and the chain
rule, to find the derivative of a function.
- Interpreting derivatives: This involves understanding what the derivative represents in the context of the
problem, such as the velocity of a particle.
- Graphing functions: This involves using graphical techniques, such as plotting points and using graphing
calculators, to visualize functions and models.
- Analyzing graphs: This involves using graphical analysis to understand the behavior of functions and
models, such as identifying maxima and minima.

I hope this helps! Let me know if you have any specific questions or if there's anything else I can help with.

——————————————————————————————————————————-
Derivatives of trigonometric functions are used to find the rate of change of these functions with respect to their input.
Here are the derivatives of the six basic trigonometric functions:

# Derivatives of Trigonometric Functions

1. Sine Function: The derivative of sin(x) is cos(x).
- f(x) = sin(x) → f'(x) = cos(x)
2. Cosine Function: The derivative of cos(x) is -sin(x).
- f(x) = cos(x) → f'(x) = -sin(x)
3. Tangent Function: The derivative of tan(x) is sec^2(x).
- f(x) = tan(x) → f'(x) = sec^2(x)
4. Cotangent Function: The derivative of cot(x) is -csc^2(x).
- f(x) = cot(x) → f'(x) = -csc^2(x)
5. Secant Function: The derivative of sec(x) is sec(x)tan(x).
- f(x) = sec(x) → f'(x) = sec(x)tan(x)
6. Cosecant Function: The derivative of csc(x) is -csc(x)cot(x).
- f(x) = csc(x) → f'(x) = -csc(x)cot(x)
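The six formulas above can be spot-checked numerically. The sketch below compares each stated derivative with a central-difference estimate at one test point; the point `x0 = 0.7`, the step `h`, and the dictionary layout are arbitrary choices for illustration. Since NumPy has no built-in cot, sec, or csc, they are written in terms of sin, cos, and tan.

```python
import numpy as np

# name: (function, stated derivative)
derivative_pairs = {
    "sin": (np.sin, np.cos),
    "cos": (np.cos, lambda x: -np.sin(x)),
    "tan": (np.tan, lambda x: 1 / np.cos(x) ** 2),                      # sec^2(x)
    "cot": (lambda x: 1 / np.tan(x), lambda x: -1 / np.sin(x) ** 2),    # -csc^2(x)
    "sec": (lambda x: 1 / np.cos(x), lambda x: np.tan(x) / np.cos(x)),  # sec(x)tan(x)
    "csc": (lambda x: 1 / np.sin(x),
            lambda x: -1 / (np.sin(x) * np.tan(x))),                    # -csc(x)cot(x)
}

x0 = 0.7   # test point in radians (assumption)
h = 1e-6   # step for the central difference (assumption)

max_gap = 0.0
for name, (func, dfunc) in derivative_pairs.items():
    numeric = (func(x0 + h) - func(x0 - h)) / (2 * h)
    stated = dfunc(x0)
    max_gap = max(max_gap, abs(numeric - stated))
    print(f"{name}: stated = {stated:.6f}, numeric = {numeric:.6f}")
```

Agreement at one point does not prove a formula, but a mismatch is a quick way to catch a sign error.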

# Proofs and Applications

These derivatives can be proved using various methods, including the limit definition of the derivative, the
quotient rule, and trigonometric identities.

Derivatives of trigonometric functions have numerous applications in various fields, including:

- Physics and Engineering: To model the motion of objects in terms of position, velocity, and acceleration.
- Navigation and Surveying: To calculate distances, angles, and directions between locations.
- Computer Graphics: To create smooth and realistic animations and simulations.
- Medical Imaging: To reconstruct images of the body using techniques such as MRI and CT scans.

# Key Takeaways
- Derivatives of trigonometric functions are used to find the rate of change of these functions with respect to their input.
- The derivatives of the six basic trigonometric functions are well-established and widely used.
- These derivatives have numerous applications in various fields, including physics, engineering, navigation, surveying,
computer graphics, and medical imaging.

Practice Problems:

Calculus (5)

1. Find the derivative of f(x) = 3x^2 + 2x - 5.

2. A particle moves along a straight line with velocity v(t) = 2t + 1. Find its position at time t = 3.
3. Find the rate of change of the function f(x) = x^3 - 2x^2 + x + 1 at x = 2.
4. A company's profit function is given by P(x) = 200x - 2x^2 - 500. Find the rate of change of
profit with respect to the number of units sold.
5. Find the maximum value of the function f(x) = -x^2 + 4x + 3.

Linear Algebra (3)

1. Find the inverse of the matrix A = [[2, 1], [4, 3]].

2. Solve the system of linear equations 2x + 3y = 7, x - 2y = -3.
3. Find the determinant of the matrix A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]].

Statistics and Data Analysis (4)

1. Fit a linear regression model to the data points (1, 2), (2, 3), (3, 5), (4, 7).
2. Find the correlation coefficient between the variables x and y for the data points (1, 2), (2,
4), (3, 6), (4, 8).
3. A survey of 10 students found that their heights (in inches) were 60, 62, 65, 68, 70, 72, 75,
78, 80, 82. Find the mean and standard deviation of the heights.
4. A company's sales data for the past 5 years is 100, 120, 150, 180, 200. Find the trend line
using linear regression.

Mathematical Modeling (3)

1. A population of bacteria grows according to the model P(t) = 200e^(0.5t), where t is time
in hours. Find the rate of change of the population at t = 2 hours.
2. A ball is thrown upward from the ground with an initial velocity of 50 ft/s. Find the height
of the ball at time t = 2 seconds using the model h(t) = -16t^2 + 50t.
3. A company's cost function is given by C(x) = 200x + 500, where x is the number of units
produced. Find the rate of change of the cost with respect to the number of units produced.

# Problem 1
Consider the function f(x) = 3x^2 + 2x + 1.
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 1. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 1 with a learning rate α = 0.05, calculate
the updated value of x.

# Problem 2
Consider the function f(x) = x^3 - 2x^2 + x + 1.
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 2. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 2 with a learning rate α = 0.1, calculate
the updated value of x.

# Problem 3
Consider the function f(x) = 2x^2 + 3x + 1.
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 3. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 3 with a learning rate α = 0.05, calculate
the updated value of x.

# Problem 4
Consider the function f(x) = x^4 - 3x^3 + 2x^2 + x + 1.
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 1. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 1 with a learning rate α = 0.1, calculate
the updated value of x.

# Problem 5
Consider the function f(x) = 3x^2 + 2x + 1.
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 2. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 2 with a learning rate α = 0.05, calculate
the updated value of x.

# Problem 1
Consider the function f(x) = x^3 + 2x^2 - 3x + 1.
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 2. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 2 with a learning rate α = 0.1, calculate
the updated value of x.
4. Repeat steps 2-3 for 5 iterations and plot the convergence of x.

# Problem 2
Consider the function f(x) = 2x^2 + 3x + 1 + sin(x).
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 1. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 1 with a learning rate α = 0.05, calculate
the updated value of x.
4. Repeat steps 2-3 for 10 iterations and plot the convergence of x.

# Problem 3
Consider the function f(x, y) = x^2 + 2y^2 + 3xy + 1.
1. Derive the gradient (partial derivatives) of f(x, y) with respect to x and y.
2. Suppose the current point is (x, y) = (1, 2). Calculate the gradient at this point.
3. If we apply one step of gradient descent from (x, y) = (1, 2) with a learning rate α = 0.1,
calculate the updated values of x and y.
4. Repeat steps 2-3 for 5 iterations and plot the convergence of x and y.

# Problem 4
Consider the function f(x) = x^4 - 2x^3 + 3x^2 + x + 1.
1. Derive the gradient (first derivative) of f(x).
2. Suppose the current point is x = 2. Calculate the gradient at this point.
3. If we apply one step of gradient descent from x = 2 with a learning rate α = 0.05, calculate
the updated value of x.
4. Repeat steps 2-3 for 10 iterations and plot the convergence of x.

# Problem 5
Consider the function f(x, y) = 2x^2 + y^2 + 3xy + 1.
1. Derive the gradient (partial derivatives) of f(x, y) with respect to x and y.
2. Suppose the current point is (x, y) = (2, 1). Calculate the gradient at this point.
3. If we apply one step of gradient descent from (x, y) = (2, 1) with a learning rate α = 0.1,
calculate the updated values of x and y.
4. Repeat steps 2-3 for 5 iterations and plot the convergence of x and y.

# Problem 1
A particle travels along the path given by the function f(x) = 3sin(x) + 2cos(x) + x^2, where x
represents time in seconds.
1. Plot the path function of the particle.
2. Find the rate of change of the path of the particle.
3. Plot the rate of change of the path of the particle.
4. Find the rate of change of path of the particle at x = 90° (or π/2 radians).
5. Plot the rate of change of the path of the particle at x = 90°.

# Problem 2
A company's profit function is given by P(x) = 200x - 2x^2 - 500 + 3x^3, where x represents
the number of units sold.
1. Plot the profit function.
2. Find the rate of change of the profit function with respect to the number of units sold.
3. Plot the rate of change of the profit function.
4. Find the rate of change of profit at x = 100 units.
5. Plot the rate of change of profit at x = 100 units.

# Problem 3
A population of bacteria grows according to the model P(t) = 200e^(0.5t) + 3t^2, where t
represents time in hours.
1. Plot the population model.
2. Find the rate of change of the population model with respect to time.
3. Plot the rate of change of the population model.
4. Find the rate of change of population at t = 2 hours.
5. Plot the rate of change of population at t = 2 hours.

# Problem 4
A spring-mass system is modeled by the equation x(t) = 2cos(t) + 3sin(t) + t^2, where x
represents the displacement of the spring and t represents time.
1. Plot the displacement function.
2. Find the rate of change of the displacement function with respect to time.
3. Plot the rate of change of the displacement function.
4. Find the rate of change of displacement at t = π/4 radians.
5. Plot the rate of change of displacement at t = π/4 radians.

# Problem 5
A company's cost function is given by C(x) = 200x + 2x^2 + 500 + 4x^3, where x represents
the number of units produced.
1. Plot the cost function.
2. Find the rate of change of the cost function with respect to the number of units produced.
3. Plot the rate of change of the cost function.
4. Find the rate of change of cost at x = 50 units.
5. Plot the rate of change of cost at x = 50 units.
