# Package imports
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
def clc_accuracy(y_true, y_predict):
""" use sklearn to calcuate the R2 score"""
from sklearn.metrics import r2_score
score = r2_score(y_true, y_predict)
return score
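# Usage note (ours, not from the original): sklearn's r2_score expects 1-D
# arrays of shape (n_samples,), while the network below produces (1, m) row
# vectors, so flatten before scoring, e.g. clc_accuracy(Y.ravel(), A2.ravel()).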
def sigmoid(Z):
"""
Compute the sigmoid of x
Arguments:
x -- A scalar or numpy array of any size.
Return:
s -- sigmoid(x)
"""
A = 1.0/(1.0+np.exp(-Z))
return A
def sigmoid_backward(dA, cache):
"""
Implement the backward propagation for a single SIGMOID unit.
Arguments:
dA -- post-activation gradient, of any shape
    cache -- 'Z', stored during the forward pass so the backward pass can be computed efficiently
Returns:
dZ -- Gradient of the cost with respect to Z
"""
Z = cache
    s = sigmoid(Z)  # reuse the forward helper
dZ = dA * s * (1 - s)
assert (dZ.shape == Z.shape)
return dZ
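# Hedged sanity check (added by us, not part of the original script): the
# analytic gradient from sigmoid_backward should match a centered
# finite-difference estimate of d(sigmoid)/dZ. The helper name
# _check_sigmoid_backward is ours, introduced only for illustration.
def _check_sigmoid_backward(shape=(3, 4), eps=1e-6):
    Z = np.random.randn(*shape)
    dA = np.ones(shape)  # an upstream gradient of ones isolates the local derivative
    numeric = (sigmoid(Z + eps) - sigmoid(Z - eps)) / (2 * eps)
    analytic = sigmoid_backward(dA, Z)
    return np.max(np.abs(numeric - analytic))  # should be on the order of 1e-9 or smaller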
def relu(Z):
"""
Implement the RELU function.
Arguments:
Z -- Output of the linear layer, of any shape
Returns:
A -- Post-activation parameter, of the same shape as Z
"""
A = np.maximum(0, Z)
assert (A.shape == Z.shape)
return A
def relu_backward(dA, cache):
"""
Implement the backward propagation for a single RELU unit.
Arguments:
dA -- post-activation gradient, of any shape
    cache -- 'Z', stored during the forward pass so the backward pass can be computed efficiently
Returns:
dZ -- Gradient of the cost with respect to Z
"""
Z = cache
    dZ = np.array(dA, copy=True)  # copy dA so the upstream gradient is not modified in place
    # The ReLU gradient is 1 where Z > 0 and 0 where Z <= 0
    dZ[Z <= 0] = 0
assert (dZ.shape == Z.shape)
return dZ
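# Note (ours): relu and relu_backward are provided as drop-in alternatives but
# are not used below -- forward_propagation uses a sigmoid hidden layer and a
# linear output unit.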
def layer_sizes(X, Y):
"""
Arguments:
X -- input dataset of shape (input size, number of examples)
Y -- labels of shape (output size, number of examples)
Returns:
n_x -- the size of the input layer
n_h -- the size of the hidden layer
n_y -- the size of the output layer
"""
    n_x = X.shape[0]  # size of the input layer (number of features)
    n_h = 4  # hidden layer size, hard-coded here; nn_model takes n_h as an argument instead
n_y = Y.shape[0] # size of output layer
return (n_x, n_h, n_y)
def initialize_parameters(n_x, n_h, n_y):
"""
Argument:
n_x -- size of the input layer
n_h -- size of the hidden layer
n_y -- size of the output layer
Returns:
params -- python dictionary containing your parameters:
W1 -- weight matrix of shape (n_h, n_x)
b1 -- bias vector of shape (n_h, 1)
W2 -- weight matrix of shape (n_y, n_h)
b2 -- bias vector of shape (n_y, 1)
"""
    np.random.seed(2)  # fixed seed so the random initialization is reproducible
    W1 = np.random.randn(n_h, n_x) * 0.01  # small random values break symmetry
b1 = np.zeros((n_h, 1))
W2 = np.random.randn(n_y, n_h) * 0.01
b2 = np.zeros((n_y, 1))
assert (W1.shape == (n_h, n_x))
assert (b1.shape == (n_h, 1))
assert (W2.shape == (n_y, n_h))
assert (b2.shape == (n_y, 1))
parameters = {"W1": W1,
"b1": b1,
"W2": W2,
"b2": b2}
return parameters
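# Illustrative shape check (a sketch we added): for 2 input features, 4 hidden
# units and 1 output, the shapes follow the docstring above:
#   params = initialize_parameters(2, 4, 1)
#   params["W1"].shape == (4, 2) and params["b1"].shape == (4, 1)
#   params["W2"].shape == (1, 4) and params["b2"].shape == (1, 1)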
def forward_propagation(X, parameters):
"""
Argument:
X -- input data of size (n_x, m)
parameters -- python dictionary containing your parameters (output of initialization function)
Returns:
    A2 -- the output of the second, linear unit (no final activation, so the network can produce real-valued outputs)
cache -- a dictionary containing "Z1", "A1", "Z2" and "A2"
"""
# Retrieve each parameter from the dictionary "parameters"
    W1 = parameters["W1"]
b1 = parameters["b1"]
W2 = parameters["W2"]
b2 = parameters["b2"]
# Implement Forward Propagation to calculate A2 (probabilities)
    Z1 = np.dot(W1, X) + b1
    A1 = sigmoid(Z1)  # hidden activation (a tanh alternative: A1 = np.tanh(Z1))
    Z2 = np.dot(W2, A1) + b2
    A2 = Z2  # linear output for regression (a classification alternative: A2 = sigmoid(Z2))
assert (A2.shape == (1, X.shape[1]))
cache = {"Z1": Z1,
"A1": A1,
"Z2": Z2,
"A2": A2}
return A2, cache
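# Quick forward-pass sketch (ours): five 2-feature examples yield a (1, 5) output.
#   X_demo = np.random.randn(2, 5)
#   A2_demo, _ = forward_propagation(X_demo, initialize_parameters(2, 4, 1))
#   A2_demo.shape == (1, 5)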
def compute_cost(A2, Y, parameters):
"""
Computes the cross-entropy cost given in equation (13)
Arguments:
A2 -- The sigmoid output of the second activation, of shape (1, number of examples)
Y -- "true" labels vector of shape (1, number of examples)
parameters -- python dictionary containing your parameters W1, b1, W2 and b2
Returns:
cost -- cross-entropy cost given equation (13)
"""
m = Y.shape[1] # number of example
# Compute the cross-entropy cost
#logprobs = np.multiply(np.log(A2), Y) + np.multiply(np.log(1 - A2), 1 - Y)
logprobs = - 0.5 * np.multiply(A2-Y, A2-Y)
cost = - 1.0 / m * np.sum(logprobs)
cost = np.squeeze(cost) # makes sure cost is the dimension we expect.
assert (isinstance(cost, float))
return cost
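# Hedged sanity check (ours): the cost above equals half the mean squared
# error; compute_cost ignores its parameters argument, so None is passed here.
def _check_cost(m=7):
    A2 = np.random.randn(1, m)
    Y = np.random.randn(1, m)
    expected = 0.5 * np.mean((A2 - Y) ** 2)
    return np.isclose(compute_cost(A2, Y, parameters=None), expected)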
def backward_propagation(parameters, cache, X, Y):
"""
    Implement backward propagation for the two-layer network defined above.
    Arguments:
    parameters -- python dictionary containing our parameters (W1, b1, W2, b2)
    cache -- a dictionary containing "Z1", "A1", "Z2" and "A2"
    X -- input data of shape (n_x, number of examples)
Y -- "true" labels vector of shape (1, number of examples)
Returns:
grads -- python dictionary containing your gradients with respect to different parameters
"""
m = X.shape[1]
# First, retrieve W1 and W2 from the dictionary "parameters".
W1 = parameters["W1"]
W2 = parameters["W2"]
# Retrieve also A1 and A2 from dictionary "cache".
A1 = cache["A1"]
A2 = cache["A2"]
Z1 = cache["Z1"]
    # Backward propagation: calculate dW1, db1, dW2, db2.
    dZ2 = A2 - Y  # gradient of the MSE cost with respect to the linear output Z2
    dW2 = 1.0/m * np.dot(dZ2, A1.T)
    db2 = 1.0/m * np.sum(dZ2, axis=1, keepdims=True)  # sum across examples, keep a 2D column vector
    dA1 = np.dot(W2.T, dZ2)
    dZ1 = sigmoid_backward(dA1, Z1)  # a tanh alternative: dZ1 = dA1 * (1 - np.power(A1, 2))
dW1 = 1.0/m * np.dot(dZ1, X.T)
db1 = 1.0/m * np.sum(dZ1, axis=1, keepdims=True)
grads = {"dW1": dW1,
"db1": db1,
"dW2": dW2,
"db2": db2}
return grads
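# Hedged end-to-end gradient check (a sketch we added, not part of the original):
# perturb a single entry of W1 and compare the finite-difference slope of the
# cost with the corresponding entry of dW1.
def _check_gradients(eps=1e-6):
    np.random.seed(0)
    X = np.random.randn(2, 5)
    Y = np.random.randn(1, 5)
    params = initialize_parameters(2, 4, 1)
    _, cache = forward_propagation(X, params)
    grads = backward_propagation(params, cache, X, Y)
    params["W1"][0, 0] += eps
    cost_plus = compute_cost(forward_propagation(X, params)[0], Y, params)
    params["W1"][0, 0] -= 2 * eps
    cost_minus = compute_cost(forward_propagation(X, params)[0], Y, params)
    params["W1"][0, 0] += eps  # restore the original weight
    numeric = (cost_plus - cost_minus) / (2 * eps)
    return np.isclose(numeric, grads["dW1"][0, 0], atol=1e-6)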
def update_parameters(parameters, grads, learning_rate=1.2):
"""
Updates parameters using the gradient descent update rule given above
Arguments:
parameters -- python dictionary containing your parameters
grads -- python dictionary containing your gradients
Returns:
parameters -- python dictionary containing your updated parameters
"""
# Retrieve each parameter from the dictionary "parameters"
W1 = parameters["W1"]
b1 = parameters["b1"]
W2 = parameters["W2"]
b2 = parameters["b2"]
# Retrieve each gradient from the dictionary "grads"
dW1 = grads["dW1"]
db1 = grads["db1"]
dW2 = grads["dW2"]
db2 = grads["db2"]
# Update rule for each parameter
W1 = W1 - learning_rate * dW1
b1 = b1 - learning_rate * db1
W2 = W2 - learning_rate * dW2
b2 = b2 - learning_rate * db2
parameters = {"W1": W1,
"b1": b1,
"W2": W2,
"b2": b2}
return parameters
def nn_model(X, Y, n_h, num_iterations=10000, print_cost=False):
"""
Arguments:
    X -- dataset of shape (n_x, number of examples)
Y -- labels of shape (1, number of examples)
n_h -- size of the hidden layer
num_iterations -- Number of iterations in gradient descent loop
print_cost -- if True, print the cost every 1000 iterations
Returns:
parameters -- parameters learnt by the model. They can then be used to predict.
"""
np.random.seed(3)
    n_x = layer_sizes(X, Y)[0]
    n_y = layer_sizes(X, Y)[2]
    # Initialize parameters, then run the gradient descent loop
    parameters = initialize_parameters(n_x, n_h, n_y)
    for i in range(num_iterations):
        A2, cache = forward_propagation(X, parameters)
        cost = compute_cost(A2, Y, parameters)
        grads = backward_propagation(parameters, cache, X, Y)
        parameters = update_parameters(parameters, grads)
        if print_cost and i % 1000 == 0:
            print("Cost after iteration %i: %f" % (i, cost))
    return parameters
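# End-to-end usage sketch (ours; the synthetic data below is an assumption, not
# from the original script): fit the network to a noisy linear target and score
# the fit with clc_accuracy (R^2).
def _demo():
    np.random.seed(1)
    X = np.random.randn(2, 200)  # 2 features, 200 examples
    Y = 0.7 * X[0:1, :] - 0.3 * X[1:2, :] + 0.05 * np.random.randn(1, 200)
    parameters = nn_model(X, Y, n_h=4, num_iterations=10000, print_cost=True)
    A2, _ = forward_propagation(X, parameters)
    print("R^2 on training data:", clc_accuracy(Y.ravel(), A2.ravel()))
    plt.scatter(Y.ravel(), A2.ravel(), s=8)  # predictions vs. targets
    plt.xlabel("Y true")
    plt.ylabel("Y predicted")
    plt.show()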