0% found this document useful (0 votes)

10 views

Housing prices linear regression

The document outlines a Python script that uses the scikit-learn library to perform linear regression on a housing dataset. It includes data preprocessing steps such as label encoding for categorical variables and normalization of features. The model is trained and evaluated using metrics like mean absolute error, mean squared error, and R-squared score.

Uploaded by

rananavdeep65

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views

Housing prices linear regression

Uploaded by

rananavdeep65

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

from sklearn.

linear_model import LinearRegression

from sklearn.metrics import mean_squared_error,mean_absolute_error,r2_score
from sklearn.model_selection import train_test_split

import pandas as pd
data=pd.read_csv("Housing.csv")
data

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 yes no no no yes 2

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2

3 12215000 7500 4 2 2 yes no yes no yes 3

4 11410000 7420 4 1 2 yes yes yes no yes 2

... ... ... ... ... ... ... ... ... ... ... ...

540 1820000 3000 2 1 1 yes no yes no no 2

541 1767150 2400 3 1 1 no no no no no 0

542 1750000 3620 2 1 1 yes no no no no 0

543 1750000 2910 3 1 1 no no no no no 0

544 1750000 3850 3 1 2 yes no no no no 0

545 rows × 13 columns

data.head(5) #first 5 rows will be printed.

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3 no

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2 no

data.head(10)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2

5 10850000 7500 3 3 1 yes no yes no yes 2 yes

6 10150000 8580 4 3 4 yes no no no yes 2 yes

7 10150000 16200 5 3 2 yes no no no no 0

8 9870000 8100 4 1 2 yes yes yes no yes 2 yes

9 9800000 5750 3 2 4 yes yes no no yes 1 yes

data.shape #tells us the number of rows and columns present in the csv file.

(545, 13)

data.info() #this returns not null values,column,datatype,and information about the data.
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 545 entries, 0 to 544
Data columns (total 13 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 price 545 non-null int64
1 area 545 non-null int64
2 bedrooms 545 non-null int64
3 bathrooms 545 non-null int64
4 stories 545 non-null int64
5 mainroad 545 non-null object
6 guestroom 545 non-null object
7 basement 545 non-null object
8 hotwaterheating 545 non-null object
9 airconditioning 545 non-null object
10 parking 545 non-null int64
11 prefarea 545 non-null object
12 furnishingstatus 545 non-null object
dtypes: int64(6), object(7)
memory usage: 55.5+ KB

from sklearn.preprocessing import LabelEncoder, MinMaxScaler #this command will convert object datatype into integer
le=LabelEncoder() #it converts the categorical entries into numerical entries.
data["mainroad"]=le.fit_transform(data["mainroad"])
data
#change raw feature vectors into a representation that is more suitable for the downstream estimators-sklearn.preproc

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 no no no yes 2

1 12250000 8960 4 4 4 1 no no no yes 3

2 12250000 9960 3 2 2 1 no yes no no 2

3 12215000 7500 4 2 2 1 no yes no yes 3

4 11410000 7420 4 1 2 1 yes yes no yes 2

... ... ... ... ... ... ... ... ... ... ... ...

540 1820000 3000 2 1 1 1 no yes no no 2

541 1767150 2400 3 1 1 0 no no no no 0

542 1750000 3620 2 1 1 1 no no no no 0

543 1750000 2910 3 1 1 0 no no no no 0

544 1750000 3850 3 1 2 1 no no no no 0

545 rows × 13 columns

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["guestroom"]=le.fit_transform(data["guestroom"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 no no yes 2 yes

1 12250000 8960 4 4 4 1 0 no no yes 3 no

2 12250000 9960 3 2 2 1 0 yes no no 2 yes

3 12215000 7500 4 2 2 1 0 yes no yes 3 yes

4 11410000 7420 4 1 2 1 1 yes no yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["basement"]=le.fit_transform(data["basement"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 no yes 2 yes

1 12250000 8960 4 4 4 1 0 0 no yes 3 no

2 12250000 9960 3 2 2 1 0 1 no no 2 yes

3 12215000 7500 4 2 2 1 0 1 no yes 3 yes

4 11410000 7420 4 1 2 1 1 1 no yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler
data["hotwaterheating"]=le.fit_transform(data["hotwaterheating"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 0 yes 2 yes

1 12250000 8960 4 4 4 1 0 0 0 yes 3 no

2 12250000 9960 3 2 2 1 0 1 0 no 2 yes

3 12215000 7500 4 2 2 1 0 1 0 yes 3 yes

4 11410000 7420 4 1 2 1 1 1 0 yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["prefarea"]=le.fit_transform(data["prefarea"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 0 1 2

1 12250000 8960 4 4 4 1 0 0 0 1 3

2 12250000 9960 3 2 2 1 0 1 0 0 2

3 12215000 7500 4 2 2 1 0 1 0 1 3

4 11410000 7420 4 1 2 1 1 1 0 1 2

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

data["furnishingstatus"]=le.fit_transform(data["furnishingstatus"])
data.head(5)

price area bedrooms bathrooms stories mainroad guestroom basement hotwaterheating airconditioning parking prefarea

0 13300000 7420 4 2 3 1 0 0 0 1 2

1 12250000 8960 4 4 4 1 0 0 0 1 3

2 12250000 9960 3 2 2 1 0 1 0 0 2

3 12215000 7500 4 2 2 1 0 1 0 1 3

4 11410000 7420 4 1 2 1 1 1 0 1 2

x=data.drop(columns=["price"])
y=data["price"]
y=y.values.reshape(-1,1)

scaler=MinMaxScaler()
x=scaler.fit_transform(x)
y=scaler.fit_transform(y)

lr=LinearRegression()
x_train,x_test,y_train,y_test=train_test_split(x,y,test_size=0.2)
lr.fit(x_train,y_train)
y_predict=lr.predict(x_test)

mae=mean_absolute_error(y_test,y_predict)
mse=mean_squared_error(y_test,y_predict)
r2=r2_score(y_test,y_predict)
print(mae,mse,r2)

0.06995281320799962 0.007960782075320859 0.6594122430015953

Loading [MathJax]/jax/output/CommonHTML/fonts/TeX/fontdata.js

Instant Download AI For Games 3e Millington PDF All Chapters
100% (4)
Instant Download AI For Games 3e Millington PDF All Chapters
62 pages
Multiple - Linear - Regression - AirBNB - Student - File0.2 - New (1) .Ipynb - Colaboratory
No ratings yet
Multiple - Linear - Regression - AirBNB - Student - File0.2 - New (1) .Ipynb - Colaboratory
8 pages
vertopal.com_housing_linear
No ratings yet
vertopal.com_housing_linear
3 pages
a
No ratings yet
a
2 pages
1722414346054
No ratings yet
1722414346054
18 pages
DA_lab2
No ratings yet
DA_lab2
5 pages
Mlext
No ratings yet
Mlext
1 page
Regression Algorithm
No ratings yet
Regression Algorithm
9 pages
Chirag HOusing Price Pred
No ratings yet
Chirag HOusing Price Pred
12 pages
Code 1
No ratings yet
Code 1
3 pages
Report
No ratings yet
Report
40 pages
Prac - 8 (1) - Jupyter Notebook
No ratings yet
Prac - 8 (1) - Jupyter Notebook
6 pages
178 - Regulinear - Ipynb - Colab
No ratings yet
178 - Regulinear - Ipynb - Colab
3 pages
House Price Prediction Models
No ratings yet
House Price Prediction Models
16 pages
ML Regression
No ratings yet
ML Regression
9 pages
T2_summary_VHA
No ratings yet
T2_summary_VHA
14 pages
House Price Prediction
No ratings yet
House Price Prediction
14 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
ml manual
No ratings yet
ml manual
9 pages
Ash Regression
No ratings yet
Ash Regression
11 pages
1684918425867
No ratings yet
1684918425867
14 pages
Evan Marie Carr - Python and SKlearn
No ratings yet
Evan Marie Carr - Python and SKlearn
32 pages
Faisal Nadeem (SAP# 30601)
No ratings yet
Faisal Nadeem (SAP# 30601)
7 pages
unit 3 5
No ratings yet
unit 3 5
4 pages
Housing Prices Notebook
No ratings yet
Housing Prices Notebook
14 pages
Assignment1
No ratings yet
Assignment1
3 pages
f3683849-7ca6-4854-8f96-af11b6e837ec
No ratings yet
f3683849-7ca6-4854-8f96-af11b6e837ec
20 pages
Deep Learning - House Price Prediction
No ratings yet
Deep Learning - House Price Prediction
17 pages
ML LinearRegression
No ratings yet
ML LinearRegression
10 pages
Week 12
No ratings yet
Week 12
2 pages
QB 1
No ratings yet
QB 1
11 pages
DT as Regressor-Follow
No ratings yet
DT as Regressor-Follow
4 pages
Ml Manual
No ratings yet
Ml Manual
30 pages
1 Data Mining 2 Lab - 2 3 Vinay Sirohi 4 2139472 5 Select Appropriate Dataset and Apply Data Reduction
No ratings yet
1 Data Mining 2 Lab - 2 3 Vinay Sirohi 4 2139472 5 Select Appropriate Dataset and Apply Data Reduction
7 pages
Kaggle Machine Learning
No ratings yet
Kaggle Machine Learning
6 pages
Machine Learning - Code - Jupiter
No ratings yet
Machine Learning - Code - Jupiter
14 pages
0.1 Guilherme Marthe - Boston House Pricing Challenge
100% (1)
0.1 Guilherme Marthe - Boston House Pricing Challenge
15 pages
Exercise - First Machine Learning Model
No ratings yet
Exercise - First Machine Learning Model
2 pages
Introduction To Machine Learning (ML) With Sklearn
No ratings yet
Introduction To Machine Learning (ML) With Sklearn
10 pages
DL_LR_1.ipynb - Colab
No ratings yet
DL_LR_1.ipynb - Colab
5 pages
Copy of Project 4 _ House Price Prediction.ipynb - Colab
No ratings yet
Copy of Project 4 _ House Price Prediction.ipynb - Colab
5 pages
DMV - 3 - Jupyter Notebook
No ratings yet
DMV - 3 - Jupyter Notebook
2 pages
Multiple - Linear - Regression - AirBNB - Solution-0.2 - New - Ipynb - Colaboratory
No ratings yet
Multiple - Linear - Regression - AirBNB - Solution-0.2 - New - Ipynb - Colaboratory
11 pages
California Housing Price Prediction .
No ratings yet
California Housing Price Prediction .
1 page
Document From Jahnavi
No ratings yet
Document From Jahnavi
20 pages
houses prices prediction model
No ratings yet
houses prices prediction model
11 pages
1_Lab Manual (ML)
No ratings yet
1_Lab Manual (ML)
42 pages
Data Analysis With Python - Jupyter Notebook
No ratings yet
Data Analysis With Python - Jupyter Notebook
10 pages
HOUSEPRICENB - Ipynb - Colab
No ratings yet
HOUSEPRICENB - Ipynb - Colab
2 pages
Setup: Chapter 2 - End-To-End Machine Learning Project
No ratings yet
Setup: Chapter 2 - End-To-End Machine Learning Project
31 pages
Real Estate Valuation Data Set: Section Order
No ratings yet
Real Estate Valuation Data Set: Section Order
17 pages
Linear Regression Analysis - Polynomial Regression
No ratings yet
Linear Regression Analysis - Polynomial Regression
25 pages
EDA
No ratings yet
EDA
14 pages
Capstone Project Report
No ratings yet
Capstone Project Report
187 pages
Machine Learning
No ratings yet
Machine Learning
1 page
Decision Tree Algorithm in Machine Learning
No ratings yet
Decision Tree Algorithm in Machine Learning
13 pages
One Hot Encoding
No ratings yet
One Hot Encoding
12 pages
Data Science Record_05
No ratings yet
Data Science Record_05
20 pages
Train
No ratings yet
Train
17 pages
DOC-20250405-WA0009.
No ratings yet
DOC-20250405-WA0009.
4 pages
365 Days of Gratitude: Feel It – Live It – Enjoy It
From Everand
365 Days of Gratitude: Feel It – Live It – Enjoy It
Emy Fortune
No ratings yet
Combining Artificial Intelligence With A Time-Tested Technical Analysis Indicator
No ratings yet
Combining Artificial Intelligence With A Time-Tested Technical Analysis Indicator
12 pages
Dr. JPSM PPT - Python
No ratings yet
Dr. JPSM PPT - Python
50 pages
Code With Firebase
No ratings yet
Code With Firebase
4 pages
Using Message Box
0% (1)
Using Message Box
2 pages
Introduction To Python Part 3
No ratings yet
Introduction To Python Part 3
2 pages
Software Testing and Quality Assurance: ETCS - 453
No ratings yet
Software Testing and Quality Assurance: ETCS - 453
53 pages
Working Elevator Java Code
No ratings yet
Working Elevator Java Code
11 pages
Dept Clearance FS 23 24
No ratings yet
Dept Clearance FS 23 24
1 page
CSC208 356 CSC208 395 162-CSC208
No ratings yet
CSC208 356 CSC208 395 162-CSC208
5 pages
Introduction To Computer Programming With Python Harris Wang pdf download
No ratings yet
Introduction To Computer Programming With Python Harris Wang pdf download
84 pages
Xi - C.SC - PT Iii - WS
No ratings yet
Xi - C.SC - PT Iii - WS
6 pages
Static Keyword
No ratings yet
Static Keyword
11 pages
Module1 - PCD Engineering Notes
No ratings yet
Module1 - PCD Engineering Notes
28 pages
Java For Beginners Get From Zero To Object Oriented Programming
100% (1)
Java For Beginners Get From Zero To Object Oriented Programming
162 pages
IMAT1908 CW Specification 2022
No ratings yet
IMAT1908 CW Specification 2022
6 pages
Common Intermediate Language
No ratings yet
Common Intermediate Language
7 pages
Exp 4 5 6 - Ddco Bcs302
No ratings yet
Exp 4 5 6 - Ddco Bcs302
16 pages
Microsoft Excel VBA programming for the absolute beginner 3rd ed Edition Duane Birnbaum - Read the ebook now with the complete version and no limits
50% (2)
Microsoft Excel VBA programming for the absolute beginner 3rd ed Edition Duane Birnbaum - Read the ebook now with the complete version and no limits
47 pages
Toxic Comment Detection Code Using LSTM: A Project On
No ratings yet
Toxic Comment Detection Code Using LSTM: A Project On
11 pages
Roshan Ass 6
No ratings yet
Roshan Ass 6
24 pages
Agr I
No ratings yet
Agr I
74 pages
C PDF
No ratings yet
C PDF
258 pages
Marwadi University Faculty of Diploma Studies Information and Communication Technology
No ratings yet
Marwadi University Faculty of Diploma Studies Information and Communication Technology
5 pages
Instructions: 1. This Examination Consists of FIVE Questions. 2. Answer Question ONE (COMPULSORY) and Any Other TWO Questions. Question ONE
No ratings yet
Instructions: 1. This Examination Consists of FIVE Questions. 2. Answer Question ONE (COMPULSORY) and Any Other TWO Questions. Question ONE
3 pages
Acp - Imp
No ratings yet
Acp - Imp
2 pages
Top 50+ Java Collections Interview Questions (2024)
No ratings yet
Top 50+ Java Collections Interview Questions (2024)
44 pages
Curriculum for Electronics and Coding With Arduino
No ratings yet
Curriculum for Electronics and Coding With Arduino
5 pages
QP Xii CS Set 2
No ratings yet
QP Xii CS Set 2
9 pages
A2SV G5 - Bitwise Operation - With Code-Merged
No ratings yet
A2SV G5 - Bitwise Operation - With Code-Merged
59 pages

Housing prices linear regression

Uploaded by

Housing prices linear regression

Uploaded by

from sklearn.

linear_model import LinearRegression

0 13300000 7420 4 2 3 yes no no no yes 2

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2

3 12215000 7500 4 2 2 yes no yes no yes 3

4 11410000 7420 4 1 2 yes yes yes no yes 2

540 1820000 3000 2 1 1 yes no yes no no 2

541 1767150 2400 3 1 1 no no no no no 0

542 1750000 3620 2 1 1 yes no no no no 0

543 1750000 2910 3 1 1 no no no no no 0

544 1750000 3850 3 1 2 yes no no no no 0

545 rows × 13 columns

data.head(5) #first 5 rows will be printed.

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3 no

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2 no

0 13300000 7420 4 2 3 yes no no no yes 2 yes

1 12250000 8960 4 4 4 yes no no no yes 3

2 12250000 9960 3 2 2 yes no yes no no 2 yes

3 12215000 7500 4 2 2 yes no yes no yes 3 yes

4 11410000 7420 4 1 2 yes yes yes no yes 2

5 10850000 7500 3 3 1 yes no yes no yes 2 yes

6 10150000 8580 4 3 4 yes no no no yes 2 yes

7 10150000 16200 5 3 2 yes no no no no 0

8 9870000 8100 4 1 2 yes yes yes no yes 2 yes

9 9800000 5750 3 2 4 yes yes no no yes 1 yes

0 13300000 7420 4 2 3 1 no no no yes 2

1 12250000 8960 4 4 4 1 no no no yes 3

2 12250000 9960 3 2 2 1 no yes no no 2

3 12215000 7500 4 2 2 1 no yes no yes 3

4 11410000 7420 4 1 2 1 yes yes no yes 2

540 1820000 3000 2 1 1 1 no yes no no 2

541 1767150 2400 3 1 1 0 no no no no 0

542 1750000 3620 2 1 1 1 no no no no 0

543 1750000 2910 3 1 1 0 no no no no 0

544 1750000 3850 3 1 2 1 no no no no 0

545 rows × 13 columns

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

0 13300000 7420 4 2 3 1 0 no no yes 2 yes

1 12250000 8960 4 4 4 1 0 no no yes 3 no

2 12250000 9960 3 2 2 1 0 yes no no 2 yes

3 12215000 7500 4 2 2 1 0 yes no yes 3 yes

4 11410000 7420 4 1 2 1 1 yes no yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

0 13300000 7420 4 2 3 1 0 0 no yes 2 yes

1 12250000 8960 4 4 4 1 0 0 no yes 3 no

2 12250000 9960 3 2 2 1 0 1 no no 2 yes

3 12215000 7500 4 2 2 1 0 1 no yes 3 yes

4 11410000 7420 4 1 2 1 1 1 no yes 2 no

0 13300000 7420 4 2 3 1 0 0 0 yes 2 yes

1 12250000 8960 4 4 4 1 0 0 0 yes 3 no

2 12250000 9960 3 2 2 1 0 1 0 no 2 yes

3 12215000 7500 4 2 2 1 0 1 0 yes 3 yes

4 11410000 7420 4 1 2 1 1 1 0 yes 2 no

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

from sklearn.preprocessing import LabelEncoder,MinMaxScaler

0.06995281320799962 0.007960782075320859 0.6594122430015953

You might also like