Synopsis For Mini Project
Synopsis For Mini Project
CS (AI – ML)
October, 2022
Page | 1
Car Price Predicting Model
PROJECT SYNOPSIS
For
BACHELOR OF TECHNOLOGY
October, 2022
Page | 2
Index
a. Cover Page
b. Declaration
c. Acknowledgement
d. Abstract
e. Index
1. Introduction
1.1. Problem Statement
1.2. Scope of Project
2. Tools/Environment Used
3. Analysis Document
3.1. E-R diagrams/Class diagrams/any related diagrams
3.2. Data flow diagrams/other similar diagrams,
.Limitations of the Project
4. Result and Future Scope of the Project
Page | 3
Declaration
I hereby declare that this project work entitled “Car Price Predicting Model”
has been prepared by me during the year 2020 – 21 under the guidance of Mrs.
Rachana Singh Sisodia, Department of CS (AI – ML), GNIOT, Greater Noida in the
partial fulfillment of Btech degree prescribed by the college.
I also declare that this project is the outcome of my own effort, that it has not been
submitted to any other university for the award of any degree.
Date: 10/10/2022
Aditya Mishra
2101321530003
Anuj Mishra
2101321530007
Page | 4
Acknowldegement
I have taken efforts in this project. However, it would not have been possible without the kind
support and help of many individuals and organizations. I would like to extend my sincere thanks
to all of them.
I am highly indebted to Mrs. Rachana Singh Sisodia for their guidance and constant supervision
as well as for providing necessary information regarding the project & also for their support in
completing the project.
I would like to express my gratitude towards my parents & member of GNIOT for their kind co-
operation and encouragement which help me in completion of this project.
I would like to express my special gratitude and thanks to industry persons for giving me such
attention and time.
My thanks and appreciations also go to my colleague in developing the project and people who
have willingly helped me out with their abilities.
Page | 5
Intoduction
What is machine learning?
Machine learning (ML) is a type of artificial intelligence (AI) that allows software applications
to become more accurate at predicting outcomes without being explicitly programmed to do so.
Machine learning algorithms use historical data as input to predict new output values.
Recommendation engines are a common use case for machine learning. Other popular uses
include fraud detection, spam filtering, malware threat detection, business process
automation (BPA) and Predictive maintenance.
Unsupervised learning: This type of machine learning involves algorithms that train
on unlabeled data. The algorithm scans through data sets looking for any meaningful
Page | 6
connection. The data that algorithms train on as well as the predictions or
recommendations they output are predetermined.
Page | 7
quality checking at the warehouse to product recommendation and sales demographics
on the website, all employ machine learning at various scales.
Price forecasting may be a feature of consumer-facing travel apps, such as Trainline or Hopper,
used to increase customer loyalty and engagement. At the same time, other businesses may also
use information about future prices. Entrepreneurs may need to define an optimal time to buy a
commodity to adjust prices of products or services that require a commodity (lumber, coffee,
gold), or evaluate the investment appeal of fixed assets.
In general, price forecasting is done by the means of descriptive and predictive analytics.
Page | 8
ML problem
A Car company has provided user-inputted text descriptions of its products,
including details like product category name, brand name, item condition and
other required data or feature of a car. Using this data, we have to come up
with a model that predicts the price of a car listed on car company as
accurately as possible. This looks like a standard regression problem.
Page | 9
Tools/Environment Used
learning terminology, we call this yardstick performance metric or simply metric. There are
various metrics to measure the performance of a regression model, e.g. mean absolute error,
mean squared error, mean squared logarithmic error, maximum residual error, median absolute
For this problem, Kaggle uses Root Mean Squared Logarithmic Error(RMSLE). Lesser the
Page | 10
Analysis Document
1.
Page | 11
Data Overview
2.
Page | 12
Result
We will built a Price predicting module based on supervised machine learning, with
the help of various regression tools.
Though this is the most simple model we’ve to built, the final predictors still
seem to have high correlations. One can go ahead and remove some of these
Thus, for now, the final model consists of the 7 variables mentioned above.
Future Work
Using deep learning was productive and yielded a very good score on test data. More
complex models such as LSTMs and Convolutional Neural Nets can be tried.
We can experiment with more complex MLPs by adding additional layers and larger
number of units in hidden layers.
Other vectorization schemes such as Wordbatch can be experimented with ML
models.
Regression models like FTRL and FM_FTRL can also be tried.
Page | 13
Future Scope
The scope of Machine Learning is not limited to the investment sector. Rather, it is expanding
across all fields such as banking and finance, information technology, media & entertainment,
gaming, and the automotive industry.
What is the scope of data science in future? Especially in a developing country like India, there is
scope for huge data-related operations such as data scientists, data analytics, big data engineers,
big data managers, and data architects. Take up this Advanced Certification in Data Science and
AI to make a lucrative career as a data scientist in India.
Page | 14