Internship

Uploaded by

gururaj guru
Copyright
© All Rights Reserved

MVM COLLEGE OF ARTS SCIENCE AND MANAGEMENT

NAME : GURURAJ
REG NO : U18KN21S0007
TRAINER : BHAVANI
Predicting Used Car Prices Based on Their Features
CONTENT:
• INTRODUCTION
• PROBLEM STATEMENT
• OVERVIEW
• LIBRARIES
• ANALYSING
• CLEANING
• VISUALIZATION
• CONCLUSION
INTRODUCTION:
• Data cleaning and visualization are
fundamental steps in the data analysis
process, essential for ensuring the accuracy
and interpretability of any dataset.

• Data cleaning involves identifying and
rectifying errors or inconsistencies in the data
to make it suitable for analysis.
PROBLEM STATEMENT:

• Given a dataset containing information about
various cars, such as their make, model, year,
price, mileage, fuel type, and other attributes,
the task is to develop a predictive model that
can accurately estimate the price of a car
based on its features.
Overview of the Dataset:
• Name: The name of the car model.

• location: The location where the car is being sold.

• Year: The year in which the car was manufactured.

• km_Driven: The number of kilometers the car has been driven.

• Fuel_Type: The type of fuel the car uses (e.g., petrol, diesel, CNG, LPG).

• Transmission: The type of transmission the car has (e.g., manual, automatic).

• Owner_Type: The number of previous owners the car has had.

• Mileage: The fuel efficiency of the car in kilometers per liter.

• Engine: The size of the car's engine in cubic centimeters.

• Power: The maximum power output of the car's engine in bhp.

• Seats: The number of seats in the car.

• Price: The price at which the car is being sold.


LIBRARIES IMPORTED:
• PANDAS
• NUMPY
• MATPLOTLIB
• SEABORN
PANDAS:
• Pandas eases data analysis, data manipulation,
and data cleaning. It supports operations
such as sorting, re-indexing, iteration,
concatenation, data conversion,
visualization, and aggregation.
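
A minimal sketch of three of these operations (concatenation, sorting, and aggregation). The column names are borrowed from the dataset overview, but the rows are invented for illustration and are not taken from the real data.

```python
import pandas as pd

# Illustrative rows only; the values are invented for this sketch.
cars = pd.DataFrame({
    "Name": ["Maruti Swift", "Hyundai i20", "Honda City"],
    "Fuel_Type": ["Petrol", "Diesel", "Petrol"],
    "Price": [4.5, 6.0, 8.0],
})
more_cars = pd.DataFrame({
    "Name": ["Toyota Innova"],
    "Fuel_Type": ["Diesel"],
    "Price": [12.0],
})

# Concatenation: stack the two tables and renumber the rows.
all_cars = pd.concat([cars, more_cars], ignore_index=True)

# Sorting: most expensive car first.
by_price = all_cars.sort_values("Price", ascending=False)

# Aggregation: mean price per fuel type.
mean_price = all_cars.groupby("Fuel_Type")["Price"].mean()

print(by_price)
print(mean_price)
```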
NUMPY:
• The name “NumPy” stands for “Numerical
Python”.
• NumPy is used for working with arrays.
• NumPy is imported before use, conventionally
under the alias np, and its functions are
accessed through that name.
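
A short sketch of importing NumPy and calling a few of its array functions. The numbers are made-up kilometre readings, used only to have something to compute on.

```python
import numpy as np

# Illustrative values only (kilometres driven by four cars).
km_driven = np.array([72000, 41000, 46000, 87000])

print(km_driven.mean())   # average of the array
print(km_driven.max())    # largest value
print(km_driven / 1000)   # element-wise arithmetic: thousands of km
```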
MATPLOTLIB:
• Matplotlib is an open-source library for
plotting numerical data. It produces
high-quality figures such as pie charts,
histograms, scatter plots, and graphs.
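
As a rough illustration, the sketch below draws a scatter plot of price against year of manufacture. The data points and the output file name are assumptions made for this example; the "Agg" backend is selected only so that the script also runs on a machine without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; no display needed
import matplotlib.pyplot as plt

# Illustrative values only.
years = [2010, 2012, 2015, 2017, 2019]
prices = [2.5, 3.8, 5.5, 7.2, 9.0]

fig, ax = plt.subplots()
ax.scatter(years, prices)
ax.set_xlabel("Year")
ax.set_ylabel("Price")
ax.set_title("Price vs. year of manufacture")

# Hypothetical output file name.
fig.savefig("price_vs_year.png")
```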
SEABORN:
• Seaborn is a visualization library for
statistical graphics in Python. It provides
default styles and color palettes that make
statistical plots more attractive. It is built
on top of the matplotlib library and is
closely integrated with the data structures
from pandas.
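
A small sketch of the pandas integration mentioned above: Seaborn takes a DataFrame directly and refers to columns by name. The rows and the output file name are invented for this example.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; no display needed
import pandas as pd
import seaborn as sns

# Illustrative rows only.
cars = pd.DataFrame({
    "Year": [2010, 2012, 2015, 2017, 2019],
    "Price": [2.5, 3.8, 5.5, 7.2, 9.0],
    "Fuel_Type": ["Petrol", "Diesel", "Petrol", "Diesel", "Petrol"],
})

sns.set_theme()  # apply Seaborn's default style and palette

# Columns are passed by name; hue colours the points by fuel type.
ax = sns.scatterplot(data=cars, x="Year", y="Price", hue="Fuel_Type")

# Hypothetical output file name.
ax.figure.savefig("price_by_fuel.png")
```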
Dataset Structure:
• DataFrame.describe() shows basic statistical
details such as percentiles, mean, and
standard deviation for a DataFrame or a
Series of numeric values.

• DataFrame.info() provides essential details
such as the number of non-null values in
each column, the data type of each column,
and memory usage.
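
A minimal sketch of both calls on a tiny made-up table (the values are illustrative, not from the real dataset). Note that describe() summarises only the numeric columns by default, while info() prints its report directly.

```python
import pandas as pd

# Illustrative rows only.
cars = pd.DataFrame({
    "Name": ["Car A", "Car B", "Car C"],
    "Year": [2010, 2015, 2019],
    "Price": [2.5, 5.5, 9.0],
})

# Count, mean, std, min, quartiles, and max for the numeric columns.
summary = cars.describe()
print(summary)

# Non-null counts, column dtypes, and memory usage (printed, not returned).
cars.info()
```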
Read file:
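
The file-reading step could look roughly like the sketch below. The original file name is not given, so "cars.csv" is a hypothetical name, and the two CSV rows are invented so that the example runs on its own.

```python
import io
import pandas as pd

# Stand-in for the real file: a few invented rows in CSV form.
csv_text = """Name,location,Year,km_Driven,Fuel_Type,Price
Maruti Swift,Mumbai,2015,41000,Petrol,4.5
Hyundai i20,Pune,2017,36000,Diesel,6.0
"""

# With a file on disk this would be pd.read_csv("cars.csv"),
# where "cars.csv" is a hypothetical file name.
cars = pd.read_csv(io.StringIO(csv_text))

print(cars.head())   # first rows of the table
print(cars.shape)    # (number of rows, number of columns)
```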
Identifying Missing Values:
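
One common way to identify missing values is to count the nulls in each column with isnull().sum(). The sketch below assumes a small invented table with two gaps in it.

```python
import numpy as np
import pandas as pd

# Illustrative rows only; np.nan marks a missing value.
cars = pd.DataFrame({
    "Name": ["Car A", "Car B", "Car C"],
    "Mileage": [18.9, np.nan, 21.4],
    "Seats": [5, 5, np.nan],
    "Price": [4.5, 6.0, 8.0],
})

# isnull() flags each missing cell; sum() counts the flags per column.
missing_per_column = cars.isnull().sum()
print(missing_per_column)
```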
Handling Missing Values:

• Deletion (dropping rows/columns)
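
A sketch of deletion with dropna(), first by column and then by row. The table is invented, and "Extra" is a hypothetical column added only to show what an entirely empty column looks like.

```python
import numpy as np
import pandas as pd

# Illustrative rows only.
cars = pd.DataFrame({
    "Name": ["Car A", "Car B", "Car C"],
    "Mileage": [18.9, np.nan, 21.4],
    "Extra": [np.nan, np.nan, np.nan],  # hypothetical, entirely empty column
    "Price": [4.5, 6.0, 8.0],
})

# Drop columns in which every value is missing.
without_empty_cols = cars.dropna(axis=1, how="all")

# Drop the remaining rows that contain any missing value.
cleaned = without_empty_cols.dropna()

print(cleaned)
```

Dropping rows discards the other values in those rows as well, so it is worth checking how many rows are lost before and after.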
Identifying and Removing Duplicates:
• Removing duplicates is an essential step in
data cleaning and preprocessing, ensuring that
the data is accurate and reliable for further
analysis or modeling.
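
A minimal sketch of this step using duplicated() and drop_duplicates(). The rows are invented, and the third row deliberately repeats the first.

```python
import pandas as pd

# Illustrative rows only; the last row is an exact copy of the first.
cars = pd.DataFrame({
    "Name": ["Car A", "Car B", "Car A"],
    "Year": [2015, 2017, 2015],
    "Price": [4.5, 6.0, 4.5],
})

# duplicated() marks every repeat of an earlier row as True.
n_duplicates = cars.duplicated().sum()
print(n_duplicates)

# Keep the first occurrence of each row and renumber the index.
deduplicated = cars.drop_duplicates().reset_index(drop=True)
print(deduplicated)
```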
Visualization:
• Data visualization provides an organized,
pictorial representation of the data, which
makes it easier to understand, observe, and
analyze.
Types of Visualizations:

• Bar Charts
• Histograms
• Scatter Plot
• Pie Chart
Bar Chart:
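
A bar chart suits categorical columns. The sketch below counts cars per fuel type and plots the counts; the fuel types listed and the output file name are assumptions made for this example.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; no display needed
import matplotlib.pyplot as plt
import pandas as pd

# Illustrative rows only.
cars = pd.DataFrame({
    "Fuel_Type": ["Petrol", "Diesel", "Petrol", "CNG", "Diesel", "Petrol"],
})

# Number of cars in each fuel category.
fuel_counts = cars["Fuel_Type"].value_counts()

fig, ax = plt.subplots()
ax.bar(fuel_counts.index, fuel_counts.values)
ax.set_xlabel("Fuel_Type")
ax.set_ylabel("Number of cars")

# Hypothetical output file name.
fig.savefig("fuel_type_bar.png")
```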
Histogram:
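
A histogram shows how a numeric column is distributed. The sketch below bins a handful of invented prices into five bins; the bin count and output file name are arbitrary choices for illustration.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; no display needed
import matplotlib.pyplot as plt

# Illustrative values only.
prices = [2.5, 3.0, 3.8, 4.5, 5.5, 6.0, 7.2, 9.0, 12.0, 17.5]

fig, ax = plt.subplots()
# hist() returns the count in each bin and the bin edges.
bin_counts, bin_edges, _ = ax.hist(prices, bins=5)
ax.set_xlabel("Price")
ax.set_ylabel("Number of cars")

# Hypothetical output file name.
fig.savefig("price_hist.png")
```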
Pie Chart:
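
A pie chart shows each category's share of the whole. The sketch below uses invented counts of manual and automatic cars; the numbers and the output file name are assumptions.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; no display needed
import matplotlib.pyplot as plt

# Illustrative counts only.
labels = ["Manual", "Automatic"]
sizes = [7, 3]

fig, ax = plt.subplots()
# autopct prints each slice's percentage on the chart.
wedges, texts, autotexts = ax.pie(sizes, labels=labels, autopct="%1.0f%%")
ax.set_title("Transmission")

# Hypothetical output file name.
fig.savefig("transmission_pie.png")
```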
Conclusion:
• In conclusion, the car dataset reveals
valuable insights into the characteristics,
relationships, and trends associated with
various car attributes. Through
comprehensive analysis and exploration, we
gained a deeper understanding of car
ownership patterns, fuel preferences, pricing
trends, and location-based variations.
