Industrial Training Report

This document is a 4 week industrial training report submitted by Sunny at Ansh Infotech from July 1, 2021 to July 31, 2021 in partial fulfillment of a Bachelor of Engineering degree in Computer Science and Engineering. During the training, Sunny learned about data science topics including linear regression, classification algorithms, and natural language processing. Practical skills developed included working with Python libraries like NumPy, Pandas, Matplotlib, and Seaborn. The training culminated in a 911 call data analysis capstone project to identify high priority emergency calls.

Uploaded by

vikas godara

Industrial Training Report

A REPORT OF 4 WEEKS INDUSTRIAL TRAINING

at

Ansh Infotech
SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENT FOR THE AWARD

OF THE DEGREE OF

BACHELOR OF ENGINEERING

Computer Science and Engineering

1 JULY, 2021 - 31 JULY, 2021

SUBMITTED BY:

NAME: SUNNY
REGISTRATION NO. : 1940032

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

SANT LONGOWAL INSTITUTE OF ENGINEERING AND TECHNOLOGY


SANT LONGOWAL INSTITUTE OF ENGINEERING AND TECHNOLOGY

CANDIDATE'S DECLARATION

I, SUNNY, hereby declare that I have undertaken 4 weeks of Industrial Training at Ansh Infotech during the
period from 1 July, 2021 to 31 July, 2021 in partial fulfillment of the requirements for the award of the degree
of B.E. (Computer Science and Engineering) at SANT LONGOWAL INSTITUTE OF ENGINEERING AND
TECHNOLOGY, LONGOWAL. The work presented in the training report submitted to the
Department of Computer Science and Engineering at SANT LONGOWAL INSTITUTE OF ENGINEERING
AND TECHNOLOGY, LONGOWAL is an authentic record of training work.

Signature of the Student

The industrial training Viva – Voce Examination ________________________ has been held on

_________________ and accepted.

Signature of Examiner
ABSTRACT

Data science encompasses a set of principles, problem definitions, algorithms, and processes
for extracting nonobvious and useful patterns from large data sets. The training began with basic
problem solving in Python to revise earlier concepts, followed by reading data from CSV files,
performing operations on it, and visualizing it.

The algorithms covered were linear regression, classification algorithms, and natural
language processing.

Many of the elements of data science have been developed in related fields such as machine
learning and data mining.
ACKNOWLEDGEMENT

I am thankful to Ansh Infotech and my course instructor, Kriti Bhatia.

Our course instructor helped us by assigning tasks and by pointing us to online platforms for
obtaining data, such as Kaggle and UCI. She also provided books on data science, Python,
algorithms, and natural language processing. Every tutorial was conducted live.

Our course mates also helped each other, both in class and outside it.

Overall, the training was a great experience.


Introduction to Organisation

AIT is a leading global IT services, consulting and business solutions provider that
helps global enterprises re-imagine and transform their businesses through digital technology
transformation. The company harnesses the power of web development, mobile application
development, robotics, cloud, digital marketing, analytics and emerging technologies to help
its clients adapt to the digital world and succeed.
We render all kinds of development and design services, such as website designing, web
development, internet marketing, phone application development, ERP packages, software testing
and much more.

We have already trained thousands of students in developing all kinds of programs and apps and in
digital marketing.

We provide reliable project consultancy services in the information technology industry, providing
information to millions of industrialists and entrepreneurs to jump-start their careers, start businesses,
learn technologies, reduce risk and earn high returns.
SOFTWARE TRAINING WORK UNDERTAKEN

Statistics, and the use of statistical models, are deeply rooted within the field of Data Science. Data
Science started with statistics, and has evolved to include concepts/practices such as Artificial
Intelligence, Machine Learning, and the Internet of Things, to name a few. As more and more data
has become available, first by way of recorded shopping behaviors and trends, businesses have
been collecting and storing it in ever greater amounts. With growth of the Internet, the Internet of
Things, and the exponential growth of data volumes available to enterprises, there has been a flood
of new information or Big Data. Once the doors were opened by businesses seeking to increase
profits and drive better decision making, the use of Big Data started being applied to other fields,
such as medicine, engineering, and social sciences.

Data science is also more than “machine learning,” which is about how systems learn from data.
Systems may be trained on data to make decisions, and training is a continuous process, where
the system updates its learning and (hopefully) improves its decision-making ability with more
data. A spam filter is a good example of machine learning. As we feed it more data it keeps
changing its decision rules, using a Bayesian filter, thereby remaining ahead of the spammers. It is
this ability to adaptively learn that prevents spammers from gaming the filter.
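
The adaptive behaviour of a Bayesian spam filter can be illustrated with a minimal sketch. It assumes a naive Bayes classifier with add-one smoothing, which is one common way to build such a filter and not necessarily the one covered in the training. The class name, the labels "spam" and "ham", and the example messages are all hypothetical.

```python
import math
from collections import Counter


class NaiveBayesSpamFilter:
    """Tiny naive Bayes text classifier with add-one smoothing (illustrative)."""

    def __init__(self):
        self.word_counts = {"spam": Counter(), "ham": Counter()}
        self.message_counts = {"spam": 0, "ham": 0}

    def train(self, text, label):
        # Every labelled message updates the counts, so the decision
        # rule shifts as more data is fed in.
        self.word_counts[label].update(text.lower().split())
        self.message_counts[label] += 1

    def _log_score(self, words, label):
        vocabulary = set(self.word_counts["spam"]) | set(self.word_counts["ham"])
        total_words = sum(self.word_counts[label].values())
        total_messages = sum(self.message_counts.values())
        # log P(label) + sum over words of log P(word | label)
        score = math.log(self.message_counts[label] / total_messages)
        for word in words:
            count = self.word_counts[label][word]
            score += math.log((count + 1) / (total_words + len(vocabulary)))
        return score

    def is_spam(self, text):
        words = text.lower().split()
        return self._log_score(words, "spam") > self._log_score(words, "ham")


# Made-up training messages, at least one per label.
spam_filter = NaiveBayesSpamFilter()
spam_filter.train("win free money now", "spam")
spam_filter.train("free prize claim now", "spam")
spam_filter.train("meeting agenda for monday", "ham")
spam_filter.train("lunch on monday", "ham")

before = spam_filter.is_spam("crypto offer")   # words not seen yet
spam_filter.train("crypto offer inside", "spam")
after = spam_filter.is_spam("crypto offer")    # rule has adapted
print(before, after)
```

The same message is classified differently before and after the extra training example, which is the adaptive learning described above in miniature.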

(i) It is usually based on a theoretical breakthrough and is therefore well grounded in science.
(ii) It changes the existing economic paradigm.
(iii) The result is commoditization (e.g. Hadoop).
(iv) It makes available new data that leads to further data science.

Tools used:

1) VS Code
2) Miniconda
3) Python notebook
4) Python
5) Tableau
6) Heroku cloud deployment
7) HTML

INDUSTRIAL TRAINING WORK UNDERTAKEN

911 Calls Capstone Project

For this capstone project we analyzed 911 call data from Kaggle. The data contains the
following fields:

• lat: String variable, Latitude
• lng: String variable, Longitude
• desc: String variable, Description of the Emergency Call
• zip: String variable, Zipcode
• title: String variable, Title
• timeStamp: String variable, YYYY-MM-DD HH:MM:SS
• twp: String variable, Township
• addr: String variable, Address
• e: String variable, Dummy variable (always 1)

Practical implementation of techniques was done with:

• NumPy
• Pandas
• Matplotlib
• Seaborn

Using these libraries, we completed the 911 Calls Capstone Project.
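
The kind of analysis done on the 911 fields can be sketched as follows. The project itself used Pandas, Matplotlib and Seaborn; this sketch uses only the Python standard library so that it is self-contained. The sample rows are invented, and the assumption that `title` has the form "Reason: detail" is not stated in this report. `load_calls` is a hypothetical helper name.

```python
import csv
import io
from collections import Counter
from datetime import datetime

# Invented sample rows in the column layout listed in this report.
SAMPLE_CSV = """\
lat,lng,desc,zip,title,timeStamp,twp,addr,e
40.1,-75.1,Sample description A,19000,EMS: SAMPLE A,2015-12-10 17:40:00,TOWNSHIP A,SAMPLE RD,1
40.2,-75.2,Sample description B,19001,EMS: SAMPLE B,2015-12-10 17:45:10,TOWNSHIP B,SAMPLE AVE,1
40.3,-75.3,Sample description C,19002,Fire: SAMPLE C,2015-12-11 08:05:00,TOWNSHIP A,SAMPLE ST,1
40.4,-75.4,Sample description D,19003,Traffic: SAMPLE D,2015-12-11 17:10:30,TOWNSHIP C,SAMPLE LN,1
"""


def load_calls(csv_text):
    """Parse CSV rows and convert the timeStamp string to a datetime."""
    calls = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # timeStamp is stored as a string in YYYY-MM-DD HH:MM:SS format.
        row["timeStamp"] = datetime.strptime(row["timeStamp"], "%Y-%m-%d %H:%M:%S")
        # Assumption: the text before the first colon is the call reason.
        row["reason"] = row["title"].split(":")[0].strip()
        calls.append(row)
    return calls


calls = load_calls(SAMPLE_CSV)

# Simple aggregations of the kind usually plotted as bar charts.
calls_per_reason = Counter(call["reason"] for call in calls)
calls_per_hour = Counter(call["timeStamp"].hour for call in calls)
calls_per_township = Counter(call["twp"] for call in calls)

print(calls_per_reason.most_common())
print(calls_per_hour.most_common(1))
```

Converting `timeStamp` from a string to a datetime first is what makes grouping by hour, weekday or month straightforward.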


PROJECT WORK

In this project, you are going to predict insurance claims using a classification algorithm. The dataset
contains information on insurance claims. Each observation is a different policyholder with various
features such as the age of the person, the gender of the policyholder, body mass index (an indicator of
body weight relative to height), the number of children of the policyholder, the smoking status of the
policyholder, and the individual medical costs billed by health insurance.

Feature            Description
age                age of policyholder
sex                male (1) / female (0)
bmi                body mass index (kg/m^2)
children           number of children/dependents of policyholder
smoker             smoking status: nonsmoker (0) / smoker (1)
region             residential area: northeast (0) / northwest (1) / southeast (2) / southwest (3)
charges            medical cost
Insurance claim    yes (1) / no (0)

After completing this project, you will have a better understanding of how to build a logistic regression
model. In this project, you will apply the following concepts:

• Train-test split
• Correlation between the features
• Classification algorithm
• Logistic Regression
• K nearest neighbors
• Support Vector Machine
• Cloud Deployment
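
Two of the concepts listed above, the train-test split and logistic regression, can be sketched in a few lines. This is a dependency-free illustration and not the project's actual code, which is in the linked repository. The synthetic policyholders and the labelling rule are invented for the demo, and the function names are hypothetical.

```python
import math
import random


def train_test_split(rows, labels, test_fraction=0.25, seed=0):
    """Shuffle the indices once and hold out a fraction for testing."""
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)
    n_test = int(len(rows) * test_fraction)
    test_idx, train_idx = indices[:n_test], indices[n_test:]
    x_train = [rows[i] for i in train_idx]
    y_train = [labels[i] for i in train_idx]
    x_test = [rows[i] for i in test_idx]
    y_test = [labels[i] for i in test_idx]
    return x_train, x_test, y_train, y_test


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_probability(weights, bias, x):
    """Probability that the label is 1 for feature vector x."""
    return sigmoid(bias + sum(w * v for w, v in zip(weights, x)))


def fit_logistic_regression(rows, labels, learning_rate=0.1, epochs=300):
    """Fit weights and bias by stochastic gradient descent on the log-loss."""
    weights, bias = [0.0] * len(rows[0]), 0.0
    for _ in range(epochs):
        for x, y in zip(rows, labels):
            error = predict_probability(weights, bias, x) - y
            weights = [w - learning_rate * error * v for w, v in zip(weights, x)]
            bias -= learning_rate * error
    return weights, bias


# Synthetic policyholders: [scaled bmi, smoker flag]. The labelling rule
# is invented for this demo and is not the rule behind the real dataset.
rng = random.Random(42)
rows, labels = [], []
for _ in range(200):
    bmi, smoker = rng.uniform(18, 40), rng.randint(0, 1)
    rows.append([(bmi - 29) / 11, smoker])
    labels.append(1 if smoker == 1 or bmi > 30 else 0)

x_train, x_test, y_train, y_test = train_test_split(rows, labels)
weights, bias = fit_logistic_regression(x_train, y_train)
predictions = [1 if predict_probability(weights, bias, x) >= 0.5 else 0 for x in x_test]
accuracy = sum(p == y for p, y in zip(predictions, y_test)) / len(y_test)
print(f"test accuracy: {accuracy:.2f}")
```

Evaluating on the held-out quarter of the data, not on the training rows, is what gives an honest estimate of how the model would behave on new policyholders.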
Result

Source file:

https://github.com/sunny579/Insurance-Claim.git

Cloud deployment:

https://insurance-claim-app.herokuapp.com/

Conclusion and Future Scope

After completing the training and the final project, I have learnt how to analyse data, how to apply
different algorithms to various kinds of data, and the basics of natural language processing. In the
final project, we also learnt about cloud deployment.

Future scope of this project: it can be used by anyone who wants to check whether a person is
eligible for an insurance claim, based on the details filled in.

References:

Books

i) Jake VanderPlas, Python Data Science Handbook, 1005 Gravenstein Highway North,
Sebastopol, CA 95472
ii) Python Machine Learning, John Wiley & Sons, Inc., Indianapolis, Indiana
iii) Flavio Copes, The Python Handbook

Online Resources

i) https://uci.edu/
ii) https://www.kaggle.com/
iii) https://github.com/kriti-ixix/ml1030
