Loan Approval Prediction System Using Machina Learning
Loan Approval Prediction System Using Machina Learning
Abstract:- With the growth of the banking sector, the For banks, lending is the main area of work lending,
identification of reliable borrowers that must maintain money changing, taking deposits from customers, and
the natural core income and asset accumulation becomes earning profits by making interest from customers' loans.
a key issue. Despite all security measures, reliability of The goal here is the funding of creditworthy borrowers, but
customers remains an unclear question. To tackle this standard procedures through lending banks fail to provide
barrier, banking management which is directed towards protection for selected borrowers. To solve this problem, we
customer loan repayment consistency is required. Credit designed a credit finding algorithm with the help of Python
approval is a significant aspect of economy since it and began to get credit approvals from banks. This project
determines the allocation of credit-linked funds. Today, does an eligibility analysis of a client by a process of
machine learning is known for its power to automate and evaluating different parameters including marital status,
scale up the processing of application for loans. income, and expenditure to determine whether they meet the
capacity to repay the loan or not. We use the creates trained
This project will begin with data collection which data sets as a model that we directly then test the data to
will consist of data on historical (regarding the past) loan output a result of "yes" or "no" only to determine if a
applications and the borrower profiles. The dataset has customer or not. can repay the loan. It goes without saying
features of the credit score, income, previous work that This loan approval process is good business sense.
experience, debt-to-income ratio, and loan repayment
record. This way, the models learn through the strengths Automating approvement reduces the use of electricity,
to find good features and the reasons for accepting a upgrades the quality, and makes the customer satisfaction
loan. They are the experts in these areas and can forecast better. It can do so through machine learning and hence can
the potential patterns and connections of the data. do more precise and consistent loan decisions. This
Within the scope of this work, the supervised algorithms approach facilitates just and accurate loan approvals. ML
used are logistic regression, decision trees, random models are a solution to fair analysis and cut off human
forests, and support vector machines. These algorithms biases and errors in manual data analysis.
are applied to the dataset available to often produce
results like binary classification and regression. The II. METHODOLOGIES AND TECHNIQUES
adoption of machine learning among financial
institutions is intended for a faster processing of loans The supervised learning methods are employed to train
which is their benefit. What is credit scoring, it is a tool the prediction model in this loan approval ML task. We
which automate manual loan application review thereby research and study different machine learning algorithms
increases efficiency. The machine-learning algorithms such as logistic regression, decision trees, random forests,
that analyse applications for loans could cut down on the and support vector machines. Feature selection methods and
possibilities of human biases and mistakes which are an data preparation techniques are used to improve model
inherent part of the process. Also, ML uses the model to performance and learning. Undefined.
recognize borrowers who may default and subsequently
lower the likelihood of default. Part of the task involve The execution of the mentioned program may be
utilizing historical credit market data and implementing beneficial for banks and borrowers at the same time. The
ML algorithms to develop a highly accurate and reliable automated loan approval process simplifies decision-
loan approving system based on trained-data, random making, saves manual work and improves efficiency in the
forests, the stream of loans and reliable clients. financial institutions’ operations. ML model facilitates risk
management through precise identification of high-risk
Keywords:- Construction of Data, Creation of Correlation, customers, leading to fewer defaults and losses. Borrowers
Banker Loans, Customer Safety. appreciate a fair and clear loan approval system, in which
many factors are taken into consideration and credit scores
are not the key consideration in a holistic credit evaluation.
This paves the way for the ones with little or no credit
I. INTRODUCTION history or non-financial characters. ML techniques will be
used as a basis for developing a loan approval mechanism
that is more precise and operationally efficient for both support vector machine (SVM) algorithms combining so
lenders and borrowers. Provides predictive accuracy.
In this sphere we face data biases from historical cases E. Forecast Default for Networked- Guaranteed Loans
showing the gap in practices while the machine learning Dawei Cheng, Zibin Niu, Yi Tu, and Liqing Zhang
models are accurate in the quantitative domain but may be address the challenge of predicting the risk of guaranteed
less context sensitive and very complex. On top of being loans in a network by applying an imbalanced approach to
difficult to comply with, there exists the risk of over– the spread of a network with its neighbour a well-weighted
reliance on models. (pwkNN) to be used to forecast cooperative risk in their
proposed model) and algorithms are included.
To cope with these issues a fairness must be ensured,
and a coordinated and comprehensive strategy should be F. Personalized Credit Rating Using Artificial Intelligence
used that includes both human qualified experts. The loan Technology for Federal Student Loans
approval process involves a detailed consideration. It Jian Hu describes the development of a loan
considers contextual factors and complies with regulatory management system designed for federal student loans using
guidelines. The objective in mind is making choices Artificial Neural Network (ANN) technology. Leveraging
between risks and opportunities wisely. the adaptive capabilities of ANN, proposed back
propagation neural network reveals effective credit rating
III. LITERATURE SURVEY evaluation for college students despite the presence of credit
history.
A. An Approach Using Machine Learning Algorithms for
Loan Approval Prediction IV. DATA ANALYSIS
The study by Mohammad Ahmed Sheikh, Amit Kumar
Goel and Tapas Kumar highlights the importance of The course starts with a thorough examination of a
predicting defaults of banking systems to reduce non- historical credit data that contains attributes of applicants as
performing assets (NPAs) and increase profitability. The well as their credit outcomes. The data set has been carefully
paper compares different forecasting methods, with a crafted and cleaned for missing values, redundancies, and
particular focus on Logistic Regression models. class imbalance issues.
Through the study it is concluded that the addition of Exploratory data analysis methods are used to study
individual characteristics to traditional checking account the distribution of loan permissions among different
information improves forecasting accuracy. This highlights demographic groups, types of loans, and risk types, and
the importance of considering a range of consumer factors feature engineering techniques to get additional knowledge
for loan approval decisions. and the accuracy of ML models rises.
B. Default Forecasting of Loans Through Data Mining The study starts with a comprehensive study of
Bhumi Patel, Harshal Patil, Jovita Hembram, and Sri historical credit data, which includes various components on
Jaswal propose the use of data mining algorithms to predict applicants’ financial background and credit outcomes The
loan defaults. By using datasets containing home loan data set is carefully crafted and pre-processed to improve its
applications, the paper aims to help banks make more standards of relevance, duplicates, and class imbalance.
rational decisions.
With exploratory data analysis we determine how the
The study highlights the need for accurate accounting loan approvals are distributed through different demographic
to minimize losses and increase loan approvals. groups, loan types and risk types and by applying feature
engineering techniques we gain useful information and
C. Forecasting Loan Position in Commercial Banks by improve our ML models performance.
Machine Learning Classifier
G. Arutjothi, Dr. C.S. Senthamarai will focus on V. MODEL DEVELOPMENT
building credit scoring models for commercial banks using
machine learning techniques. The paper recommends a The fundamental part of the research is the
machine learning classification-based approach, where Min- development of ML algorithms and the estimation of their
Max normalization and K-Nearest Neighbour (K-NN) accuracy in classification of loan applicants. The algorithms
algorithm are combined the goal is to predict the loan used, such as logistic regression, decision trees, random
condition more accurately, thus helping in making credit forest, and gradient boosting, are many.
decisions.
With the use of boosting the machines and neural
D. Bank Loan Overpayment Prediction Based on LSTM- networks are getting trained and improved with cross-
SVM validation method. Hyperparameter tuning and ensemble
Xin Li, Xianzhong Long, Guozi Sun, Geng Yang, and learning approaches are included to increase the model
Huakang Li propose a new method for bank loan default accuracy, stability, and generalization capability.
prediction using long-term short-term memory (LSTM) and
Similarly, model interpretability is critical as a variable respectively. The decision tree classification model achieved
in this regard as it allows the processes of a loan to be an accuracy of above 90%, with precision, recall and F1
understood and the procedure to remain transparent. scores of 0.91, 0.90 and 0.90 respectively.
Table 2 Results
Metric Value
Logistic Regression Accuracy 72%
Recall 86%
Precision 70%
Fig 1: Workflow of Proposed System
F1 Score 65%
VI. IMPLEMENTATION AND RESULT Decision Tree Accuracy 90%
REFERENCES