Sat - 40.Pdf - Agricultural Product Price and Crop Cultivation Prediction Based On SMLT
References
Appendices
A. Source Code
B. Screen Shots
C. Publication with Plagiarism Report
List of Tables
3 Data set1
4 Data set2
5 Yield of Cost Production
6 Cost Production of Crop
7 Decision Tree
8 Random Forest
9 Naive Bayes
10 Support Vector
CHAPTER 1
INTRODUCTION
Agricultural research has boosted the world economy and offers enormous benefits
to society as a whole. Crop evaluation in agriculture remains challenging, despite
recent improvements in the availability of technical resources and techniques.
Precision farming and agri-technology, sometimes referred to as virtual farming,
are current research areas that use data-intensive approaches to boost
agricultural productivity while lowering environmental impact. Accurate crop
detection focuses on ecological and soil conditions. Agricultural production
depends on a number of factors and has been the subject of decades of research.
The majority of existing algorithms for crop yield estimation involve machine
learning (ML), but very little has been done to forecast the crops suited to a
territory based solely on soil, climate, and environmental data. Crop cultivation
is influenced by a variety of factors, including soil texture, nutrients (N, P,
and K), micronutrients (Fe, B, and Mn), temperature, and rainfall. Because these
characteristics vary by zone, resulting in a large data set, it is necessary to
pick the key aspects that aid in identifying suitable crops for particular land
regions.
Feature selection (FS) approaches are used to carry out the procedure.
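The report does not state which feature selection method is used, so the following is only a minimal sketch of one common filter-style approach: rank each candidate feature by the absolute value of its Pearson correlation with the target and keep the top k. The feature names, the toy values, and the helper names `pearson` and `select_top_features` are assumptions made for illustration.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / math.sqrt(var_x * var_y)

def select_top_features(features, target, k):
    """Return the names of the k features with the largest |correlation| to target."""
    scores = {name: abs(pearson(column, target)) for name, column in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]

# Hypothetical toy values, for illustration only (not taken from the report's data).
features = {
    "rainfall": [100.0, 200.0, 300.0, 400.0, 500.0],
    "temperature": [30.0, 25.0, 35.0, 20.0, 31.0],
    "soil_ph": [6.5, 6.5, 6.5, 6.5, 6.5],  # constant, so it scores 0
}
yield_per_hectare = [10.0, 20.0, 30.0, 40.0, 50.0]

selected = select_top_features(features, yield_per_hectare, k=2)
print(selected)
```

A correlation filter only captures linear, one-feature-at-a-time relationships; wrapper or embedded methods may suit zone-dependent soil and climate data better.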
LITERATURE SURVEY
Title : Forecasting crop yield using remotely sensed vegetation indices and
Year : 2013
Author : Paul C. Doraiswamy, Sophie Moulin, Paul W. Cook, and Alan Stern
Year : 2010
Wheat is one of the key cereal crops grown worldwide. The forecasts of production
in Kansas closely matched the USDA/NASS reported numbers with a 7% error.
The same regression model forecast winter wheat production in Ukraine within
10% of the official reported numbers six weeks prior to harvest.
Title : Plant Yield Prediction Model Using Firefly based Feature Selection with
Modified Fuzzy Cognitive Maps
Author : D. Sabareeswaran and R. Gunasundari
Title : Crop yield forecasting on the Canadian Prairies using MODIS NDVI data
Year : 2010
METHODOLOGY
Machine learning predicts the future from past data. Machine learning (ML) is a
type of artificial intelligence (AI) that provides computers with the ability to
learn without being explicitly programmed. It focuses on the development of
computer programs that can change when exposed to new data. This section covers
the basics of machine learning and the implementation of a simple machine
learning algorithm using Python. Training and prediction involve the use of
specialized algorithms: the training data is fed to an algorithm, and the
algorithm uses this training data to make predictions on new test data. Machine
learning can be roughly separated into three categories: supervised learning,
unsupervised learning, and reinforcement learning. In supervised learning, the
program is given both the input data and the corresponding labels, so the data
has to be labeled by a human being beforehand. In unsupervised learning, no
labels are provided to the learning algorithm, which has to figure out the
clustering of the input data on its own. Finally, reinforcement learning
dynamically interacts with its environment and receives positive or negative
feedback to improve its performance.
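As an illustration of the supervised train-then-predict workflow described above, here is a minimal sketch of a k-nearest-neighbour classifier written with the Python standard library only. It is not the report's implementation; the feature layout (rainfall, temperature), the crop labels, the toy values, and the function name `knn_predict` are assumptions for illustration.

```python
import math
from collections import Counter

def knn_predict(train_x, train_y, query, k=3):
    """Predict the label of `query` by majority vote among its k nearest training points."""
    # Pair each training point with its label and sort by Euclidean distance to the query.
    neighbours = sorted(zip(train_x, train_y), key=lambda pair: math.dist(pair[0], query))
    votes = Counter(label for _, label in neighbours[:k])
    return votes.most_common(1)[0][0]

# Hypothetical labeled training data: (rainfall in mm, mean temperature in deg C) -> crop.
# Features are left unscaled here for brevity; real data would usually be normalized.
train_x = [(200.0, 30.0), (220.0, 32.0), (210.0, 29.0),
           (60.0, 18.0), (50.0, 20.0), (70.0, 17.0)]
train_y = ["rice", "rice", "rice", "wheat", "wheat", "wheat"]

# A new, unlabeled observation plays the role of the test data.
prediction = knn_predict(train_x, train_y, (65.0, 19.0))
print(prediction)
```

The labels in `train_y` are what make this supervised: dropping them and grouping the points by distance alone would turn it into the unsupervised (clustering) setting.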
The demo data set is supplied to the machine learning model, and the model is
trained on it. Every new detail entered in the application form acts as test
data. After testing, the model makes its prediction based on the inference it
draws from the training data. Satellite imagery (remote sensing data) has been
widely used for predicting crop yield. This data is collected using sensors
mounted on satellites or planes, which detect the energy (electromagnetic waves)
reflected or diffracted from the surface of the earth. Remote sensing data offers
many energy bands, but only a few of them have been used for crop yield
prediction. Some researchers have generated relevant features from the bands that
are typically ignored and have improved their results by doing so. With this kind
of data, the high-order moments of the features are rarely explored. On these
datasets, researchers have used algorithms such as regression models, Random
Forest, and Nearest Neighbor.
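To make the idea of high-order moments concrete, the sketch below computes the mean, variance, skewness (third standardized moment), and kurtosis (fourth standardized moment) of the values in a single band. The reflectance values and the function names are hypothetical; the report does not specify how such features would be computed.

```python
def central_moment(values, order):
    """Population central moment of the given order."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** order for v in values) / len(values)

def moment_features(values):
    """Summarize a band's values by its first four moments."""
    mean = sum(values) / len(values)
    variance = central_moment(values, 2)
    if variance == 0:
        skewness = kurtosis = 0.0  # a constant band has no shape to describe
    else:
        skewness = central_moment(values, 3) / variance ** 1.5
        kurtosis = central_moment(values, 4) / variance ** 2
    return {"mean": mean, "variance": variance,
            "skewness": skewness, "kurtosis": kurtosis}

# Hypothetical reflectance values for the pixels of one field in one band.
band_values = [0.10, 0.12, 0.11, 0.35, 0.13]
band_features = moment_features(band_values)
print(band_features)
```

The resulting dictionary could be appended to the feature vector of a field alongside the usual per-band means.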
Variable : Description
Crop : Crop name
State Name : Indian state name
District Name : District name list of each state
Cost of Cultivation (₹/Hectare) C2 : Cultivation amount for C2 Scheme
Cost of Production (₹/Quintal) C2 : Production amount for A2+FL Scheme
Yield (Quintal/Hectare) : Yield of crop
Crop year : Crop year list
District Name : District name for each state
Area : Total area of each place
Rainfall : Water availability of each crop
Average humidity : Directly influences the water relations of plant and indirectly affects leaf growth
Mean Temperature : Climate of each crop
Cost Production of per yield crop : Cost of crop yield
Dataset Attributes
Data Pre-processing
Data Analysis of Visualization
Comparing Algorithm with prediction in the form of best accuracy result
Deployment Using GUI
Architecture Diagram
Block Diagram
Validation techniques in machine learning are used to estimate the error rate of
the machine learning (ML) model, which can be considered close to the true error
rate of the dataset. If the data volume is large enough to be representative of
the population, validation techniques may not be needed. However, in real-world
scenarios, we work with samples of data that may not be truly representative of
the population of the given dataset. Pre-processing involves finding missing
values, duplicate values, and the data type of each variable, such as whether it
is a float or an integer. The validation set is the sample of data used to
provide an unbiased evaluation of a model fit on the training dataset while
tuning model hyperparameters.
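The checks and the hold-out evaluation described above can be sketched as follows. This is an illustrative example under stated assumptions, not the report's code: the rows, the column names, the majority-class baseline, and the helpers `quality_report`, `holdout_split`, and `error_rate` are all hypothetical.

```python
import random
from collections import Counter

def quality_report(rows):
    """Count missing values per column, duplicate rows, and the value types seen."""
    missing = Counter()
    types = {}
    seen = set()
    duplicates = 0
    for row in rows:
        key = tuple(sorted(row.items()))  # hashable fingerprint of the row
        if key in seen:
            duplicates += 1
        seen.add(key)
        for column, value in row.items():
            if value is None:
                missing[column] += 1
            else:
                types.setdefault(column, set()).add(type(value).__name__)
    return {"missing": dict(missing), "duplicates": duplicates, "types": types}

def holdout_split(rows, validation_fraction=0.25, seed=0):
    """Shuffle a copy of the rows and split it into (training, validation) lists."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_validation = max(1, int(len(shuffled) * validation_fraction))
    return shuffled[n_validation:], shuffled[:n_validation]

def error_rate(predict, rows, label_key):
    """Fraction of rows whose predicted label differs from the true label."""
    wrong = sum(1 for row in rows if predict(row) != row[label_key])
    return wrong / len(rows)

# Hypothetical rows, for illustration only.
rows = [
    {"rainfall": 200.0, "area": 10, "crop": "rice"},
    {"rainfall": 200.0, "area": 10, "crop": "rice"},   # exact duplicate
    {"rainfall": None, "area": 7, "crop": "wheat"},    # missing rainfall
    {"rainfall": 60.0, "area": 5, "crop": "wheat"},
]
report = quality_report(rows)
training, validation = holdout_split(rows)

# Baseline "model": always predict the most common crop in the training rows.
majority = Counter(row["crop"] for row in training).most_common(1)[0][0]
rate = error_rate(lambda row: majority, validation, "crop")
print(report, rate)
```

With so few rows the validation error is very noisy; on a small data set, k-fold cross-validation would usually give a steadier estimate than a single hold-out split.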