0% found this document useful (0 votes)
89 views10 pages

Dairy Cattle Classification with AI Methods

This document discusses using artificial neural networks and cluster analysis to classify dairy cattle based on various traits. It summarizes that artificial neural networks, which are modeled after human brain neurons, can perform classification, pattern recognition, and prediction. The study aims to classify dairy cattle into homogeneous groups based on traits like milk yield using artificial neural networks and k-means clustering. The results determined that artificial neural networks were more successful than k-means clustering at classifying the dairy cattle.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
89 views10 pages

Dairy Cattle Classification with AI Methods

This document discusses using artificial neural networks and cluster analysis to classify dairy cattle based on various traits. It summarizes that artificial neural networks, which are modeled after human brain neurons, can perform classification, pattern recognition, and prediction. The study aims to classify dairy cattle into homogeneous groups based on traits like milk yield using artificial neural networks and k-means clustering. The results determined that artificial neural networks were more successful than k-means clustering at classifying the dairy cattle.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Investigation of Dairy Cattle Traits by Using Artificial

Neural Networks and Cluster Analysis

Hlya Atl1, Asli Akilli2


1
Department of Biometry and Genetic, Faculty of Agriculture, Ege University,
Izmir, Turkey
2
Department of Biometry and Genetic, Faculty of Agriculture, Ahi Evran University, Kirsehir,
Turkey

Abstract. Artificial neural networks is a method which based on artificial


intelligence, has been emerged according to the working principles of the
human brain nerve cells. Especially in the modelling of nonlinear systems,
with the information learned through experience similarly to humans, it
provides classification, pattern recognition, optimization and allows the
realization of forward-looking forecasts. Artificial neural networks is very
successful method that has been the subject of many studies in different
disciplines. Artificial neural network studies performed in animal husbandry in
recent years, often located in the literature such as prediction of yield
characteristics and classification, animal breeding, quality assessment, disease
diagnosis. In this study, classification according to some traits of dairy cattle
using artificial neural networks and k-means method are aimed. Due to results
of the research, it is determined that artificial neural networks is more
successful than the k-means clustering method. The analysis of study was
performed using SPSS 20.0 statistical software package and Matlab R2011b
work programme.

Keywords: Classification, artificial neural network, dairy cattle.

1 Introduction

One of the key issues in the field of animal breeding studies is classification
differences and similarities measure with respect to each other to various
characteristics of the animals. The animals correct classification which made by the
yield property or exterior features provide great advantages manufacturer and
farmers.
Conscious breeding work, to be derived from animals is to maximize the level of
economic benefits expected in herd management, which plays an important role in
maintaining a profitable production. The aim of classification of animals is getting
homogeneous in itself, themselves a heterogeneous group. Some of the statistical
methods are known as cluster analysis, fuzzy clustering analysis, fuzzy logic, neural
networks, and data mining methods used to make classification or clustering for this
purpose in farming.

681
Artificial neural network method is one of the performing machine learning
methods based on artificial intelligence. Problems encountered in daily life vary
under different conditions. This parallel opposite problem, combined with advances
in technology, new methods of solution are produced. Artificial intelligence
technologies provide great benefits to the people across the problem way to find a
solution on the basis of the system created. Although it is fairly widely used,
especially engineering, medicine, agriculture areas etc. artificial neural network is
frequently used in recent years. Artificial neural networks is a method that have
emerged to move the workings of nerve cells in the human brain. People likewise, it
can perform functions such as optimization, prediction, classification, pattern
recognition using information learned through experience based on the data. It is
very successful method that located in linear and non-linear system the relationship
between input and output variables mathematical modelling, in many cases according
to conventional statistical methods with low error rates obtained from the analysis
results (Takma et al. 2012; Akll and Atl 2014).
In animal science artificial neural network is successfully applied such as
diagnosis of diseases such as mastitis and lameness (Yang ve ark. 1999; Cavero et al.
2008; Sun 2008; Hassan et al. 2009; Roush et al. 2001), in the prediction of the
forward-looking traits (Grzesiak et al. 2003; Salehi et al. 1998; Sanzogni and Kerr
2001; Kominakis et al. 2002; Hosseinia et al. 2007; Grgl 2012), animal breeding
studies (Shahinfar et al. 2012; Salehi et al. 1997; Grzesiak et al. 2010), in the
prediction of the nutrient content in manure (Chen et al. 2008; Chen et al. 2009;
Chen et al. 2009) and oestrus detection (Krieter et al. 2006).
In this study, it is aimed using artificial neural networks and k-means clustering
method in the framework of the identified variables divided into homogeneous
groups of dairy cattle.

2 Material and Method

2.1 Material

The study material consists of records related to 10,000 head of Holstein Friesian
dairy cattle for milk, fat and protein yield values with calving interval, age at first
calving, milking days and season variables that obtained between 1981-2000 years.
Data analysis was carried out using SPSS 20.0 statistical software package
programme and MATLAB (R2011b) programme.

2.2 Method

In this study, artificial neural network and k-means clustering method which is
one of multivariate statistical methods are used for classification according to the
specified characteristics of dairy cattle.
Cluster analysis provides in itself homogeneous and between them heterogeneous
group separation unable to obtain precise information about the studied data group

682
data sets taking into consideration the differences and similarities in that they showed
against each other. Cluster analysis is being implemented in three phases such as
creation of data matrix, selection of the clustering technique and discussion of the
significance of the results (Tatldil 1996; Alpar 2011).
In this study, k-means method is used which is one of non-hierarchical clustering
methods. In this technique, it is not necessary to determine the similarity and distance
matrix. However, it should be noted by the researchers of a possible cluster number
(zdamar 2010). Standardized data variables are used because of having different
scale and size value the variables examined in the study. The data is converted to a
standard value Z scores [ 0,1 ]. Formula used for this purpose is located in
Equation 1 (Kalayc, 2008).

(!!!)
= (1)
!

In k-means technique, observations divided into k clusters including the smallest


sum of squares within clusters as shown in Equation 2.

1 n min 2
Wn = xi a jn
n i =1 1 j k (2)

According to the rule the observation of classification takes place in the nearest
cluster. In the rule; each one x1, x2,...,xn determined number of variables for each
observation vector set, a1, a2.....akn each group were selected as cluster centres for
individuals (Tatldil, 1996).
The second method is a method used in the study neural networks. Artificial
neuron with a similar operating principle biological nerve cells form a neural
network together. Artificial neuron consists of different numbers of inputs and
outputs. Inputs pass through the activation function with a sum function to collect
information from the outside and generate output. The generated output is transferred
to another cell via links to have the neural network (ztemel, 2006; Negnevitsky,
2001). The comparative view of the biological and neural cells is located in Table 1.

Table 1. Biological neuron and artificial neuron

Biological
Explanation Artificial Neuron
Neuron
Neuron Nerve cell Processor element
Axons One-way transmission of information Outputs
Dendrites Receiving the information Inputs
Synapse Communication between nerve cells Weights

Artificial neural networks, is composed of input- output layer and one or different
number of hidden layers in between the layers. Received data from the input layer is
processed in the hidden layer and then sent to the output layer. Determining the
weight value concerning the training of the network that allows nerve cells in neural

683
networks is an important process. Severity of incoming data is decisive here.
Determining the proper amount of weight, it means to make accurate generalizations
to do right never seen through examples and thereby create new information. The
data of the input variables are converted to output values in the output layer being
associated with weight values (Negnevitsky, 2001; Baykal and Beyan, 2004;
ztemel, 2006; Russel and Norvig, 2010; Uur and Knac, 2006).
The studies related to neural networks have led to the emergence of different
network structures according to the different problem structure. In the literature, the
most commonly used networks single and multilayer perceptrons, vector quantization
models (LVQ), self organized map (SOM), adaptive resonance theory (ART),
Hopfield network, Elman network, radial basis function network (ztemel, 2006).
Artificial neural networks are classified as feed forward networks, and feedback
networks depending on the direction of information flow. Also it is classified
according to different learning strategies such as supervised, unsupervised and
reinforcement learning. View of the multi-layered artificial neural network is located
in Figure 1.

Fig. 1. Multilayer perceptron artificial neural network.

This neural network model used in this study is defined as multilayer perceptron.
Multi-layered sensor model operates according to the supervised learning strategy
and it has feed forward terms of the direction of flow of information structure. This
learning algorithm used in the multilayer perceptron model is designed to work as
back-propagation algorithm. Back propagation algorithm operation is performed in
two basic stages. The data of the input variables in the first stage is presented to the
neural network, in next stage after the processing of data weight in each layer based
on the error level is updated on the resulting output and in order to minimize error,
the spread of error is provided backwards from the output layer. Levenberg-
Marquardt algorithm, which is the back propagation algorithm, the problem structure
in this study to investigate suitability, is preferred because the learning error and low
running faster than other algorithms.
The number of processor elements in each layer and number of layers in the
network structure acts in substantially the operating performance of artificial neural
networks. In literature, a specific formula for determining the number of layers and
the processor element is not included. In this study, the numbers of processor

684
elements and layer were determined using detailed literature study for research and
trial and error with the heuristic.
Perform the clustering process with the lowest error which layer and the processor
element number is determined as the optimal number. Performance criteria used for
this, it was determined as the coefficient of determination (R2), the root mean square
error (RMSE), mean absolute deviation (MAD) and mean absolute percentage error
(MAPE). According to these criteria with high value R2 and error variance
expressing the low value of RMSE, MAD and MAPE statistics is the low value
indicated that the best fit of the model which is the subject of research. R2, RMSE,
MAD and MAPE for the calculation of statistical formulas is located in respectively

Equation 3-6. In equations n: number of records, Yi: observed value, Yi : predicted
value.

n
(Yi Y )2
R2 = i=1
n (3)
(Y Y ) i
2

i=1

n
(Y Y ) i
2

RMSE = i=1 (4)


n

n
Y i Yi
MAD = i=1 (5)
n


n
Yi Yi
Yi
i=1
MAPE = 100 (6)
n

The data are subjected to normalization pre-processing prior to analysis done by


the method of artificial neural networks. Normalization process network to reduce
the difficulties during the training is done in order to run faster in the training process
of the network and the balancing of the importance of the parameters involved in
research. In the literature different types of normalization techniques are located.
Researchers are different techniques depending on the problem structure may choose
(ztemel 2006; Xu et al, 2007; Jayalakshmi and Santhakumaran, 2011). In this study
data [0.1-0.9] range is scaled using the formula in the Equation 7 (Xu et al, 2007).

(P Pmin ) (7)
Pn = 0.1 + (0.9 0.1)
(Pmax Pmin )

685
In Equation 7, Pn contained in the data set represents the normalized value as a value
P. Pmax and Pmin , located in the input or output variable indicates the data set
having the highest and lowest values of the data. After the normalization process is
complete the minimum value of 0.1 and the maximum value of 0.9 is determined in
the data set. After network training is completed and the test output data obtained to
convert the actual scale of the output, the formula was used in the Equation 8. In next
step, made a comparison between the predicted value and actual value and
performance criteria is calculated.

(Pmax Pmin ) (8)


P = (Pn 0.1) + Pmin
(0.9 0.1)

P represents the converted value that is transformed pre-normalization.


Before starting the analysis with neural networks dataset completely randomized
on the 80% training and 20% testing data set is divided into two different groups
namely. 8,000 head of dairy cattle in the training data set and test data set consists of
records pertaining to 2,000 head of dairy cattle milk production traits variable.
In this study, a detailed literature review for research and using trial and error,
layers and the number of processor elements are determined. Prediction performs
operations with the lowest error layer and the number of processor elements has been
identified as the most appropriate number. Number of hidden layers is 1 and the
number of neuron in this layer is 7 are determined in the neural network designed
study. Also, convergence criteria and maximum number of iteration has been
recognized as 1x10-6 and 1000. Animals which have been the subject of research
were clustered before analysis by an expert. The methods performance was
evaluated with the expert opinion. The inputs of model are defined as first calving
age model, calving interval, milk with milking day data, the value related to fat and
protein yield. The output of the model refers to the group to which it belongs to the
subject of dairy cattle research.

3 Results and Discussion

In this study, artificial neural networks, k-means clustering method based on


10,000 head of dairy cattle milk production traits is divided into homogeneous
groups. In the result of analysis, animals were included in three different groups.
Analysis of the results of artificial neural networks is located in Table 2. According
to the results of highest classification success in the case where the number of
neurons is provided seven. Here, the neural network which has seven neurons is
observed that coefficient of determination of 99.9% and lower levels of error
statistics according to the number of other neurons.

686
Table 2. Artificial neural network test results working with different numbers of neuron.

Neuron
R2 RMSE MAD MAPE
Number
4 99.9% 1.7603 0.08115 0.0048
5 99.9% 1.7603 0,08114 0.0017
6 99.9% 1.7601 0,08113 0.0106
7 99.9% 1.7599 0,08108 0.0177
8 99.7% 1.7595 0,08104 0.0088
9 99.8% 1.7594 0,08103 0.0513
10 99.9% 1.7602 0.08114 0.0053

The neural network and k-means method is situated classification performance in


Table 3. The method used in the classification process seems to be quite successful
relating to dairy cattle. The results indicate that the more successful classification by
the k-means method of artificial neural network method.

Table 3. Artificial neural network and K-means results.

Statistics ANN K-means


Coefficient of
99.9% 94.9%
determination
RMSE 1.7599 1.7758
MAD 0,08108 0.0002
MAPE 0.0177 0.1048

Artificial neural network and k-means method is located performance values of


conformational display in Figure 2.
Classification works done by artificial neural networks are quite common in the
livestock area. Hassan et al. (2009) a neural network model used for the detection of
mastitis and analysis resulted in success in their study. A like, Yang et al. (2000) in
the estimation of clinical mastitis cases with milk production traits studied the
availability of artificial neural networks. The study of comparative review of
artificial neural networks, Grzesiak et al. (2003) in their study, multiple regression
and artificial neural networks methods have used to estimate the 305-day lactation
milk yield. Takma et al. (2012), lactation milk yield of Holstein were modelled using
multiple regression and neural network. In both studies it noted that the comparison
of the results artificial neural networks can be an alternative method to regression
analysis.

687
ANN

200,00%

150,00%

100,00%

50,00%

0,00%
Coefficient of RMSE MAD MAPE
Determination

K-Mean

200,00%

150,00%

100,00%

50,00%

0,00%
Coefficient of RMSE MAD MAPE
Determination

Fig. 2. Artificial neural network and K-means method results

In this study, artificial neural networks and k-means clustering methods were
examined in comparison to their classification and such as mentioned in recent
studies methods artificial neural networks that provide very good results.

4 Conclusion

In this study, k-means clustering method and the method of artificial neural
networks are used for classification of dairy cattle. When the k-means clustering
method and artificial neural network method of classification performance
comparison, the neural network is seen as a better fit. The survey results indicate
neural networks can be used as an alternative to clustering analysis of animal science
methods. Artificial neural network -the machine learning perform and one of the
artificial intelligence methods- provide a lower estimate convenience incorrect
classification according to the traditional method for researchers. Classification work
done by artificial neural networks in the field of animal science in the literature is
quite low. In the later stages of this study it is intended to be resources for individuals
to do research on this subject. Different neural network models and different input

688
variables with in studies are expected to give positive results in the classification of
animals.

References

1. Akll, A. and Atl, H. (2014) St srclnda yapay zeka teknolojisi: Bulank


mantk ve yapay sinir alar. Hayvansal retim, 55(1). p.39-45.
2. Alpar, R. (2011) Uygulamal ok deikenli istatistiksel yntemler. Ankara:
Detay Yaynclk.
3. Baykal, N. and Beyan, T. (2004) Bulank mantk ilke ve temelleri. Ankara:
Baklar Kitabevi.
4. Cavero, D., Tlle, KH., Henze, C., Buxad, C. and Krieter, J. (2008) Mastitis
detection in dairy cows by application of neural networks. Livestock Sci., 114.
p.280286.
5. Chen, LJ., Cui, LY., Xing, L. and Han, LJ. (2008) Prediction of the nutrient
content in dairy manure using artificial neural network modeling. J. Dairy Sci.,
91. p.4822-4829.
6. Chen, LJ, Xing, L. and Han, LJ. (2009) Quantitative determination of nutrient
content in poultry manure by near infrared spectroscopy based on artificial neural
networks. Poult Sci., 88(12). p.2496-503.
7. Gorgulu, O. (2012) Prediction of 305-day milk yield in Brown Swiss cattle using
artificial neural networks. South African J. Anim. Sci., 42(3). p.280-287.
8. Grzesiak, W., Lacroix, R., Wjcik, J. and Blaszczyk, P. (2003) A comparison of
neural network and multiple regression predictions for 305-day lactation yield
using partial lactation records. Can. J. Anim. Sci., 83. p.307-310.
9. Grzesiak, W., Zaborski, D., Sablik, P., ukiewicz, A., Dybus, A. and
Szatkowska, I. (2010) Detection of cows with insemination problems using
selected classification models. Comput. Electron. Agr., 74 (2). p.265273.
10. Hassan, KJ., Samarasinghe, S. and Lopez-Benavidest, MG. (2009) Use of neural
networks to detect minor and major pathogens that cause bovine mastitis. J. Dairy
Sci., 92. p.1493-1499.
11. Hosseinia, P., Edrisi, M., Edriss, MA. and Nilforooshan, M.A. (2007) Prediction
of second parity milk yield and fat percentage of dairy cows based on first parity
information using neural networks system. J. Appl. Sci., 7. p.3274-3279.
12. Jayalakshmi, T. and Santhakumaran, A. (2011) Statistical normalization and back
propagation for classification. IJCTE 2011, 3(1). p.89-93.
13. Kalayc, . (2008) SPSS uygulamal ok deikenli istatistik teknikler. Ankara:
Asil Yayn Datm.
14. Kominakis, AP., Abas, Z., Maltaris, I. and Rogdakis, E. (2002) A preliminary
study of the application of artificial neural networks to prediction of milk yield in
dairy sheep. Comput. Electron. Agric., 35. p.35-48.
15. Krieter, J., Stamer, E. and Junge, W. (2006) Control charts and neural networks
for oestrus detection in dairy cows. Lecture Notes in Informatics. Land- und
Ernhrungswirtschaft im Wandel -Aufgaben und Herausforderungen fr die

689
Agrar und Umweltinformatik, Referate der 26. GIL Jahrestagung, 68 March
2006, Potsdam, p.133136.
16. Negnevitsky, M. (2002) Artificial intelligence: A guide to intelligent systems.
Harlow: Pearson Education.
17. zdamar, K. (2010) Paket programlar ile istatistiksel veri analizi. Eskiehir: Kaan
Kitabevi.
18. ztemel, E. (2006) Yapay sinir alar. stanbul: Papatya Yaynclk.
19. Roush, WB., Wideman, RF., Jr Cahaner, A., Deeb, N. and Cravener, TL. (2001)
Minimal number of chicken daily growth velocities for artificial neural network
detection of pulmonary hypertension syndrome PHS. Poultry Sci., 80(3). p.254-
259.
20. Russel, S. and Norvig, P. (2010) Artificial intelligence: A Modern Approach.
Pearson Education.
21. Salehi, F., Lacroix, R., Yang, XZ. and Wade, KM. (1997) Effects of data
preprocessing on the performance of artificial neural networks for dairy yield
prediction and cow culling classification. Trans. ASAE, 40(3). p.839-846.
22. Salehi, F., Lacroix, R. and Wade, KM. (1998) Improving dairy yield predictions
through combined record classifiers and specialized artificial neural networks.
Comput. Electron. Agric., 20. p.199-213.
23. Sanzogni, L. and Kerr, D. (2001) Milk production estimates using feed forward
artificial neural networks. Comput. Electron. Agric., 32. p.21-30.
24. Shahinfar, S., Mehrabani-Yeganeh, H., Lucas, C., Kalhor, A., Kazemian, M. and
Weigel, K. A. (2012) Prediction of breeding values for dairy cattle using artificial
neural networks and neuro-fuzzy systems. Comput. Math. Methods Med.,
Volume 2012, Article ID 127130.
25. Sun, Z. (2008) Application of artificial neural networks in early detection of
mastitis from improved data collected on-line by robotic milking stations.
Dissertation, Lincoln University, New Zealand.
26. Takma, ., Atl, H. and Aksakal, V. (2012) oklu dorusal regresyon ve yapay
sinir a modellerinin laktasyon st verimlerine uyum yeteneklerinin
karlatrlmas. Kafkas niv. Vet. Fak. Derg., 18(6). p.941-944.
27. Tatldil, H. (1996) Uygulamal ok deikenli istatistiksel analiz. Ankara: Cem
Web Ofset.
28. Uur, A. and Knac, AC. (2006) Yapay zeka teknikleri ve yapay sinir alar
kullanlarak web sayfalarnn snflandrlmas. Inet-tr 2006, XI. Trkiyede
nternet Konferans, TOBB Ekonomi ve Teknoloji niversitesi, Ankara, 21-23
Aralk 2006.
29. Xu, L., Jiandong, X., Shizhong, W., Yongzhen, Z. and Rui L. (2007)
Optimization of heat treatment technique of high-vanadium high-speed steel
based on back-propagation neural networks. Mater Des., 28. p.14251432.
30. Yang, XZ., Lacroix, R. and Wade, KM. (1999) Neural detection of mastitis from
dairy herd improvement records. Trans. ASAE, 42(4). p.1063-1071.
31. Yang, XZ., Lacroix, R. and Wade, KM. (2000) Investigation into the production
and conformation traits associated with clinical mastitis using artificial neural
networks. Can. J. Anim. Sci., 80. p.415426.

690

You might also like