ITM Web of Conferences 23, 00037 (2018) https://doi.org/10.1051/itmconf/20182300037
XLVIII Seminar of Applied Mathematics

Kernel density estimation and its application

Stanisław Węglarczyk1,*
1 Cracow University of Technology, Institute of Water Management and Water Engineering, Warszawska 24, 31-115 Kraków, Poland

Abstract. Kernel density estimation is a technique for estimating a probability density function (pdf) that lets the user analyse the studied probability distribution better than a traditional histogram does. Unlike the histogram, the kernel technique produces a smooth estimate of the pdf, uses the locations of all sample points, and more convincingly suggests multimodality. In two-dimensional applications kernel estimation is even more advantageous, because the 2D histogram additionally requires the orientation of the 2D bins to be defined. Two concepts play a fundamental role in kernel estimation: the shape of the kernel function and the coefficient of smoothness, of which the latter is crucial to the method. Several real-life examples, for both univariate and bivariate applications, are shown.

1 Introduction

Of all the functions that describe a probability distribution, the probability density function (pdf) best shows how the whole 100% probability mass is distributed over the x-axis, i.e., over the values of a random variable X. However, the oldest empirical representation of the pdf, the histogram, is a highly subjective structure: its shape depends on the subjective choice of the number (or widths) of the class intervals (bins) into which the range of the sample is divided, and on the choice of the initial point (e.g., [1]). Several formulas have been proposed to guide this choice. Most of them relate the number of intervals to the sample size only [2–3]; others additionally include certain sample characteristics such as the standard deviation [4], interquartile range [5] or skewness [6].

Independently of the class selection method used, the histogram suffers from its original sin: data binning, which deprives the data of their individual locations, replacing them with the location of a bin (interval). This makes the histogram discontinuous, and flat within each bin.

Kernel estimation of the probability density function does not have these drawbacks. It produces (in most practical applications) a smooth empirical pdf based on the individual locations of all sample data. Such a pdf estimate seems to better represent the "true" pdf of a continuous variable.

Kernel estimation is not a new technique: it was originated more than half a century ago by Rosenblatt [7] and Parzen [8]. With the development of computer technology, the method has developed rapidly and extensively [4, 9–17].

The paper shows the advantages and disadvantages of the method, illustrating them with real-life examples for one- and two-dimensional applications.

Two concepts play a fundamental role in kernel estimation: the kernel function and the coefficient of smoothness.

2 Kernel density

Let the series $\{x_1, x_2, \ldots, x_n\}$ be an independent and identically distributed (iid) sample of n observations taken from a population X with an unknown probability density function $f(x)$. The kernel estimate $\hat{f}(x)$ of the original $f(x)$ assigns to each i-th sample data point $x_i$ a function $K(x_i, t)$, called a kernel function, in the following way [11]:

$$\hat{f}(t) = \frac{1}{n} \sum_{i=1}^{n} K(x_i, t) \qquad (1)$$

$K(x, t)$ is nonnegative and bounded for all x and t:

$$0 \le K(x, t) < \infty \quad \text{for all real } x, t \qquad (2)$$

and, for all real x,

$$\int_{-\infty}^{\infty} K(x, t)\,dt = 1. \qquad (3)$$

Property (3) ensures the required normalization of the kernel density estimate (1):

$$\int_{-\infty}^{\infty} \hat{f}(t)\,dt = \frac{1}{n} \sum_{i=1}^{n} \int_{-\infty}^{\infty} K(x_i, t)\,dt = 1. \qquad (4)$$

In other words, the kernel transforms the "sharp" (point) location of $x_i$ into an interval centred (symmetrically or not) around $x_i$.
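Equations (1) and (4) can be illustrated with a minimal Python sketch. It is not taken from the paper: the Gaussian shape of the kernel, the spread `width = 0.5`, the four sample values and the integration grid are all assumptions chosen only for illustration.

```python
import math

def gaussian_kernel(x_i, t, width=0.5):
    """K(x_i, t): normal pdf centred at the sample point x_i.

    `width` is an assumed spread, chosen only for this illustration.
    """
    z = (t - x_i) / width
    return math.exp(-0.5 * z * z) / (width * math.sqrt(2.0 * math.pi))

def kde(sample, t, kernel=gaussian_kernel):
    """Equation (1): the mean of the kernels attached to the sample points."""
    return sum(kernel(x_i, t) for x_i in sample) / len(sample)

sample = [1.0, 1.5, 3.0, 4.2]                   # hypothetical 4-element sample
step = 0.005
grid = [-4.0 + k * step for k in range(2801)]   # evaluation points on [-4, 10]
density = [kde(sample, t) for t in grid]

# Riemann-sum check of the normalization (4); it should be close to 1.
total = sum(density) * step
print(f"integral of the estimate: {total:.4f}")
```

Because every kernel integrates to one by property (3), the average of n kernels does as well, which is what the numerical check confirms up to discretization error.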

* Corresponding author: [email protected]
© The Authors, published by EDP Sciences. This is an open access article distributed under the terms of the Creative Commons Attribution License 4.0 (http://creativecommons.org/licenses/by/4.0/).

In most practical applications, kernel estimation uses a symmetric kernel function, although asymmetric functions have recently been increasingly used [18–20]. Figs. 1 and 2 illustrate the idea of kernel estimation for both cases.

Fig. 1. Construction of kernel density estimator (1) (continuous line) with a symmetric kernel (dashed lines) for a 4-element sample (vertical segments).

Fig. 2. Construction of kernel density estimator (1) (continuous line) with an asymmetric kernel (dashed lines) for the same 4-element sample as in Fig. 1.

Fig. 1 shows that the shape of a symmetric kernel is the same for all sample points, while Fig. 2 reveals that the shape of an asymmetric kernel varies with the position of the point.

The symmetry property allows the kernel function to be written in its most frequently used form:

$$K_{\mathrm{sym}}(x, t) = \frac{1}{h} K\left(\frac{x - t}{h}\right) \qquad (5)$$

where the parameter h, called the smoothing parameter, window width or bandwidth, governs the amount of smoothing applied to the sample (Fig. 3).

For symmetric kernel functions, the choice of the shape of the kernel function K(.) has rather little effect on the shape of the estimator [11, 21], whereas, as Fig. 3 shows, the influence of the smoothing parameter h is critical because it determines the amount of smoothing. Too small a value of h may cause the estimator to show insignificant details, while too large a value of h oversmooths the information contained in the sample, which may in turn mask some important characteristics of f(x), e.g. multimodality (cf. Fig. 3). A certain compromise is needed.

Fig. 3. The value of the smoothing parameter h influences the shape of the resulting kernel density. The 4-element sample (vertical segments) is the same as in Figs. 1 and 2.

Many types of kernel function can be found in the relevant literature. Examples of symmetric kernels are presented in Table 1 and in Fig. 4, while Table 2 shows asymmetric ones.

Table 1. Examples of symmetric kernel functions [11].

Kernel: Definition
Epanechnikov: $K(t) = \begin{cases} \frac{3}{4\sqrt{5}}\left(1 - \frac{1}{5}t^{2}\right) & \text{for } |t| < \sqrt{5} \\ 0 & \text{for } |t| \ge \sqrt{5} \end{cases}$
Biweight: $K(t) = \begin{cases} \frac{15}{16}\left(1 - t^{2}\right)^{2} & \text{for } |t| < 1 \\ 0 & \text{for } |t| \ge 1 \end{cases}$
Triangular: $K(t) = \begin{cases} 1 - |t| & \text{for } |t| < 1 \\ 0 & \text{for } |t| \ge 1 \end{cases}$
Gaussian: $K(t) = \frac{1}{\sqrt{2\pi}}\, e^{-t^{2}/2}$
Rectangular: $K(t) = \begin{cases} \frac{1}{2} & \text{for } |t| < 1 \\ 0 & \text{for } |t| \ge 1 \end{cases}$

Table 2. Examples of asymmetric kernel functions. Symbol b denotes the smoothing parameter.

Kernel: Definition
Gamma 1 [18]: $K_{GAM1}(x, b; t) = \dfrac{t^{x/b}\, e^{-t/b}}{b^{x/b+1}\, \Gamma(x/b + 1)}$
Gamma 2 [18]: $K_{GAM2}(\rho_b(x), b; t) = \dfrac{t^{\rho_b(x)-1}\, e^{-t/b}}{b^{\rho_b(x)}\, \Gamma(\rho_b(x))}$, where $\rho_b(x) = \begin{cases} x/b & \text{for } x \ge 2b \\ \frac{1}{4}(x/b)^{2} + 1 & \text{for } x \in [0, 2b) \end{cases}$
Inverse Gaussian [19]: $K_{IG}(x, b; t) = \dfrac{1}{\sqrt{2\pi b t^{3}}} \exp\left[-\dfrac{1}{2bx}\left(\dfrac{t}{x} - 2 + \dfrac{x}{t}\right)\right]$
Reciprocal Inverse Gaussian [19]: $K_{RIG}(x, b; t) = \dfrac{1}{\sqrt{2\pi b t}} \exp\left[-\dfrac{x-b}{2b}\left(\dfrac{t}{x-b} - 2 + \dfrac{x-b}{t}\right)\right]$
Lognormal [20]: $K_{LN}(x, b; t) = \dfrac{1}{\sqrt{8\pi \ln(1+b)}\; t} \exp\left[-\dfrac{(\ln t - \ln x)^{2}}{8\ln(1+b)}\right]$
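As a hedged illustration (not part of the original paper), the symmetric kernels of Table 1 and the scaled form (5) can be written directly in Python. The helper `integrate`, its integration limits and the example values `x = 2.0`, `h = 0.3` are assumptions made for this sketch; the check only confirms numerically that each kernel satisfies the normalization (3).

```python
import math

SQRT5 = math.sqrt(5.0)

def epanechnikov(t):
    return 3.0 / (4.0 * SQRT5) * (1.0 - t * t / 5.0) if abs(t) < SQRT5 else 0.0

def biweight(t):
    return 15.0 / 16.0 * (1.0 - t * t) ** 2 if abs(t) < 1.0 else 0.0

def triangular(t):
    return 1.0 - abs(t) if abs(t) < 1.0 else 0.0

def gaussian(t):
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)

def rectangular(t):
    return 0.5 if abs(t) < 1.0 else 0.0

KERNELS = {
    "Epanechnikov": epanechnikov,
    "Biweight": biweight,
    "Triangular": triangular,
    "Gaussian": gaussian,
    "Rectangular": rectangular,
}

def k_sym(x, t, h, kernel=gaussian):
    """Equation (5): kernel centred at x and scaled by the bandwidth h."""
    return kernel((x - t) / h) / h

def integrate(func, lo=-10.0, hi=10.0, steps=20000):
    """Midpoint rule; the limits are assumed wide enough for these kernels."""
    width = (hi - lo) / steps
    return sum(func(lo + (k + 0.5) * width) for k in range(steps)) * width

# Each kernel of Table 1 should integrate to (approximately) one.
areas = {name: integrate(kernel) for name, kernel in KERNELS.items()}

# Scaling by h as in (5) changes the spread but not the area.
scaled_area = integrate(lambda t: k_sym(2.0, t, h=0.3, kernel=biweight))

for name, area in areas.items():
    print(f"{name:13s} area = {area:.4f}")
print(f"scaled biweight area = {scaled_area:.4f}")
```

The division by h in (5) is what keeps the area equal to one when the kernel is narrowed or widened, so the estimate (1) remains a density for any bandwidth.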


Fig. 4. Shapes of the symmetric kernels defined in Table 1.

Fig. 5 illustrates how the kernel type (cf. Table 1) used to estimate the pdf influences the kernel pdf estimate. Triangular and rectangular kernels (especially the latter) produce many local maxima and are therefore not recommended. The biweight kernel has shorter support than the Epanechnikov one, so it reveals more detail and more clearly suggests two basic modes. The Gaussian kernel, distributed over the whole x-axis, produces the smoothest estimate, which is probably why it is the most frequently used kernel.

Fig. 5. Different symmetric kernel functions applied to a sample of 45 standardized annual maximum flows (1961–1995) of the Odra river recorded at the Racibórz-Miedonia gauge station (data source: [22]).

The univariate case can be formally extended to the multivariate case with ease [23]. However, its illustrative (graphical) power works well for the bivariate case only. The most frequently used bivariate kernel function is symmetric:

$$\hat{f}(x, y) = \frac{1}{n h_x h_y} \sum_{i=1}^{n} K\left(\frac{x_i - x}{h_x}, \frac{y_i - y}{h_y}\right) \qquad (6)$$

where $\{x_i, y_i\}$, i = 1, 2, ..., n, is a sample, and $h_x$ and $h_y$ are smoothing coefficients. Multivariate counterparts of the univariate kernel functions listed in Table 1 are available, e.g., the multivariate Epanechnikov kernel or the multivariate Gaussian kernel [11].

Two versions of (6) are used in practice: the product kernel estimator and the radial kernel estimator [24].

In its most popular form, the product kernel estimator may be written as follows:

$$\hat{f}(x, y) = \frac{1}{n h_x h_y} \sum_{i=1}^{n} K\left(\frac{x_i - x}{h_x}\right) K\left(\frac{y_i - y}{h_y}\right) \qquad (7)$$

The radial kernel estimator is based on the Euclidean distance between an arbitrary point $\{x, y\}$ and a sample point $\{x_i, y_i\}$, i = 1, 2, ..., n:

$$\hat{f}(x, y) = \frac{1}{n h_x h_y} \sum_{i=1}^{n} K\left(\sqrt{\left(\frac{x_i - x}{h_x}\right)^{2} + \left(\frac{y_i - y}{h_y}\right)^{2}}\right) \qquad (8)$$

In practice, the product kernel estimator is used most often.

The advantage of the multivariate kernel pdf over the multivariate histogram is even greater than in the univariate case. This is because an additional subjective requirement arises: the user has to decide on the orientation of a two-dimensional bin, which may considerably influence the final shape of the histogram.

3 Measures of discrepancy between the kernel density estimator $\hat{f}$ and the true density f

Every estimator $\hat{f}(x)$ differs from the original f(x) with 100% probability. In order to build a method producing an estimator $\hat{f}(x)$ that is as close to f(x) as possible, certain measures must be defined to evaluate this discrepancy.

For each single x, the difference between the "true" density function f(x) and its estimator $\hat{f}(x)$ can be measured with the mean squared error, $\mathrm{MSE}_x$ [11]:

$$\mathrm{MSE}_x(\hat{f}) = E\left[\left(\hat{f}(x) - f(x)\right)^{2}\right] \qquad (9)$$

which, after simple transformations, can be presented as follows:

$$\mathrm{MSE}_x(\hat{f}) = \left(E\hat{f}(x) - f(x)\right)^{2} + \operatorname{var}\hat{f}(x) = \left(\operatorname{bias}\hat{f}(x)\right)^{2} + \operatorname{var}\hat{f}(x) \qquad (10)$$

that is, $\mathrm{MSE}_x$ is the sum of the squared bias and the variance of $\hat{f}(x)$ at x. Reducing the bias causes the variance to increase and vice versa, so a trade-off between these terms is needed.

$\mathrm{MSE}_x$ is a local measure. Integrating $\mathrm{MSE}_x$ over all x gives a global measure of the conformity of $\hat{f}(x)$ with f(x), called the mean integrated square error, MISE [11]:

$$\mathrm{MISE}(\hat{f}) = \int_{-\infty}^{\infty} \mathrm{MSE}_x(\hat{f})\,dx = \int_{-\infty}^{\infty} \left(\operatorname{bias}\hat{f}(x)\right)^{2} dx + \int_{-\infty}^{\infty} \operatorname{var}\hat{f}(x)\,dx \qquad (11)$$

MISE is one of the measures used to estimate the smoothing parameter.

In practice, an approximate version of MISE, called AMISE (asymptotic MISE), is also used. It is obtained by expanding MISE into a Taylor series and keeping only the leading terms [25, 26].

The integrated square error, ISE, is a measure intermediate between MISE and MSE:

$$\mathrm{ISE}(\hat{f}) = \int_{-\infty}^{\infty} \left(\hat{f}(x) - f(x)\right)^{2} dx \qquad (12)$$

which is also a discrepancy measure used to estimate the magnitude of the smoothing parameter.

4 Methods for calculating optimum value of smoothing parameter

The choice of the optimal smoothing parameter is based, among other things, on formulas that minimize the criterion functions discussed above, mainly ISE [27], MISE [28] and AMISE [11, 15, 29–32].

Many other methods for calculating the smoothing parameter are available in the relevant literature; many of them are also available in statistical software. Two methods are described below: one for a symmetric kernel function (Gaussian), the other for any kernel function.

4.1 Rule-of-thumb method

The rule-of-thumb method is based on the asymptotic mean integrated square error, AMISE, under the assumption that both the kernel function and the true distribution are normal. Under this assumption Silverman [11] obtained the following value of the smoothing parameter h:

$$h = 1.06\,\hat{\sigma}\,n^{-1/5} \qquad (13)$$

where $\hat{\sigma}$ is the sample standard deviation and n is the sample size.

To make the estimator more robust against outliers, the sample interquartile range IQR may be used instead [11]:

$$h = 0.79\,\mathrm{IQR}\,n^{-1/5} \qquad (14)$$

Silverman [11] notes that the value (13) oversmooths non-unimodal distributions and, as one of the remedies, proposes a slightly reduced value of the smoothing parameter:

$$h = 0.9\,\min\left(\hat{\sigma}, \frac{\mathrm{IQR}}{1.34}\right) n^{-1/5} \qquad (15)$$

The value (15) is widely used in practice, where it is referred to as Silverman's bandwidth or (Silverman's) rule of thumb, and it is used in most of the remainder of the paper.

4.2 Least squares cross validation method (LSCV)

The least squares cross validation method (LSCV) of selecting the smoothing parameter is a very popular technique [11, 30, 33–38].

LSCV uses the integrated square error, ISE (12), which can be expressed in the following form [11]:

$$\mathrm{ISE}(h) = \int_{-\infty}^{\infty} \left(\hat{f}(x) - f(x)\right)^{2} dx = \int_{-\infty}^{\infty} \hat{f}^{2}(x)\,dx - 2\int_{-\infty}^{\infty} \hat{f}(x) f(x)\,dx + \int_{-\infty}^{\infty} f^{2}(x)\,dx \qquad (16)$$

The last term of expression (16) does not depend on the estimator $\hat{f}(x)$ (it is a constant), so choosing the smoothing parameter in the sense of minimizing ISE corresponds to choosing the h that minimizes the function

$$R(\hat{f}) = \int_{-\infty}^{\infty} \hat{f}^{2}(x)\,dx - 2\int_{-\infty}^{\infty} \hat{f}(x) f(x)\,dx \qquad (17)$$

To estimate the second term of (17), a leave-one-out density estimator, $\hat{f}_{-i}(x)$, is used:

$$\hat{f}_{-i}(x) = \frac{1}{n-1} \sum_{j \ne i} K(x, x_j) \qquad (18)$$

which is an estimate of the density function calculated using all sample values except $x_i$. The resulting form of the LSCV criterion function is

$$\mathrm{LSCV}(h) = \int_{-\infty}^{\infty} \hat{f}^{2}(x)\,dx - \frac{2}{n} \sum_{i} \hat{f}_{-i}(x_i) \qquad (19)$$

The optimal smoothing parameter $h_{LSCV}$ is the value at which the function LSCV(h) attains its minimum. The final form of the LSCV function (19), applicable to both symmetric and asymmetric kernels, is:

$$\mathrm{LSCV}(h) = \frac{1}{n^{2}} \sum_{i,j} \int_{-\infty}^{\infty} K(x, x_i) K(x, x_j)\,dx - \frac{2}{n(n-1)} \sum_{i} \sum_{j \ne i} K(x_i, x_j) \qquad (20)$$

Least squares cross-validation is also referred to as unbiased cross-validation [26].

Unfortunately, the LSCV method also has drawbacks: the variance of the obtained smoothing


parameters calculated for samples drawn from the same distribution is large [30]. The LSCV(h) function may have several minima, often spurious and lying far on the side of too little smoothing [39]; sometimes LSCV(h) has no minimum at all [14, 30].

There are other versions of the cross-validation method, e.g. biased cross-validation (BCV) or smoothed cross-validation (SCV), and other methods of obtaining the optimum smoothing coefficient (e.g., [40]). Some resulting examples are shown in Fig. 6.
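The two bandwidth selectors of Sections 4.1 and 4.2 can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the implementation used for Fig. 6: the sample is synthetic (a seeded two-component normal mixture), the quartiles come from `statistics.quantiles`, which is only one of several quartile conventions, the kernel is Gaussian so that the integral in (19) has a closed form, and the minimum of LSCV(h) is searched on an assumed grid of candidate bandwidths.

```python
import math
import random
import statistics

def silverman_bandwidths(sample):
    """Return the bandwidths given by equations (13), (14) and (15)."""
    n = len(sample)
    sigma = statistics.stdev(sample)
    q1, _, q3 = statistics.quantiles(sample, n=4)  # one quartile convention
    iqr = q3 - q1
    factor = n ** (-1.0 / 5.0)
    return (1.06 * sigma * factor,
            0.79 * iqr * factor,
            0.9 * min(sigma, iqr / 1.34) * factor)

def lscv(sample, h):
    """Equation (19) for a Gaussian kernel with bandwidth h.

    The integral of the squared estimate is evaluated in closed form:
    two Gaussian kernels of width h convolve to a Gaussian of width h*sqrt(2).
    """
    n = len(sample)
    integral = 0.0
    leave_one_out = 0.0
    for i, x_i in enumerate(sample):
        for j, x_j in enumerate(sample):
            d = x_i - x_j
            integral += math.exp(-d * d / (4.0 * h * h)) / (2.0 * h * math.sqrt(math.pi))
            if i != j:
                leave_one_out += (math.exp(-d * d / (2.0 * h * h))
                                  / (h * math.sqrt(2.0 * math.pi)))
    return integral / n**2 - 2.0 * leave_one_out / (n * (n - 1))

# Synthetic bimodal sample (an assumption, used only for illustration).
rng = random.Random(0)
sample = ([rng.gauss(0.0, 1.0) for _ in range(30)]
          + [rng.gauss(4.0, 0.7) for _ in range(20)])

h13, h14, h15 = silverman_bandwidths(sample)

# Grid search for the minimum of LSCV(h); the grid limits are assumed.
candidates = [0.05 * k for k in range(2, 61)]      # h from 0.10 to 3.00
h_lscv = min(candidates, key=lambda h: lscv(sample, h))

print(f"h(13)={h13:.3f}  h(14)={h14:.3f}  h(15)={h15:.3f}  h_LSCV={h_lscv:.2f}")
```

A grid search returns the global minimum over the candidates only; as noted above, LSCV(h) may have several local minima, so inspecting the whole curve is safer than trusting a single value.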

Fig. 6. Different methods for kernel smoothing coefficient estimation available in Wolfram Mathematica 11.1 applied to the 1961–1995 series of standardized annual maximum flows of Odra river recorded at the Racibórz-Miedonia gauge station (data source: [22]).

Fig. 7. Kernel density estimates for four 45-year time series of standardized annual maximum flows (1961–1995) of given River/Gauging station (data source: [22]).

Fig. 8. Kernel density estimates for four 32-year time series of standardized annual minimum flows (1983–2015) of given River/Gauging station (data source: [41]).

5 Kernel density in practice

5.1 The univariate case

Figs. 7 through 9 contain several kernel pdf estimates obtained for maximum and minimum annual flows of certain rivers and for maximum annual precipitation in Poland. Apart from the pleasing smoothness, which contrasts with the shape of a histogram, they show a very attractive characteristic of kernel estimation: its ability to suggest multimodality more convincingly than the histogram does.

Fig. 9. Kernel density estimates for four 30-year time series of standardized annual maximum precipitation (1984–2013) at given Precipitation station/River basin (data source: [42]).

Of course, the multimodal shape of a pdf estimate does not prove the existence of real multimodality. It is, however, a sign of possible non-homogeneity that should be examined by analysing the mechanism generating the data. Some attempts at statistical testing of multimodality are described in [11];


however, as Silverman ([11], p. 141) concludes: "It may be futile to expect very high power from procedures aimed at such broad hypotheses as unimodality and multimodality". Nevertheless, kernel estimation is a good method for the initial stage of a planned study of a probability distribution.

When the variable under study is nonnegative, the kernel estimate may exhibit an undesirable feature: probability leakage below zero. It occurs when part of the sample lies near zero and the smoothing coefficient is large enough for a considerable amount of probability to cross zero. Four such cases are presented in Fig. 10.

If the amount of probability leakage cannot be disregarded, one remedy is to take logarithms of the data and apply kernel estimation to the transformed data. If the pdf of the logarithmized data is $\hat{g}(x)$, the following back-transformation should be used:

$$\hat{f}(x) = \frac{1}{x}\,\hat{g}(\ln x) \qquad (21)$$

Fig. 11(b) shows the result. The leakage has been removed; unfortunately, the second mode has disappeared, although a certain suggestion of non-unimodality remains visible in the heaviness of the right tail.

Another remedy is to use an asymmetric kernel, as shown in Fig. 11(c). This approach reproduces the bimodality revealed in Fig. 11(a). In terms of the cumulative distribution function (Fig. 11(d)), the log transformation and the asymmetric kernel approach are almost equivalent.
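Both remedies can be sketched in a few lines of Python. The sketch below is illustrative only and rests on several assumptions: the ten positive sample values are hypothetical, the bandwidths (`h = 0.3` on the original scale, `h_log = 0.5` on the log scale, `b = 0.06` for the gamma kernel) are picked by hand, the symmetric kernel is Gaussian, and the gamma kernel is KGAM1 from Table 2 used in estimator (1) with x taken as the sample point. The helper names are not from the paper.

```python
import math

def normal_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def normal_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def leakage_below_zero(sample, h):
    """Probability mass that a Gaussian-kernel estimate places on t < 0."""
    return sum(normal_cdf(-x_i / h) for x_i in sample) / len(sample)

def kde_log_transform(sample, h_log, t):
    """Equation (21): Gaussian KDE of ln(x_i), mapped back to the original scale."""
    if t <= 0.0:
        return 0.0
    g_hat = sum(normal_pdf((math.log(t) - math.log(x_i)) / h_log)
                for x_i in sample) / (len(sample) * h_log)
    return g_hat / t

def gamma1_kernel(x, b, t):
    """K_GAM1(x, b; t) from Table 2, computed in logs for numerical stability."""
    if t <= 0.0:
        return 0.0
    shape = x / b + 1.0
    log_k = ((shape - 1.0) * math.log(t) - t / b
             - shape * math.log(b) - math.lgamma(shape))
    return math.exp(log_k)

def kde_gamma1(sample, b, t):
    """Estimator (1) with the asymmetric kernel K_GAM1."""
    return sum(gamma1_kernel(x_i, b, t) for x_i in sample) / len(sample)

def integrate(func, lo, hi, steps=20000):
    """Midpoint rule on [lo, hi]."""
    width = (hi - lo) / steps
    return sum(func(lo + (k + 0.5) * width) for k in range(steps)) * width

# Hypothetical positive sample with several values close to zero.
sample = [0.05, 0.1, 0.15, 0.3, 0.4, 0.6, 0.9, 1.4, 1.6, 2.1]

leak = leakage_below_zero(sample, h=0.3)
area_log = integrate(lambda t: kde_log_transform(sample, 0.5, t), 0.0, 20.0)
area_gamma = integrate(lambda t: kde_gamma1(sample, 0.06, t), 0.0, 20.0)

print(f"leakage of the Gaussian estimate below zero: {leak:.3f}")
print(f"mass on t > 0, log-transform estimate: {area_log:.3f}")
print(f"mass on t > 0, gamma-kernel estimate:  {area_gamma:.3f}")
```

Both alternative estimates are zero for t ≤ 0 by construction, so all of their probability mass stays on the positive half-axis, whereas the symmetric Gaussian estimate loses the fraction reported as the leakage.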

Fig. 10. Probability leakage below zero (marked dark blue) in kernel density estimates for time series of standardized annual maximum flows (1961–1995), top two graphs, and annual minimum flows (1983–2015), bottom two graphs, of given River/Gauging station (data source: [22]). The numbers within the graphs show the magnitude of the probability leakage.

5.2 The bivariate case and some general remarks on the multivariate case

Formally, the univariate case can easily be extended to the multivariate one, as exemplified by equations (7) and (8) for the bivariate kernel. Fig. 12 uses equation (7) to illustrate how the relation between the two variables studied evolves over the year. 2D kernel pdf graphics may help the user to divide the sample into subsamples for which a non-statistical (cause-and-effect) confirmation may be found. Such graphics are informative when a sample contains many identical data points, which are not visible in an x-y plot.

Unfortunately, graphical illustration or interpretation for more than two variables is difficult, if not impossible. Moreover, the sample size necessary to preserve an accuracy similar to that of the one-dimensional case grows rapidly with growing dimension, a problem known as the 'curse of dimensionality'.

Minimizing the effect of the curse of dimensionality requires not only sufficient data but also careful data preparation [23]. This may involve a suitable transformation of the marginal variables in order to reduce large skewness or heavy tails, checking whether the data are of full rank, and even, if the data do not have many significant digits, carefully blurring the data [23].
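The product kernel estimator (7) used for Fig. 12 can be sketched in Python as follows. The sketch is illustrative only: the five data pairs and the bandwidths `hx`, `hy` are hypothetical, the Gaussian kernel is an assumed choice, and the grid used for the normalization check is picked by hand.

```python
import math

def gaussian(u):
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def product_kde(points, hx, hy, x, y, kernel=gaussian):
    """Equation (7): product kernel estimate of the density at (x, y)."""
    total = sum(kernel((x_i - x) / hx) * kernel((y_i - y) / hy)
                for x_i, y_i in points)
    return total / (len(points) * hx * hy)

# Hypothetical bivariate sample and hand-picked smoothing coefficients.
points = [(0.0, 0.0), (0.5, 0.2), (1.0, 1.1), (3.0, 2.8), (3.2, 3.1)]
hx, hy = 0.6, 0.5

# Evaluate on a regular grid covering [-4, 7] x [-4, 7].
step = 0.05
axis = [-4.0 + k * step for k in range(221)]
surface = [[product_kde(points, hx, hy, x, y) for x in axis] for y in axis]

# Riemann-sum check that the estimate integrates to about one.
volume = sum(sum(row) for row in surface) * step * step
print(f"volume under the estimated surface: {volume:.4f}")
```

The nested list `surface` is what a contour or image plot would display; rows correspond to y values and columns to x values.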

Fig. 11. Removing probability leakage below zero (4.9%, marked dark blue) in the kernel density estimate (a) of the 1984–2015 time series of standardized annual maximum flows; (b) logarithmized pdf added; (c) asymmetric gamma kernel pdf added (cf. Table 2, kernel KGAM1, b = 0.06); (d) three cumulative distribution functions (data source: [41]).

6 Summary and conclusions

When compared with the commonly used histogram, the kernel density estimator shows several advantages.
1. It is a smooth curve and thus better exhibits the details of the pdf, in some cases suggesting non-unimodality.
2. It uses the locations of all sample points, so it better reveals the information contained in the sample.
3. It more convincingly suggests multimodality.
4. The bias of the kernel estimator is of one order better than that of a histogram estimator [26].
5. Compared with the 1D application, 2D kernel applications are even better, as the 2D histogram


requires additionally the specification of the orientation of the bins, which adds to the subjectivity of the histogram.

It should be remembered, however, that the value of the smoothing coefficient is to some extent a subjective estimate.

Fig. 12. Bivariate kernel density estimates for a two-dimensional random variable (monthly maximum temperature, tmx, and monthly sunshine duration, S) in Oxford, UK, 1853–2017 (data source: [43]).

References

1. S. Węglarczyk, M. Kulig, Wiad. IMGW XIV(XLV), Z.2, 59–69 (2001)
2. H.A. Sturges, J. Amer. Statist. Assoc. 21, 65–66 (1926)
3. C.E.P. Brooks, N. Carruthers, Handbook of statistical methods in meteorology (HM Stationery Office, London, 1953)
4. D.W. Scott, Biometrika 66, 605–610 (1979)
5. D. Freedman, P. Diaconis, Zeit. Wahr. ver. Geb. 57(4), 453–476 (1981)
6. D.P. Doane, American Statistician 30(4), 181–183 (1976)
7. M. Rosenblatt, Annals of Mathematical Statistics 27, 832–837 (1956)
8. E. Parzen, Annals of Mathematical Statistics 33, 1065–1076 (1962)
9. A. Bowman, Journal of Stat. Comp. Simul. 21, 313–327 (1985)
10. G.R. Terrell, D.W. Scott, Journal of the American Statistical Association 80(389), 209–214 (1985)
11. B.W. Silverman, Density estimation for statistics and data analysis (Chapman and Hall, London, 1986)
12. L. Devroye, Annales de l'Institut Henri Poincaré 25, 533–580 (1989)
13. G.R. Terrell, Journal of the American Statistical Association 85, 470–477 (1990)
14. S.J. Sheather, Computational Statistics 7, 225–250 (1992)
15. J.S. Marron, M.P. Wand, Annals of Statistics 20, 712–736 (1992)
16. L. Devroye, Statistics and Probability Letters 20, 183–188 (1994)
17. L. Devroye, A. Krzyżak, Journal of Multivariate Analysis 82, 88–110 (2002)
18. S.X. Chen, Annals of the Institute of Statistical Mathematics 52(3), 471–480 (2000)
19. O. Scaillet, Density estimation using inverse and reciprocal inverse gaussian kernels (IRES Discussion Paper 17, Université Catholique de Louvain, 2001)
20. X. Jin, J. Kawczak, Annals of Economics and Finance 4, 103–124 (2003)
21. P. Hall, J.S. Marron, The Annals of Statistics 15(1), 163–181 (1987)
22. B. Fal, E. Bogdanowicz, W. Czernuszenko, I. Dobrzyńska, A. Koczyńska, Przepływy charakterystyczne głównych rzek polskich w latach 1951–1995 (in Polish: Characteristic flows of main rivers in Poland in 1951–1995) (Materiały Badawcze, Seria: Hydrologia i Oceanologia 26, Instytut Meteorologii i Gospodarki Wodnej, Warszawa, 2000)


23. D.W. Scott, Multivariate Density Estimation, Theory, Practice, and Visualization (John Wiley and Sons, Inc., 1992)
24. W. Härdle, M. Müller, S. Sperlich, A. Werwatz,
Nonparametric and Semiparametric Models
(Springer, 2004)
25. T. Ledl, Austrian Journal of Statistics 33(3), 267–279
(2004)
26. S.J. Sheather, Statist. Sci. 19(4), 588–597 (2004)
27. S.R. Sain, Adaptive kernel density estimation (PhD diss., Rice University, http://hdl.handle.net/1911/16743, 1994, accessed June 2018)
28. J.S. Marron, D. Nolan, Statistics and Probability
Letters 7, 195–199 (1989)
29. A. Bowman, P. Hall, T. Prvan, Biometrika 85(4),
799–808 (1998)
30. B.A. Turlach, Bandwidth selection in kernel density estimation: A Review (Discussion Paper, C.O.R.E. and Institut de Statistique, Université Catholique de Louvain, Louvain-la-Neuve, Belgium, 1993)
31. W. Feluch, Wybrane metody jądrowej estymacji funkcji gęstości prawdopodobieństwa i regresji w hydrologii (in Polish: Selected methods for kernel estimation of probability density function and regression in hydrology) (Prace Naukowe Politechniki Warszawskiej 15, Oficyna Wydawnicza Politechniki Warszawskiej, Warszawa, 1994)
32. E. Choi, P. Hall, Biometrika 86(4), 941–947 (1999)
33. P. Hall, S.J. Sheather, M.C. Jones, J.S. Marron,
Biometrika 78(2), 263–269 (1991)
34. S.T. Chiu, Statistica Sinica 6, 129–145 (1996)
35. S.T. Chiu, The Annals of Statistics 19(4), 1883–1905
(1991)
36. S.T. Chiu, Biometrika 79(4), 771–782 (1992)
37. M. Rudemo, Scand. Journal of Statistics 9, 65–78
(1982)
38. A. Bowman, Biometrika 71, 353–360 (1984)
39. P. Hall, J.S. Marron, Journal of the Royal Statistical
Society B(53), 245–252 (1991)
40. A. Michalski, Meteorology Hydrology and Water
Management 4(1), 40–46 (2016)
41. Roczniki Hydrologiczne 1984–2015, IMGW-PIB
(Institute of Meteorology and Water Management -
National Research Institute), CD-ROM
42. IMGW-PIB (Institute of Meteorology and Water
Management - National Research Institute)
43. www.metoffice.gov.uk/pub/data/weather/uk/
climate/stationdata/oxforddata.txt (accessed May
2018)

No ratings yet
Rheological Properties of Cemented Paste Backfill and The Construction of A Prediction Model
17 pages
Mining Cave Block in Level Undercut The and Boundary Stope Optimizing
No ratings yet
Mining Cave Block in Level Undercut The and Boundary Stope Optimizing
13 pages
Effects of Groundwater Abstraction and Desalination Brine Injection On A Middle Miocene Aquifer of The El-Dabaa Area, Northern Coast of Egypt
No ratings yet
Effects of Groundwater Abstraction and Desalination Brine Injection On A Middle Miocene Aquifer of The El-Dabaa Area, Northern Coast of Egypt
7 pages
On The Potential of Ground Penetrating Radar To Help Rock Fall Hazard Assessment A Case Study of A Limestone Slab, Gorges de La Bourne (French Alps)
No ratings yet
On The Potential of Ground Penetrating Radar To Help Rock Fall Hazard Assessment A Case Study of A Limestone Slab, Gorges de La Bourne (French Alps)
14 pages
Creep Behavior of Rocks and Its Application To The Long-Term Stability of Deep Rock Tunnels
No ratings yet
Creep Behavior of Rocks and Its Application To The Long-Term Stability of Deep Rock Tunnels
35 pages
Limiting The Influence of Extreme Grades in Ordinary Kriged Estimates
No ratings yet
Limiting The Influence of Extreme Grades in Ordinary Kriged Estimates
11 pages
Defining Spatial Entropy From Multivariate Distributions of Co-Occurrences
No ratings yet
Defining Spatial Entropy From Multivariate Distributions of Co-Occurrences
14 pages
Construction and Optimization Method of The Open-Pit Mine DEM Based On The Oblique Photogrammetry Generated DSM
No ratings yet
Construction and Optimization Method of The Open-Pit Mine DEM Based On The Oblique Photogrammetry Generated DSM
11 pages
Day-to-Day Evolution of Departure Time Choice in Stochastic Capacity Bottleneck Models With Bounded Rationality and Various Information Perceptions
No ratings yet
Day-to-Day Evolution of Departure Time Choice in Stochastic Capacity Bottleneck Models With Bounded Rationality and Various Information Perceptions
25 pages
Stope Optimization With Convexity Constraints
No ratings yet
Stope Optimization With Convexity Constraints
20 pages
Sattarvand Niemann Delius
No ratings yet
Sattarvand Niemann Delius
14 pages
An Overview of Bench Design For Cut Slopes With An Example of An Advanced Dataset Assessment Technique
No ratings yet
An Overview of Bench Design For Cut Slopes With An Example of An Advanced Dataset Assessment Technique
18 pages
Net Present Value Maximization Model For Optimum Cut-Off Grade Policy of Open Pit Mining Operations
No ratings yet
Net Present Value Maximization Model For Optimum Cut-Off Grade Policy of Open Pit Mining Operations
10 pages
Stress Mapping Tool for Geoscientists
No ratings yet
Stress Mapping Tool for Geoscientists
17 pages
Artificial Intelligence Algorithms For Realtime Production Planning With Incoming New Information in Mining Complexes
No ratings yet
Artificial Intelligence Algorithms For Realtime Production Planning With Incoming New Information in Mining Complexes
309 pages
Strategic Mine Planning Guide
No ratings yet
Strategic Mine Planning Guide
5 pages
Basu 1999
No ratings yet
Basu 1999
7 pages
ARDL
No ratings yet
ARDL
3 pages
Key Steps in Exploratory Data Analysis
No ratings yet
Key Steps in Exploratory Data Analysis
2 pages
From Career Decision-Making Styles To Career Decision-Making Profiles
No ratings yet
From Career Decision-Making Styles To Career Decision-Making Profiles
15 pages
Non Parametric Tests - A
No ratings yet
Non Parametric Tests - A
13 pages
ADL 07 Quantitative Techniques in Management V3
No ratings yet
ADL 07 Quantitative Techniques in Management V3
5 pages
Understanding Standard Error in Commerce
No ratings yet
Understanding Standard Error in Commerce
16 pages
Big Five Traits & Student GPA
No ratings yet
Big Five Traits & Student GPA
18 pages
Multiclass vs Binary Classification
No ratings yet
Multiclass vs Binary Classification
3 pages
4) Results and Discussion
No ratings yet
4) Results and Discussion
28 pages
Case Processing Summary: Lama Kerja
No ratings yet
Case Processing Summary: Lama Kerja
2 pages
Risk (Part 2) - Variance & Covariance - Varsity by Zerodha - All Things Stock Markets Simplified
No ratings yet
Risk (Part 2) - Variance & Covariance - Varsity by Zerodha - All Things Stock Markets Simplified
6 pages
Continuous Probability Distribution
No ratings yet
Continuous Probability Distribution
22 pages
Creativity's Impact on Technopreneurship
No ratings yet
Creativity's Impact on Technopreneurship
8 pages
PDF Trial STPM Mathematics M 2 Selangor SMK Seafieldsubang Compress
No ratings yet
PDF Trial STPM Mathematics M 2 Selangor SMK Seafieldsubang Compress
5 pages
Chapter 14-Introduction To Multiple Regression
No ratings yet
Chapter 14-Introduction To Multiple Regression
67 pages
STAT501 Online - Spring2024 - FinalExam
No ratings yet
STAT501 Online - Spring2024 - FinalExam
14 pages
Solution Econometric Chapter 10 Regression Panel Data
No ratings yet
Solution Econometric Chapter 10 Regression Panel Data
3 pages
Time Series Analysis: Henrik Madsen
No ratings yet
Time Series Analysis: Henrik Madsen
25 pages
Basics of SAS for Econometrics
No ratings yet
Basics of SAS for Econometrics
14 pages
Student Score Prediction Guide
No ratings yet
Student Score Prediction Guide
4 pages
(Ebook PDF) Business Statistics A First Course, 6th Edition Instant Download
100% (5)
(Ebook PDF) Business Statistics A First Course, 6th Edition Instant Download
40 pages
Asia Paci Fic Management Review: M. Adnan Kabir, Sultana Sabina Chowdhury
No ratings yet
Asia Paci Fic Management Review: M. Adnan Kabir, Sultana Sabina Chowdhury
12 pages
PROCESS Documentation Addendum
No ratings yet
PROCESS Documentation Addendum
26 pages
n n x¯ x¯ σ = σ =: Sample 1 Sample 2
No ratings yet
n n x¯ x¯ σ = σ =: Sample 1 Sample 2
8 pages
Decision Trees
No ratings yet
Decision Trees
26 pages
Sathyabama University: Register Number
No ratings yet
Sathyabama University: Register Number
4 pages
2010 Data Envelopment Analysis As Nonparametric Least-Squares Regression
No ratings yet
2010 Data Envelopment Analysis As Nonparametric Least-Squares Regression
13 pages
Final Exam
100% (1)
Final Exam
2 pages
Financial Time Series Analysis
No ratings yet
Financial Time Series Analysis
11 pages
AS MCQ New
100% (2)
AS MCQ New
13 pages

ITM Web of Conferences 23, 00037 (2018) https://doi.org/10.

Kernel density estimation and its application

1 Introduction Two concepts play fundamental role in kernel

better represent the "true" pdf of a continuous variable.

In most common practical applications, the kernel

Fig. 3. The value of the smoothing parameter h influences the

Many types of kernel function can be found in the

3 shows  the influence of the smoothing parameter h is Reciprocal

Two versions of (6) are used in practice: the product

The radial kernel estimator is based on the Euclidean

3 Measures of discrepancy between the

MISE is one of the measures used to estimate the smoothing

symmetrical kernel function (Gaussian), the other for

h  0.79  IQR  n1/5 (14) 

parameters calculated for samples drawn from the same

Fig. 7. Kernel density estimates for four 45-year time series of

Fig. 8. Kernel density estimates for four 32-year time series of

Fig. 6. Different methods for kernel smoothing coefficient

5 Kernel density in practice

5.1 The univariate case

5.2 The bivariate case and some general

Formally, the univariate case can be easily extended to

6 Summary and conclusions

requires additionally the specification of the orientation

References

23. D. W. Scott, Multivariate Density Estimation,
