0% found this document useful (0 votes)
380 views4 pages

Pearson Product Moment Correlation Guide

This document discusses the Pearson Product Moment Correlation, a statistical method used to determine the relationship between two quantitative variables. It outlines the objectives of understanding correlation coefficients, provides the formula for calculating them, and explains the types of correlation (positive, negative, and none). Additionally, it includes examples and practice exercises for calculating and interpreting correlation coefficients.

Uploaded by

khnnmlndrz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
380 views4 pages

Pearson Product Moment Correlation Guide

This document discusses the Pearson Product Moment Correlation, a statistical method used to determine the relationship between two quantitative variables. It outlines the objectives of understanding correlation coefficients, provides the formula for calculating them, and explains the types of correlation (positive, negative, and none). Additionally, it includes examples and practice exercises for calculating and interpreting correlation coefficients.

Uploaded by

khnnmlndrz
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

Activity No.

___

PEARSON PRODUCT MOMENT CORRELATION

I. Introduction

The most meaningful research is that which seeks to find and to verify relationships
between and among variables. In correlational studies, researchers determine if a relationship
exists between two or more quantitative variables, such as score and age or reading
comprehension and word-problem solving skill in mathematics. Such relationships are
oftentimes used in prediction, to imply causation. Although causal relationships cannot be
proven through correlational researches, researchers hope eventually to make causal statements
as an outgrowth of their work (Basilio et. al, 2003).

II. Objectives

At the end of this activity, you should be able to:

1. explain what a correlation coefficient measure is.


2. calculate the correlation coefficient of the data set.
3. interpret the correlation coefficient of the data set.

III. Concepts

A. Formula of the Correlation Coefficient

Correlation measures the association or the strength of the relationship between two
variables say x and y, denoted by r. Karl Pearson (1857-1936), an influential English
mathematician and biostatistician, invented a formula that can give a numerical value about the
measure of the correlation. This formula does not only inform us how greatly two data sets are
correlated but also reveals if the correlation is direct, inverse, or none in a given interval data.
The formula named after his is called Pearson Product Moment Correlation or Pearson
Correlation Coefficient (Baltazar et. al, 2013).

To calculate the Pearson correlation coefficient r, we use the following formula:

n ( ∑ xy )−( ∑ x ) ( ∑ y )
r= where n is the number of observations
√ n¿ ¿ ¿

1
Note: The Pearson product-moment correlation is only applicable when the two variables
x and y are interval type of data.

B. Three Types of Correlation

1. Two variables are positively correlated if the values of the two variables both increase.

2. Two variables are negatively correlated if the values of one variable increase while the
values of the other decrease.

3. Two variables are not correlated or they have zero correlation if one variable neither
increases nor decreases while the other increases.

The following figure shows some scatter diagrams along with the type of linear
correlation that exists between the x and y variables. The closer |r| is to 1, the stronger the linear
relationship between the variables.

2
Example : Consider the data presented below. Determine the correlation coefficient between the
English and Mathematics Scores of the students and interpret the result.

Studen English Mathematics Xy x2 y2


t Scores (x) Scores (y)
1 5 11 55 25 121
2 8 9 72 64 81
3 7 8 56 49 64
4 10 14 140 100 196
5 7 10 70 49 100
6 12 6 72 144 36
∑ x =49 ∑ y=58 ∑ xy =465 ∑ x 2=431 ∑ 2
y =598

Solution:

To compute the correlation coefficient r, use the correlation coefficient formula and
substitute the needed parameters to the formula, that is:

n ( ∑ xy )−( ∑ x ) ( ∑ y )
r=
√ n¿ ¿ ¿
6 ( 465 )−( 49 ) (58)
r=
√ 6 ( 431 )−(49)2 √ 6 ( 598 )−(58)2
2790−2842
r=
√ 2586−2401 √3588−3364
−52 −52
r= = 203.57
√ 185 √ 224
r =¿ -0.255

The correlation coefficient of -0.255 suggests a negative correlation between English


and Mathematics scores of the students.

3
IV. Practice Exercises

1. The table below presents the length of the hand and height of the 5 persons in centimeter scale.
Calculate and interpret the correlation coefficient of the two variables stated.

Person Hand (x) Height (y)


A 18 165
B 19 170
C 16 150
D 22 173
E 18 168

2. The table below lists the scores of ten students in Statistics test and Physics test. Calculate and
interpret the correlation coefficient of the variables x and y.

Student Statistics Scores Physics Scores


1 8 12
2 10 9
3 9 16
4 13 10
5 10 15
6 14 18
7 6 9
8 8 12
9 14 20
10 15 20

REFERENCES:

1. Aufmann, R. N., Lockwood, J.S., Nation, R. D., & Clegg, D. K. (2013). Mathematical
Excursions (Third Edition). Belmont CA: Brooks/Cole Cengage Learning.

2. Baltazar, E., Ragasa, C., & Evangelista, J. (2018). Mathematics in the Modern World. C & E
Publishing Inc.,

3. Basilio, F. B., Chua E. A., Jumawan, M. T., Mangabat, LO. A., Mendoza, M. B., Pacho, E.
M., Tamoria, F. V., Villnea,. S., Vizcarra, F. O., Yambao T., M. (2003). Fundamental Statistics.
Trinitas Publishing, Inc.

Common questions

Powered by AI

A correlation coefficient interpretation can drastically change with data variability. In the example given with student scores, a calculated correlation coefficient of -0.255 suggests a weak negative correlation, but if the variability in either the English or Mathematics scores were higher or lower, this could affect the interpretation's reliability. High variability implies that scores are spread out, possibly due to underlying factors not captured by the data, and could lead to an underestimation of the true strength of the relationship. Conversely, low variability might lead to an overestimation of the relationship's strength, assuming the line represents the correlation well across the sample .

The practice exercise illustrates the Pearson correlation application by providing datasets to calculate correlations. In calculating the correlation between hand length and height, or test scores in subjects, the Pearson formula is used to assess the linear relationship. These calculations transform raw data into 'r', quantifying the relation's strength and suggesting directions for further study or inquiry. By interpreting 'r' values, one can assess whether stronger predictive links might be feasible, despite Pearson's limitations in detecting causation or linearity alone .

Misinterpreting a weak correlation in academic performance could lead to incorrect conclusions about the effectiveness of educational interventions or curricular adjustments. For instance, mistaking a weak correlation for causation might result in unwarranted resource allocation or policy shifts. To mitigate this, educators should use correlation analysis alongside other statistics, such as regression or multivariate analysis, to confirm findings. Engaging stakeholders in data discussion can also ensure broader perspectives inform decision-making, reducing reliance on a single, potentially misleading metric .

Correlation analysis is beneficial in exploratory data analysis for revealing potential relationships between variables, guiding hypothesis formation, and identifying areas for more detailed investigation. It offers a straightforward, quantitative measure of association that can underpin predictive models. However, its limitations include the inability to infer causation, its restriction to linear relationships, potential misinformation by outliers, and assumptions of normally distributed data. In complex systems where variables interact non-linearly or where there are confounding factors, reliance solely on correlation could be misleading .

Educational institutions might use correlation coefficients to identify relationships between various performance metrics, such as prior grades and standardized test scores, to predict future student performance. Such analysis can guide targeted interventions for at-risk students, optimize resource allocation, and inform strategic planning. However, caution is required as correlation does not imply causation, and predictive measures might overlook qualitative factors influencing performance. Educators should integrate holistic assessments and consider confounding variables, ensuring that predictions are made with a nuanced understanding of student dynamics .

Scatter diagrams serve as a visual representation of the relationship between two variables, allowing for a quick assessment of the type of correlation. They help identify whether a relationship exists by illustrating patterns such as linear trends or clusters. The direction and tightness of the points around a line also indicate correlation strength and type: positive if points ascend together, negative if they descend, and none if points are spread without any discernible pattern. However, scatter diagrams are limited by scale and can be misleading with small sample sizes or outliers .

To apply the Pearson Product Moment Correlation Coefficient, the variables must be on an interval scale, meaning the data should have a meaningful order and equal intervals. Additionally, the data should be approximately normally distributed without significant outliers, and the relationship should be linear. Meeting these conditions is critical because the Pearson coefficient measures linear relationships; non-linear dynamics or non-normal distributions could lead the coefficient to inaccurately represent the strength or direction of the relationship between the variables .

A correlation coefficient of -0.255 indicates a weak negative linear relationship between English and Mathematics scores. This suggests that increases in English scores are slightly associated with decreases in Mathematics scores, or vice versa, but the relationship is not strong or significant. The weak correlation implies that other factors may contribute to changes in scores, highlighting the need for further investigation into these variables .

In real-world scenarios, positive correlation is seen when two variables increase together, such as height and weight in growing children. Negative correlation occurs when one variable increases while the other decreases, exemplified by indoor temperature and heating costs. A zero correlation signifies no predictable relation, as would be the case between shoe size and intelligence. These types facilitate understanding relationships between measured phenomena, aiding in predictive modeling, trend analysis, and decision-making processes across various fields like economics, biology, and social sciences .

The Pearson Product Moment Correlation Coefficient, denoted by 'r', measures the strength and direction of the linear relationship between two interval scale variables. A value of 'r' close to 1 indicates a strong positive linear relationship, while a value close to -1 indicates a strong negative linear relationship. A result near 0 suggests no linear correlation. The Pearson coefficient, however, only detects linear relationships and can be influenced by outliers. It cannot imply causation, so while it can suggest a potential predictive relationship, it does not establish cause and effect .

You might also like