Clustering Student Learning Behaviors

Uploaded by

Ika Qutsiati Utami

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

41 views8 pages

Clustering Student Learning Behaviors

Uploaded by

Ika Qutsiati Utami

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

13

Journal of Advanced Technology and Multidiscipline (JATM)

Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162

Student’s Behavior Clustering based on

Ubiquitous Learning Log Data using
Unsupervised Machine Learning
Ika Qutsiati Utami1, Wu-Yuin Hwang2, Ratih Ardiati Ningrum3
1,3
Data Science Technology, Engineering Department, Faculty of Advanced Technology and Multidiscipline,
Universitas Airlangga, Surabaya, Indonesia
1,2
Graduate Institute of Network Learning Technology, National Central University, Taoyuan, Taiwan
2
Department of Computer Science and Information Engineering, National Dong Hwa University, Hualien, Taiwan

Abstract—Online learning is the source of data generation utilize models in improving learning and evaluating the process
related to learner’s learning behaviors, which is valuable for through instrumental investigation. According to the first
knowledge discovery. Existing research emphasized more on an definition of LA, it is an approach to collecting, analyzing, and
understanding of student’s performance and achievement from reporting educational data related to learning, learners, and its
learning log data. In this study, we presented data-driven learning related context [7]. There are several techniques used for the
behavior clustering in authentic learning context to understand
students’ behavior while participating in the learning process. The
analytical process of learning related data such as supervised
objective of the study is to distinguish students according to their and unsupervised learning methods. The main difference
learning behavior characteristics and identify clusters of students between the two approaches is the use of labeled data (Nafis
at risk of unsuccessful learning achievement. Learning log data and Biswas, 2022; Shakarami, Shahidinejad and Ghobaei-
were collected from ubiquitous learning applications before Arani, 2021). In the unsupervised learning method, the process
conducting Exploratory Data Analysis (EDA) and cluster analysis. of data analysis to learn the patterns from data does not require
We used partitional clustering using K-means algorithm and labeled input and output data. It is being used generally for
hierarchical clustering based on the agglomerative method to clustering and segmentation-related tasks. The algorithm
improve clustering strategies. The result of this study revealed performs natural clustering over the dataset to identify similar
three different clusters of students supported by data visualization
techniques. Cluster 1 comprised more students with active
patterns and characteristics. The process of learning about user
learning behavior based on the total logs, total problems posed, behavior from log data typically involves partitioning the data
and the total attempts in fraction operation and simplification. into meaningful subsets, called partitions, and comparing the
Students in clusters 2 and 3 had a higher attempt at problem- different partitions.
solving instead of problem-posing. Both clusters also focused on In an educational context, cluster analysis can be used to
fraction’s conceptual understanding. Knowledge discovery of this gain insight into structured data such as student behavior
study used real data generated from ubiquitous learning grouping, finding similar learning patterns, and student
application namely U-Fraction. We combined two different types performance clustering [10], [11]. However, despite the
of clustering method for delivering more accurate portrait of a potential of unsupervised learning or cluster analysis for LA, it
student’s hidden learning behaviors. The outcome of this study can
be a basis for educational stakeholders to provide preventive
is seldom utilized for supporting teaching and learning analysis
learning strategies tailored to a different cluster of students. in ubiquitous learning contexts based on students’ learning log
data [12]. Log data is automatically produced files and
Keywords—Learning analytics, behavior clustering, timestamps relevant to the system or software application [13].
unsupervised learning, learning log-data, education research, Log data can provide a portrait of a student’s hidden learning
educational policies. behavior and give a more complete or accurate picture of all
behaviors. Yet, log data generated by the learning application
I. INTRODUCTION server had left the characteristic prone to data noise. The
Over ten years, starting from 2011, learning analytics (LA) process of mining and reducing noise in log data is considered
with data-driven analysis has arisen by exploiting machine as challenging task. In addition to that fact, this study tries to
learning in the educational field [1], [2]. Several research perform an unsupervised learning method on student behavior
studies in educational data mining and artificial intelligence based on learning log data generated from ubiquitous learning
have tempted to distinguish the LA movement in an educational applications. In this study, log data refers to all students’
context [3], [4], [5], [6]. LA used educational data for activity while using the learning system namely ubiquitous
knowledge discovery and transform data into meaningful fraction (U-Fraction) [14], [15]. This learning application is
insights. It is used for leveraging educational data to support the installed on a tablet device with an Android operating system.
teaching and learning process. The main purpose of LA is to By analyzing learning log data produced from the application,
14
Journal of Advanced Technology and Multidiscipline (JATM)
Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162
the educational stakeholder can obtain learning problems at the Furthermore, students learning and social interaction with real-
earliest possible time. Additionally, it can enable them to world situation is critical to be learned and analyzed. Currently,
resolve learning issues in a timelier fashion. Most importantly, there have been few studies that examine students' actions like
a lot of data from learning systems and applications can be their interaction behaviors rather than their perceptions and
analyzed using machine-learning techniques to support performance. The present study takes a further step toward the
decision-making in the educational field. We structured this direction to propose an approach in interpreting students
paper as follows. In Section 1, an overview of LA especially for learning log data to understand how students learn in the
cluster analysis in an educational context was presented. authentic situation over time. These findings are hinting that log
Section 2 presented a literature review of related studies. data could be an important source to identify behavioral
Section 3 described the methodological part of the research. interaction in authentic learning context.
Furthermore, section 4 explained the result of the study
B. Cluster analysis used in educational purpose
followed by a research discussion. Last, section 5 provided the
conclusion of the study. . To support the call for LA in education, several cluster
analyses have been researched in the literature. While some
II. LITERATURE REVIEW research studies have collected log-file data from virtual
learning environments, the data was frequently evaluated using
A. Learning log data in educational context more conventional statistical techniques like regression,
Learning log data is defined as important source to provide correlation, and t-tests rather than analytics algorithms [22].
powerful portrait of students learning patterns and their hidden Instead, cluster analysis serves as an exploratory method that
behaviors during participation on learning process. Log data are aims to identify naturally occurring homogeneous groups that
commonly collected from online learning platforms such as were either unclear or previously unknown [23]. With a rapid
virtual learning environment, e-learning, or mobile learning increase in available learner data, cluster analysis becomes the
applications. Accessing and analyzing learning log data is potential in understanding and unveiling hidden information
challenging due to privacy issue and proper storage about students in educational settings [24]. Studies by Yadav
management. Effective learning log data management requires [25], [26] proposed a new approach known as hybrid clustering
more time to be processed because the huge amount of to assess students’ academic performance. The clusters are
information collected from online server need complex formed based on the intelligence level of students. Walsh and
treatment like understanding of application usage, Risquez [22] used cluster analysis to explore the engagement of
preprocessing task, data engineering, and data architecture native and non-native English-speaking management students
provision. In the past research, some studies focused on the in a flipped classroom. They used log file data to identify hidden
direction how to interpret learning log data in understanding patterns in student behavior, paying particular attention to the
student learning process in flipped classroom [16], [17], [18]. institution's native language proficiency.
Commonly, researchers on learning analytics used learning log Research shows the exploratory potential of cluster
data from Learning Management System (e.g., Moodle, analysis on log file data in other contexts such as peer tutoring
Canvas, etc.) or Massive Open Online Course (e.g., Coursera, [27], [28]. However, despite its potential, cluster analysis is still
Udemy, etc.). The learning analytics goals emphasized teaching underutilized in the context of education. Moreover, the rare
and learning processes in asynchronous learning networks. For previous application of cluster analysis to study student
example, data collection related to the number of posts, the learning behavior in ubiquitous learning contexts remained
number of posts read, the number of posts replied, and content unclear. The present study is adapted from the work of
viewed. Jovanovic et al. [29]. However, in the present study, we applied
A limitation of previous studies is that they focus on cluster analysis to log-file data to identify patterns in how
student performance and student satisfaction which typically students access online resources over time while engaging with
rely on self-reporting and may be inaccurate [19], [20]. a ubiquitous learning application. This paper attempts to
Therefore, more studies based on log-file data are needed in address the lack of research using learning analytics in the
order to add an additional level of research validity to the ubiquitous learning context, using students' learning log data
understanding of students' behavior in relation to the authentic from a mobile application, and the cluster analysis algorithm
learning approach. It has been argued that log-file data may be using hierarchical and partitional methods.
more genuine and authentic than survey data, which are prone
bias into students' interpretations [21]. Instead, learning III. METHOD
analytics can reflect real and uninterrupted user behavior [17]. In the present study, we employed EDA as an initial
Therefore, rather than relying on student perceptions, this study technique for understanding the dataset. Investigation of data
examines ubiquitous application learning log data on how using EDA is used to discover unseen patterns, data anomalies,
student interact in authentic learning context and how students and a summary of the data [30]. Two important practices in
accessed learning material. However, the use of learning log EDA i.e., descriptive statistics and data visualization were used
data in mobile application particularly for authentic learning to gather insight from the data [31]. Before conducting EDA,
context had not yet fully exploited to unveil students learning we accessed the data from the online repository and organized
behaviors. Whereas, the adoption and acceptance of mobile it using Structured Query Language (SQL) operations such as
learning has led to a dramatic increase in available learner data. data selection, data join, and data aggregation. We used
15
Journal of Advanced Technology and Multidiscipline (JATM)
Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162
learning log data generated from a ubiquitous learning means algorithm, we also performed the agglomerative method
application namely U-Fraction. The dataset is related to student as a bottom-up approach to hierarchical clustering. Recursively,
learning activity while using the application such as problem- each observation starts in its cluster, and pairs of clusters are
solving activities and peer assessment. It was adapted from an merged as one moves up the hierarchy. This method works from
experimental study conducted by Hwang et. al. in 2018 [32]. the dissimilarities between the objects to be grouped. A type of
The data log structure before data preprocessing is represented dissimilarity can be suited to the subject studied and the nature
by the database design (Figure 2). There are 10 variables of the data. Overall, the process in research methodology is
selected for cluster analysis after the data preprocessing stage presented in Figure 1.
and feature selection stage. The attributes of the dataset are TABLE 3
K-MEANS PSEUDOCODE
presented in Table 1.
Algorithm 1. K-Means Algorithm
TABLE 1
THE ATTRIBUTE OF THE DATASET Data: number of clusters k, dataset X
Result: cluster centres C = {c1, ..., ck}
No Attribute name Description
Start
1 Operation Total attempts of fraction operation
Randomly select k data points as initial cluster centres;
2 Success_oper Total of successful fraction operation
Repeat
3 Simplification Total attempts at fraction simplification
Reinitialize all partition S subsets as empty:
4 Success_simp Total of successful fraction’s simplification
S1 = S2 = ··· = Sk = {};
5 Asking Problem posing
Compute the distance of each data point to each cluster centre;
6 Answer Problem-solving
Assign each data point to the closest cluster centre:
7 Comment Peer assessment
for i ∈ {1, ..., N} do
8 Understanding Fraction understanding
respective label l = argminj ∈ {1, ..., k} ‖ xi – cj ‖2;
9 Log1 Total data logging 1
Sl = Sl ∪ {xi};
10 Log2 Total data logging 2
End
Define new cluster centres based on the current partition:
After the data preprocessing step with EDA, we followed a two- for j ∈ {1, ..., k} do
step cluster analysis using a K-means algorithm and cj = ∑ i ∈ {1, ..., N} xi ∈ Sj xi / |Sj|
agglomerative method. The K-means algorithm is a partition- End
until the cluster assignment converges;
based clustering method, while agglomerative is a hierarchical End
clustering method [27], [28]. K-means is best suited for a small-
to-medium number of clusters, as is the case for student IV. RESULT AND DISCUSSION
behavior clustering of this study [29]. The clustering process in
K-means started by defining the number of clusters k [30], [31]. In this section, we explained the results of the present
In addition, each of k is represented by a cluster center and each study. The results of the study are categorized into two sub-
data point is assigned to the nearest cluster center namely the sections as follows:
centroid. The algorithm group data that has similar A. Exploratory data analysis
characteristics into the sample cluster, while data with different
In this step, we performed data pre-processing using EDA
characteristics are grouped into other clusters [32], [33].
such as data cleaning (i.e., missing value computation and data
Typically, the Euclidean distance is used as a distance measure.
noise treatment), data transformation, and data reduction. EDA
The calculation using the Euclidian Distance formula (equation
is important step in data analytic task because it performs initial
1) with the description of the formula in Table 2 is as follows:
investigation on data to discover patterns, to spot some
d (x, y) = √∑𝑛𝑖=1(𝑦𝑖 − 𝑥𝑖 )2 (1) anomalies, to test hypothesis, and to check assumptions using
summary statistics and graphical representations. In present
TABLE 2
study, we employed several Python libraries such as Pandas,
THE FORMULA DESCRIPTION
Symbol Description NumPy, Matplotlib, and ScikitLearn to perform the EDA’s
d Calculation of the distance to the center of the cluster operation and cluster analysis. The learning log dataset
x Point coordinates of the object comprises 4202 observations and 11 characteristics. We used
y Centroid coordinate data.head(10) function to show the dataset with only ten rows
𝑛
The amount of data to be measured, while i = 1 is the available (see Table 4). Furthermore, dataset information
∑(𝑦𝑖 − 𝑥𝑖 )2
clustering process starting from the first iteration including summary and missing value checking results is
𝑖=1
xi Coordinate the point of the i object presented in Figure 3. From the dataset summary, we can
yi i centroid coordinate point
identify the total of the column and the data type of each
column. Data has only non-null and integer values. In addition,
In the next step, new cluster centers are defined as the center of missing value analysis is used to check whether the dataset
mass of each cluster candidate. Unless the following contains a null value after the data pre-processing step. From
termination criterion is met, this process is repeated. The the result, we concluded that all columns have no missing
algorithm terminates if the last iteration did not lead to changes values.
in the assignment of each data point to the current cluster
centers [25]. The pseudocode is given in Table 3. Beside a K-
16
Journal of Advanced Technology and Multidiscipline (JATM)
Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162

Fig. 1 Research design

Fig. 2 The data log structures

Fig. 3 Dataset information. (a) data summary, (b) missing-value check

TABLE 4
THE DATASET WITH THE TOP 10 ROWS
User Operation Success_oper Simplify Success_simp Asking Answer Comment Understanding Log1 Log2
1 24 22 162 16 243 69 20 292 2621 518
2 45 44 128 20 242 74 14 287 2928 512
3 148 18 177 18 74 70 21 290 2601 541
4 9 38 138 44 97 73 12 258 2300 560
5 62 29 137 35 98 81 12 264 2382 1429
6 109 33 381 12 203 87 19 241 2290 3401
7 33 22 181 15 192 85 12 255 2892 506
8 26 18 239 12 107 93 12 291 2334 943
9 39 49 144 53 161 89 12 254 2284 567
10 48 20 155 16 153 135 12 244 2212 1418
17
Journal of Advanced Technology and Multidiscipline (JATM)
Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162

Fig. 5 Silhouette score

B. Cluster analysis with K-means algorithm and
agglomerative method TABLE 5
Cluster analysis in step 1, the student’s clusters of learning THE CLUSTER CHARACTERISTIC
Cluster 1 Cluster Cluster 3
behavior are identified using a K-means algorithm. We used Characteristic
(N=16) 2 (N=8) (N=1)
two methods to select the optimum number of clusters k. The Total attempts of fraction operation 109.0 42.4 36.9
first method is based on Elbow Method, an empirical method to Total of successful fraction operation 33.0 22.9 26.0
obtain the best value of k. This method calculates the sum of Total attempts at fraction 381.0 173.1 186.6
simplification
the square of the points and the average distance. Figure 4 Total of successful fraction’s 12 22.7 22.8
shows the result of the elbow method. We concluded that the simplification
optimal value of the cluster is 3 as presented by the last elbow Problem-posing 203.0 158.0 125.9
point (Figure 4). The second method is called the Silhouette Problem-solving 87.0 99.1 96.6
Peer assessment 19.0 19.5 15.4
method. It calculates the silhouette coefficient of every point. Fraction understanding 241.0 263.5 264.0
The value of the Silhouette score varies from -1 to 1. Silhouette Total data logging 1 2290.0 2426.4 2445.2
score 1 means the cluster is dense and well-separated than other Total data logging 2 3401.0 569.9 1501.1
clusters. A value near 0 represents overlapping clusters with
samples very close to the decision boundary of the neighboring Based on the visualization, three clusters of students were
clusters. A negative score indicates that the samples might have identified. Cluster 1 comprised more students with active
got assigned to the wrong clusters. In this cluster analysis, we learning behavior based on the total logs, total problems posed,
obtained 3 clusters as the optimum value because it has a higher and the total attempts in fraction operation and simplification.
score of the Silhouette method (Figure 5). Furthermore, the Students in clusters 2 and 3 had a higher attempt at problem-
characteristic of each cluster is shown in Table 5. The result solving instead of problem-posing. Both clusters also focused
revealed different clusters of students based on their learning on fraction understanding. Additionally, students in cluster 1
behavior variable related to a ubiquitous learning activity. We were similar to those in cluster 2 in terms of peer assessment
also presented the comparison of each cluster of students’ activity. In step 2 of cluster analysis, we performed hierarchical
learning behavior using parallel coordinates plots (Figure 6). clustering based on the agglomerative method. Figure 7 shows
a dendrogram, a diagram of the hierarchical relationship
between the students. The clades that are close to the same
height are similar to each other. The result revealed all students
of each cluster that has similar learning behavior characteristics.
According to the result, there are three clusters performed.
Cluster 1 consists of 16 students, followed by cluster 2 with 8
students and cluster 3 with 1 student. This result of hierarchical
clustering using the agglomerative method is similar to the
previous method using a K-means algorithm as partitional
clustering.

V. CONCLUSIONS
This research used the unsupervised learning method of
machine learning to discover a similar pattern of students’
learning log data and perform cluster analysis in order to obtain
Fig. 4. Elbow method students’ behavior clustering. The dataset is collected from
students’ learning activity while using the ubiquitous fraction
app called U-Fraction. Data are processed in the initial step
using EDA for data cleaning and transformation. Partition-
based clustering methods using the K-means algorithm and
hierarchical clustering methods using an agglomerative
approach are used to create a cluster of students. The result
showed three different clusters of students with different
learning behavior characteristics. Cluster 1 comprised more
students with active learning behavior based on the total logs,
total problems posed, and the total attempts in fraction
operation and simplification. Students in clusters 2 and 3 had a
higher attempt at problem-solving instead of problem-posing.
Both clusters also focused on fraction understanding. However,
no significant difference in peer assessment activity among the
groups. The outcome of this study can help educational
18
Journal of Advanced Technology and Multidiscipline (JATM)
Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162
stakeholders to provide preventive learning strategies tailored
to different clusters of students.

Fig. 6 The comparison of each cluster

Fig. 7 Clustering results using hierarchical clustering

[2] D. J. Lemay, C. Baek, and T. Doleck, “Comparison of learning analytics

and educational data mining: A topic modeling approach,” Computers and
ACKNOWLEDGMENT Education: Artificial Intelligence, vol. 2, p. 100016, 2021, doi:
All authors read and approved the final manuscript. https://doi.org/10.1016/j.caeai.2021.100016.
[3] A. A. Mubarak, H. Cao, and S. A. M. Ahmed, “Predictive learning
analytics using deep learning model in MOOCs’ courses videos,” Educ Inf
REFERENCES Technol (Dordr), vol. 26, no. 1, pp. 371–392, 2021, doi: 10.1007/s10639-
[1] B. Quadir, M. Chang, and J. C. Yang, “Categorizing learning analytics 020-10273-6.
models according to their goals and identifying their relevant components: [4] M. Shorfuzzaman, M. S. Hossain, A. Nazir, G. Muhammad, and A. Alamri,
A review of the learning analytics literature from 2011 to 2019,” “Harnessing the power of big data analytics in the cloud to support learning
Computers and Education: Artificial Intelligence, vol. 2, p. 100034, 2021, analytics in mobile learning environment,” Comput Human Behav, vol. 92,
doi: https://doi.org/10.1016/j.caeai.2021.100034. pp. 578–588, 2019, doi: https://doi.org/10.1016/j.chb.2018.07.002.
19
Journal of Advanced Technology and Multidiscipline (JATM)
Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162
[5] R. Bodily, R. Nyland, and D. Wiley, “The RISE Framework: Using [22] J. N. Walsh and A. Rísquez, “Using cluster analysis to explore the
Learning Analytics to Automatically Identify Open Educational Resources engagement with a flipped classroom of native and non-native English-
for Continuous Improvement The RISE Framework: Using Learning speaking management students,” International Journal of Management
Analytics to Automatically Identify Open Educational Resources for Education, vol. 18, no. 2, Jul. 2020, doi: 10.1016/j.ijme.2020.100381.
Continuous Improvement Bodily, Nyland, and Wiley,” 2017. [23] S. Brown, S. White, and N. Power, “Cluster Analysis of Assessment in
[6] K. M. L. Jones, “Learning analytics and higher education: a proposed Anatomy and Physiology for Health Science Undergraduates,”
model for establishing informed consent mechanisms to promote student International Journal of Teaching and Learning in Higher Education, vol.
privacy and autonomy,” International Journal of Educational Technology 28, no. 1, pp. 102–109, 2016, [Online]. Available:
in Higher Education, vol. 16, no. 1, p. 24, 2019, doi: 10.1186/s41239-019- http://www.isetl.org/ijtlhe/
0155-0. [24] W. Greller and H. Drachsler, “Translating Learning into Numbers: A
[7] J. Han, K. H. Kim, W. Rhee, and Y. H. Cho, “Learning analytics Generic Framework for Learning Analytics,” 2012. [Online]. Available:
dashboards for adaptive support in face-to-face collaborative http://groups.google.com/group/learninganalytics
argumentation,” Comput Educ, vol. 163, p. 104041, 2021, doi: [25] J. Patel and R. S. Yadav, “Applications of Clustering Algorithms in
https://doi.org/10.1016/j.compedu.2020.104041. Academic Performance Evaluation,” OAlib, vol. 02, no. 08, pp. 1–14,
[8] M. T. Nafis and R. Biswas, “A secure technique for unstructured big data 2015, doi: 10.4236/oalib.1101623.
using clustering method,” International Journal of Information [26] R. S. Yadav, “Application of hybrid clustering methods for student
Technology, vol. 14, no. 3, pp. 1187–1198, 2022, doi: 10.1007/s41870- performance evaluation,” International Journal of Information Technology,
019-00278-x. vol. 12, no. 3, pp. 749–756, 2020, doi: 10.1007/s41870-018-0192-2.
[9] A. Shakarami, A. Shahidinejad, and M. Ghobaei-Arani, “An autonomous [27] M. De Smet, H. Van Keer, and M. Valcke, “Blending asynchronous
computation offloading strategy in Mobile Edge Computing: A deep discussion groups and peer tutoring in higher education: An exploratory
learning-based hybrid approach,” Journal of Network and Computer study of online peer tutoring behaviour,” Comput Educ, vol. 50, no. 1, pp.
Applications, vol. 178, p. 102974, 2021, doi: 207–223, Jan. 2008, doi: 10.1016/j.compedu.2006.05.001.
https://doi.org/10.1016/j.jnca.2021.102974. [28] G. Lust, M. Vandewaetere, E. Ceulemans, J. Elen, and G. Clarebout, “Tool-
[10] S. S. Hamidi, E. Akbari, and H. Motameni, “Consensus clustering use in a blended undergraduate course: In Search of user profiles,” Comput
algorithm based on the automatic partitioning similarity graph,” Data Educ, vol. 57, no. 3, pp. 2135–2144, Nov. 2011, doi:
Knowl Eng, vol. 124, p. 101754, 2019, doi: 10.1016/j.compedu.2011.05.010.
https://doi.org/10.1016/j.datak.2019.101754. [29] J. Jovanović, D. Gašević, S. Dawson, A. Pardo, and N. Mirriahi, “Learning
[11] T. Li, A. Rezaeipanah, and E. M. Tag El Din, “An ensemble agglomerative analytics to unveil learning strategies in a flipped classroom,” Internet and
hierarchical clustering algorithm based on clusters clustering technique and Higher Education, vol. 33, pp. 74–85, Apr. 2017, doi:
the novel similarity measurement,” Journal of King Saud University - 10.1016/j.iheduc.2017.02.001.
Computer and Information Sciences, vol. 34, no. 6, Part B, pp. 3828–3842, [30] A. K. Jain, R. P. W. Duin, and J. Mao, “Statistical pattern recognition: a
2022, doi: https://doi.org/10.1016/j.jksuci.2022.04.010. review,” IEEE Trans Pattern Anal Mach Intell, vol. 22, no. 1, pp. 4–37,
[12] A. Mathrani, T. Susnjak, G. Ramaswami, and A. Barczak, “Perspectives 2000, doi: 10.1109/34.824819.
on the challenges of generalizability, transparency and ethics in predictive [31] C.-M. Liu, Z. Niu, and K.-T. Liao, “Mechanisms to improve clustering
learning analytics,” Computers and Education Open, vol. 2, p. 100060, uncertain data with UKmeans,” Data Knowl Eng, vol. 116, pp. 61–79,
2021, doi: https://doi.org/10.1016/j.caeo.2021.100060. 2018, doi: https://doi.org/10.1016/j.datak.2018.05.004.
[13] S. Dumais, R. Jeffries, D. M. Russell, D. Tang, and J. Teevan, [32] W.-Y. Hwang, I. Q. Utami, S. W. D. Purba, and H. S. L. Chen, “Effect of
“Understanding User Behavior Through Log Data and Analysis,” in Ways Ubiquitous Fraction App on Mathematics Learning Achievements and
of Knowing in HCI, J. S. Olson and W. A. Kellogg, Eds., New York, NY: Learning Behaviors of Taiwanese Students in Authentic Contexts,” IEEE
Springer New York, 2014, pp. 349–372. doi: 10.1007/978-1-4939-0378- Transactions on Learning Technologies, vol. 13, no. 3, pp. 530–539, 2020,
8_14. doi: 10.1109/TLT.2019.2930045.
[14] W.-Y. Hwang, I. Q. Utami, and H. Chen, “An Evaluation Study of [33] N. Nidheesh, K. A. Abdul Nazeer, and P. M. Ameer, “An enhanced
Learning Behaviors and Achievements with Ubiquitous Fraction (u- deterministic K-Means clustering algorithm for cancer subtype prediction
Fraction) for Elementary School Student,” in 2018 IEEE 18th International from gene expression data,” Comput Biol Med, vol. 91, pp. 213–221, 2017,
Conference on Advanced Learning Technologies (ICALT), 2018, pp. 350– doi: https://doi.org/10.1016/j.compbiomed.2017.10.014.
354. doi: 10.1109/ICALT.2018.00087.
[15] I. Q. Utami and W.-Y. Hwang, “The impact of collaborative problem Ika Qutsiati Utami is a Ph.D. student at the Graduate
posing and solving with ubiquitous-decimal app in authentic contexts on Institute of Network Learning Technology, College of
math learning,” Journal of Computers in Education, vol. 9, no. 3, pp. 427– Electrical Engineering & Computer Science, National
454, 2022, doi: 10.1007/s40692-021-00209-5. Central University, ROC, Taiwan. She received B.S.
[16] J. N. Walsh and A. Rísquez, “Using cluster analysis to explore the Degree in Information Systems from the Faculty of
engagement with a flipped classroom of native and non-native English- Computer Science, Brawijaya University, Indonesia &
speaking management students,” International Journal of Management obtained M.S. Degree in Network Learning Technology
Education, vol. 18, no. 2, Jul. 2020, doi: 10.1016/j.ijme.2020.100381. from the College of Electrical Engineering & Computer
[17] W. Greller and H. Drachsler, “Translating Learning into Numbers: A Science, National Central University, Taiwan. She is also working as a lecturer
Generic Framework for Learning Analytics,” 2012. [Online]. Available: at the Faculty of Advanced Technology and Multidiscipline, Universitas
http://groups.google.com/group/learninganalytics Airlangga, Indonesia. Her research interests are in the areas of Human-
[18] D. Gašević, S. Dawson, T. Rogers, and D. Gasevic, “Learning analytics Computer Interaction, AI/ML for Future Education, and Educational Data
should not promote one size fits all: The effects of instructional conditions Science.
in predicting academic success,” Internet and Higher Education, vol. 28,
pp. 68–84, Jan. 2016, doi: 10.1016/j.iheduc.2015.10.002. Wu-Yuin Hwang is a professor affiliated with both the
[19] M. B. Gilboy, S. Heinerichs, and G. Pazzaglia, “Enhancing student Department of Computer Science and Information
engagement using the flipped classroom,” J Nutr Educ Behav, vol. 47, no. Engineering, College of Science and Engineering,
1, pp. 109–114, Jan. 2015, doi: 10.1016/j.jneb.2014.08.008. National Dong Hwa University, Taiwan, and the
[20] J. Jovanović, D. Gašević, S. Dawson, A. Pardo, and N. Mirriahi, “Learning Institute of Network Learning Technology, National
analytics to unveil learning strategies in a flipped classroom,” Internet and Central University, Taiwan. His current research
Higher Education, vol. 33, pp. 74–85, Apr. 2017, doi: interests are related to the integration of IOT, AI, and
10.1016/j.iheduc.2017.02.001. multimedia sensors of mobile devices for learning and
[21] I.-H. Jo, D. Kim, and M. Yoon, “International Forum of Educational interactions among humans and all things in AR contexts like smart buildings
Technology & Society Constructing Proxy Variables to Measure Adult and campuses. Dr. Hwang received the Outstanding Research Award, from the
Learners’ Time Management Strategies in LMS,” Source: Journal of Ministry of Science and Technology, Taiwan in 2021. He is also ranked in the
Educational Technology & Society, vol. 18, no. 3, pp. 214–225, 2015, doi: top 7 scholars of the world in terms of high-quality journal publication
10.2307/jeductechsoci.18.3.214. performance of instructional design and technology.
20
Journal of Advanced Technology and Multidiscipline (JATM)
Vol. 03, No. 01, 2024, pp. 13-20
e-ISSN: 2964-6162
Ratih Ardiati Ningrum is a lecturer and researcher in
Engineering Faculty at Airlangga University where she
has been a faculty member since 2020. She graduated
with a Statistics master’s joint degree program from
National Chiao Tung University, Taiwan, and Institut
Teknologi Sepuluh Nopember, Surabaya. Currently, her
research focuses on survival analysis and structural
equation modeling.

Student Behavior Clustering in Learning
No ratings yet
Student Behavior Clustering in Learning
7 pages
Latika Project
No ratings yet
Latika Project
30 pages
Data Mining for Moodle Learning Optimization
No ratings yet
Data Mining for Moodle Learning Optimization
9 pages
Educational Data Mining: A Literature Review
No ratings yet
Educational Data Mining: A Literature Review
9 pages
A Framework To Support Educational Decis
No ratings yet
A Framework To Support Educational Decis
10 pages
Predicting Student Success in E-Learning
No ratings yet
Predicting Student Success in E-Learning
20 pages
Learning Analytics for Educators
No ratings yet
Learning Analytics for Educators
10 pages
Literature Review 1. Computational Methods For The Analysis of Learning and Knowledge Building Communities
No ratings yet
Literature Review 1. Computational Methods For The Analysis of Learning and Knowledge Building Communities
13 pages
Abstract Educational Data Mining
No ratings yet
Abstract Educational Data Mining
2 pages
Learning Analytics As A Tool For Analysing Student Agency in Higher Education
No ratings yet
Learning Analytics As A Tool For Analysing Student Agency in Higher Education
20 pages
M. Eulàlia Torras-Virgili, Andreu BELLOT-URBANO: Learning Analytics: Online Higher Education in Management
No ratings yet
M. Eulàlia Torras-Virgili, Andreu BELLOT-URBANO: Learning Analytics: Online Higher Education in Management
7 pages
AICS 2016 Paper 9
No ratings yet
AICS 2016 Paper 9
12 pages
Dne 110309 F
No ratings yet
Dne 110309 F
11 pages
Educational Data Mining Insights
No ratings yet
Educational Data Mining Insights
8 pages
Chapter 12 Baker Siemens V 3
No ratings yet
Chapter 12 Baker Siemens V 3
30 pages
Student Cluster Analysis Based On Moodle Data and Academic Performance Indicators
No ratings yet
Student Cluster Analysis Based On Moodle Data and Academic Performance Indicators
4 pages
Analysis of Data Mining Techniques Applied To LMS For Personalized Education
No ratings yet
Analysis of Data Mining Techniques Applied To LMS For Personalized Education
5 pages
Handling Missing Values in Decision Trees
No ratings yet
Handling Missing Values in Decision Trees
6 pages
Ej 1301320
No ratings yet
Ej 1301320
33 pages
Penulisan La Group
No ratings yet
Penulisan La Group
19 pages
Preprocessing and Analyzing Educational Data Set Using X-API For Improving Student's Performance
No ratings yet
Preprocessing and Analyzing Educational Data Set Using X-API For Improving Student's Performance
5 pages
Daud 2017
No ratings yet
Daud 2017
7 pages
Inggris Mining
No ratings yet
Inggris Mining
15 pages
IGI - Revised - Manuscript CleanCopy With Tables
No ratings yet
IGI - Revised - Manuscript CleanCopy With Tables
40 pages
Learning Analytics Enhancing Student Success
No ratings yet
Learning Analytics Enhancing Student Success
5 pages
Infedu 19 3 Infe2020 - 3 - 17
No ratings yet
Infedu 19 3 Infe2020 - 3 - 17
38 pages
Using Virtual Earning
No ratings yet
Using Virtual Earning
24 pages
Comparison of Clustering Algorithms For Learning A
No ratings yet
Comparison of Clustering Algorithms For Learning A
8 pages
Learning Analytics in Higher Education
No ratings yet
Learning Analytics in Higher Education
17 pages
Yash 21BSDS12 Perdictive Analysis Report
No ratings yet
Yash 21BSDS12 Perdictive Analysis Report
20 pages
OnlyFans Management Agreement
No ratings yet
OnlyFans Management Agreement
7 pages
Muslim Biologist and Other Biologist Contribution
No ratings yet
Muslim Biologist and Other Biologist Contribution
2 pages
Lee Chong Wei: Badminton Legend Biography
No ratings yet
Lee Chong Wei: Badminton Legend Biography
1 page
Tejan Kharade PM Resume
No ratings yet
Tejan Kharade PM Resume
2 pages
Physics Measurements and Units Guide
No ratings yet
Physics Measurements and Units Guide
5 pages
Diding (Math L 12-13)
100% (1)
Diding (Math L 12-13)
4 pages
Gender Equality
No ratings yet
Gender Equality
1 page
Namma Kalvi 6th Maths Sura Sample Guide em Term 2 218562
No ratings yet
Namma Kalvi 6th Maths Sura Sample Guide em Term 2 218562
21 pages
Explorers 6 Intro Final 0
No ratings yet
Explorers 6 Intro Final 0
25 pages
Clearance Certificate SHIHAB
No ratings yet
Clearance Certificate SHIHAB
1 page
Major Synopsis IPU PDF
No ratings yet
Major Synopsis IPU PDF
17 pages
Check List: (For Joining As .. . . .. in AIIMS, Bhubaneswar)
No ratings yet
Check List: (For Joining As .. . . .. in AIIMS, Bhubaneswar)
31 pages
Chapter 2 Cognitive Neuroscience
No ratings yet
Chapter 2 Cognitive Neuroscience
23 pages
Biomechanical Design Considerations For Transradial Prosthetic Interface: A Review
No ratings yet
Biomechanical Design Considerations For Transradial Prosthetic Interface: A Review
12 pages
Practice Scenario Laceration Chart
No ratings yet
Practice Scenario Laceration Chart
7 pages
IELTS Speaking Practice Guide
No ratings yet
IELTS Speaking Practice Guide
5 pages
5B Cash in The Hat Game
No ratings yet
5B Cash in The Hat Game
18 pages
Visual Basic Basics Notes
No ratings yet
Visual Basic Basics Notes
339 pages
Digital Business Models Explained
No ratings yet
Digital Business Models Explained
69 pages
SYLLABUS
No ratings yet
SYLLABUS
20 pages
Claims Employees
No ratings yet
Claims Employees
58 pages
Enzyme Models for Students
No ratings yet
Enzyme Models for Students
2 pages
Biological Diversity Act and ITK
No ratings yet
Biological Diversity Act and ITK
18 pages
Je 04-08022024
No ratings yet
Je 04-08022024
1 page
Importing Backing Sheet FRM CAD - Draft.
No ratings yet
Importing Backing Sheet FRM CAD - Draft.
3 pages
AANOUKIS 2023 Corporate Overview
No ratings yet
AANOUKIS 2023 Corporate Overview
37 pages
MSC USA Booking Confirmation Details
No ratings yet
MSC USA Booking Confirmation Details
4 pages
Baics of SystemVerilog
100% (1)
Baics of SystemVerilog
4 pages
Revised 2017 STUDENT HEALTH VERIFICATION PDF
No ratings yet
Revised 2017 STUDENT HEALTH VERIFICATION PDF
1 page
Time Table MBA IV Sem 2023
No ratings yet
Time Table MBA IV Sem 2023
1 page

Clustering Student Learning Behaviors

Uploaded by

Clustering Student Learning Behaviors

Uploaded by

13

Journal of Advanced Technology and Multidiscipline (JATM)

Student’s Behavior Clustering based on

Fig. 1 Research design

Fig. 2 The data log structures

Fig. 3 Dataset information. (a) data summary, (b) missing-value check

Fig. 5 Silhouette score

Fig. 6 The comparison of each cluster

Fig. 7 Clustering results using hierarchical clustering

[2] D. J. Lemay, C. Baek, and T. Doleck, “Comparison of learning analytics

You might also like