Intelligent Medicine and Health Care 1st Edition Chien-Hung Yeh Digital Download
Intelligent Medicine and Health Care 1st Edition Chien-Hung Yeh Digital Download
https://ebookgate.com/product/intelligent-medicine-and-health-care-1st-edition-chien-hung-yeh/
DOWNLOAD EBOOK
Intelligent Medicine and Health Care 1st Edition Chien-Hung
Yeh pdf download
Available Formats
https://ebookgate.com/product/economics-medicine-and-health-care-3rd-
edition-gavin-h-mooney/
ebookgate.com
https://ebookgate.com/product/intelligent-and-adaptive-systems-in-
medicine-1st-edition-olivier-c-l-haas/
ebookgate.com
https://ebookgate.com/product/dementia-care-medicine-1st-edition-
trevor-adams/
ebookgate.com
https://ebookgate.com/product/international-health-care-management-
volume-5-advances-in-health-care-management-1st-edition-grant-savage/
ebookgate.com
https://ebookgate.com/product/health-care-and-christian-ethics-robin-
gill/
ebookgate.com
Special Issue Reprint
Intelligent Medicine
and Health Care
Edited by
Chien-Hung Yeh, Xiaojuan Ban, Men-Tzung Lo, Wenbin Shi and Shenghong He
mdpi.com/journal/applsci
Intelligent Medicine and Health Care
Intelligent Medicine and Health Care
Editors
Chien-Hung Yeh
Xiaojuan Ban
Men-Tzung Lo
Wenbin Shi
Shenghong He
Editorial Office
MDPI
St. Alban-Anlage 66
4052 Basel, Switzerland
This is a reprint of articles from the Special Issue published online in the open access journal
Applied Sciences (ISSN 2076-3417) (available at: https://www.mdpi.com/journal/applsci/special
issues/WW038E1NMQ).
For citation purposes, cite each article independently as indicated on the article page online and as
indicated below:
Lastname, A.A.; Lastname, B.B. Article Title. Journal Name Year, Volume Number, Page Range.
© 2024 by the authors. Articles in this book are Open Access and distributed under the Creative
Commons Attribution (CC BY) license. The book as a whole is distributed by MDPI under the terms
and conditions of the Creative Commons Attribution-NonCommercial-NoDerivs (CC BY-NC-ND)
license.
Contents
Basem Assiri
A Modified and Effective Blockchain Model for E-Healthcare Systems
Reprinted from: Appl. Sci. 2023, 13, 12630, doi:10.3390/app132312630 . . . . . . . . . . . . . . . . 33
Ming-Hung Chang, Yi-Chao Wu, Hsi-Yu Niu, Yi-Ting Chen and Shu-Han Juang
Cross-Platform Gait Analysis and Fall Detection Wearable Device
Reprinted from: Appl. Sci. 2023, 13, 3299, doi:10.3390/app13053299 . . . . . . . . . . . . . . . . . 108
Salaki Reynaldo Joshua, Wasim Abbas, Je-Hoon Lee and Seong Kun Kim
Trust Components: An Analysis in The Development of Type 2 Diabetic Mellitus Mobile
Application
Reprinted from: Appl. Sci. 2023, 13, 1251, doi:10.3390/app13031251 . . . . . . . . . . . . . . . . . 123
Elissaveta Zvetkova, Eugeni Koytchev, Ivan Ivanov, Sergey Ranchev and Antonio Antonov
Biomechanical, Healing and Therapeutic Effects of Stretching: A Comprehensive Review
Reprinted from: Appl. Sci. 2023, 13, 8596, doi:10.3390/app13158596 . . . . . . . . . . . . . . . . . 159
v
applied
sciences
Article
Predicting Co-Occurring Mental Health and Substance Use
Disorders in Women: An Automated Machine Learning
Approach
Nirmal Acharya 1, *, Padmaja Kar 2 , Mustafa Ally 3 and Jeffrey Soar 3
Abstract: Significant clinical overlap exists between mental health and substance use disorders, espe-
cially among women. The purpose of this research is to leverage an AutoML (Automated Machine
Learning) interface to predict and distinguish co-occurring mental health (MH) and substance use
disorders (SUD) among women. By employing various modeling algorithms for binary classification,
including Random Forest, Gradient Boosted Trees, XGBoost, Extra Trees, SGD, Deep Neural Network,
Single-Layer Perceptron, K Nearest Neighbors (grid), and a super learning model (constructed by
combining the predictions of a Random Forest model and an XGBoost model), the research aims
to provide healthcare practitioners with a powerful tool for earlier identification, intervention, and
personalised support for women at risk. The present research presents a machine learning (ML)
methodology for more accurately predicting the co-occurrence of mental health (MH) and substance
use disorders (SUD) in women, utilising the Treatment Episode Data Set Admissions (TEDS-A) from
the year 2020 (n = 497,175). A super learning model was constructed by combining the predictions of
a Random Forest model and an XGBoost model. The model demonstrated promising predictive per-
formance in predicting co-occurring MH and SUD in women with an AUC = 0.817, Accuracy = 0.751,
Citation: Acharya, N.; Kar, P.; Ally, Precision = 0.743, Recall = 0.926 and F1 Score = 0.825. The use of accurate prediction models can
M.; Soar, J. Predicting Co-Occurring
substantially facilitate the prompt identification and implementation of intervention strategies.
Mental Health and Substance Use
Disorders in Women: An Automated
Keywords: mental health; substance use disorder; machine learning; AutoML
Machine Learning Approach. Appl.
Sci. 2024, 14, 1630. https://doi.org/
10.3390/app14041630
2
Appl. Sci. 2024, 14, 1630
chosen, an algorithm is set up to facilitate training, seamlessly leading into the automated
testing phase. This yields immediate access to ML insights, including model predictions
and performance metrics, enabling researchers to employ validated models for forecasting
or analysing various phenomena. Typically, AutoML workflows initiate with basic ML
algorithms known for their simplicity, user-friendliness, and rigorously evaluated models
such as k-nearest neighbors and decision trees. As the analysis demands more intricate
scrutiny, more complex alternatives like boosted trees or deep learning (e.g., XGBoost) come
into play for analysis and evaluation. Diverse intricate models can be crafted, often formed
as ensembles—a fusion of basic models leveraging the strengths of each component while
mitigating individual weaknesses. The synergy within an ensemble of algorithms aims
to enhance overall predictive power and model robustness. ML techniques, particularly
AutoML have led to improved granularity and accuracy in various studies [11,22,27]. This
research capitalises on the power of AutoML to automate the process of model selection,
hyperparameter tuning, and feature engineering, streamlining the analytical process and
enhancing the predictive accuracy of the models. the super learning (SL) model has the
potential to distinguish women with co-occurring disorders from those without. The super
learning algorithm is a supervised learning method that uses a loss-based approach to
choose the best combination of prediction algorithms [28]. The method achieves asymptotic
performance comparable to the optimal weighted combination of the basic learners, making
it a highly effective strategy for addressing various issues using the same technique; it can
reduce the probability of over-fitting during the training process, employing a modified
version of cross-validation [29,30]. The area under the curve (AUC) value of 0.817 achieved
by the super learning model attests to its efficacy in capturing intricate patterns within the
data, underscoring its potential as a robust diagnostic tool.
This study serves as an illustration of advanced statistical methods and machine
learning techniques harnessed through Dataiku, an AutoML interface, in a real-world
healthcare setting. It showcases the platform’s ability to automate essential operations such
as selecting models, optimising hyperparameters, and engineering features. The super
learning model, which combines Random Forest and XGBoost, has superior performance
compared to separate algorithms. It serves as a diagnostic tool for early detection of co-
occurring disorders in women. The study emphasises the potential advantages of these
powerful predictive models. By leveraging these technological advancements, we aim
to bridge the gap between data-driven innovation and clinical practice. SUDs are often
inadequately addressed in women [31,32]. The insight of the study holds the potential
to develop a tool for early identification of mental health and SUDs in women. The
findings also offer valuable insights that can inform future research and collaborations with
policymakers, medical associations, and patient advocacy groups to develop guidelines
for responsible integration and optimise the model’s potential advantages while ensuring
patient well-being and privacy protection.
The structure of the paper is as follows. The next section presents the materials and
methods, covering the description of the dataset, the machine learning models utilised,
and the statistical method employed. This is followed by the Section 3, where the findings
of the study are presented. The Section 4 then follows, which discusses the application,
limitations, and future prospects of the study. Finally, the paper concludes with the
implications, recommendations for further research, and conclusions.
3
Appl. Sci. 2024, 14, 1630
that may captivate addiction researchers in practical situations [34,35]. TEDS provides
comprehensive statistics regarding admissions and discharges from substance use disorder
treatment programs across participating states. However, the analysis for the year 2020
had to exclude Oregon, North Dakota, Idaho, and Washington due to inadequate data
reporting. Notably, some states contribute data that document multiple admissions for the
same individual, shaping statistical analyses to accurately portray admissions rather than
individual clients [36]. The dependent variable in this study was co-occurring mental health
and substance use disorder which is coded as PSYPROB (1 = Yes, 2 = No) in the dataset.
As we focused on women, we extracted the records where the client’s biological sex was
female (n = 497,175). We then conducted data pre-processing which consisted of three steps.
First, we conducted listwise deletion for records with missing values at the dependent
variable and thirty-seven relevant predictors that include PSYPROB, STFIPS, SERVICES,
PREG, IDU, EMPLOY, EDUC, ETHNIC, LIVARAG, BARBFLG, MARFLG, DSMCRIT,
AGE, MARSTAT, RACE, PSOURCE, AMPHFLG, ALCDRUG, STIMFLG, MTHAMFLG,
ALCFLG, SEDHPFLG, INHFLG, OTCFLG, PCPFLG, HALLFLG, OPSYNFLG, BENZFLG,
TRNQFLG, METHFLG, COKEFLG, HERFLG, OTHERFLG, METHUSE, FRSTUSE1, SUB1,
SUB2, SUB3, NOPRIOR. Records with incomplete data in any of the predictors, the outcome,
or characteristics used for defining inclusion in the study were excluded from the analysis.
Second, outlier detection was performed using the analyse function in Dataiku at the
dependent variable and all relevant predictors. The outliers were handled by performing
listwise deletion, leaving us a final analytic sample (n = 132,128). Finally, each feature
was processed using target encoding, in which its original value was substituted with
a numerical value derived from the target values. Within the dataset, several features
exhibit different units and scales. This trend could result in certain features having a
more significant influence on the learning algorithm compared to others, thus potentially
introducing bias. To tackle this issue, we employed the min–max normalisation technique
to standardise all the features, consequently ensuring that they are within a consistent
range, typically ranging from 0 to 1 [37]. This ensures that all features contribute equally
to the model. The provided sample was then randomly split into two sets: a training set
comprising 80% of the sample (n = 105,760), and a test set consisting of the remaining 20%
(n = 26,368).
As the data utilised in this study were sourced from publicly available information
without any subject identification, the research design and methodology were determined
to be exempt from ethics review.
4
Appl. Sci. 2024, 14, 1630
The optimal analytic approach for forecasting the co-occurrence of mental health and
substance abuse disorders was determined to be the model that maximises the Area Under
the Curve (AUC) [46]. The AUC is a useful metric for evaluating prediction accuracy. It
represents the likelihood that a randomly selected successful patient will be ranked higher
than a randomly selected unsuccessful patient by any of the algorithms. The AUC (area
under the curve) metric measures the performance of a prediction model, with values
ranging from 0 to 1. AUC = 1 indicates a perfect forecast, while AUC = 0.5 suggests that
the prediction is no better than chance.
Dataiku’s ML diagnostics feature was enabled to conduct comprehensive checks on
the dataset, modelling parameters, training speed, overfitting, leakage, model checks,
ML assertions, and abnormal predictions. A Bayesian search strategy was employed to
5
Appl. Sci. 2024, 14, 1630
optimise the hyperparameters of the machine learning models. The search was guided by a
probabilistic model that intelligently selected hyperparameter combinations for evaluation.
The goal was to find the best-performing set of hyperparameters for the model’s task. The
search process was limited to exploring five different combinations of hyperparameters.
This approach allowed for an efficient and systematic exploration of the hyperparameter
space, leading to improved model performance. A super learning model was constructed
by combining the predictions of a Random Forest model and an XGBoost model using the
“average” method. Each model was trained independently on the training data to capture
distinct patterns and relationships. During prediction, the outputs of both models were
averaged for each data point, resulting in a final prediction for the super learning model.
This approach leverages the strengths of both Random Forest and XGBoost, providing
a potentially more robust and accurate prediction by blending the insights from these
two diverse algorithms.
3. Results
Dataiku automatically ranks the best-performing interpretable model based on the
set performance metric (AUC in this case). The characteristics of the sample are presented
in Table 1. It presents an overview of a cohort and only includes gender as well as the top
10 predictors of PSYPROB that were selected as the most essential based on the Shapley
values, to keep it concise. The cohort predominantly resides in states such as New York,
Colorado, and Illinois. Notably, around 30% had no prior treatment episodes, reflecting a
significant proportion seeking treatment for the first time, while diverse referral sources:
individuals, legal systems, and community referrals highlight the multifaceted pathways to
treatment. In terms of race, the majority of the individuals classified themselves as Black or
African American (73.9%), and a range of substance use patterns emerged, encompassing
various substances across primary, secondary, and tertiary categories. The significant un-
employment rate of 52.3% among the cohort highlights the possible socioeconomic factors
at play. The substances encompass alcohol, cocaine/crack, marijuana/hashish, prescription
opiates/synthetics, methamphetamine/speed, and various other substances. The diag-
nostic data indicated a significant occurrence of opioid dependence, with a prevalence
rate of 33.8%. Additionally, around 26.3% of individuals received medication-assisted
opioid therapy.
6
Appl. Sci. 2024, 14, 1630
Table 1. Cont.
Table 2 shows the performance matrices for each algorithm applied for binary clas-
sification in the test set (N = 26,368). The primary evaluation criterion in this study was
the AUC. All AUC values were between 0.631 and 0.817. This range signifies the proba-
bility that any of the algorithms would correctly rank a randomly selected woman with
co-occurring mental and substance use disorders higher than one without such disorders.
As hypothesised, the super learning model showed the largest AUC of 0.817, demonstrat-
ing robust predictive capability. The performance of the super learning model is closely
7
Appl. Sci. 2024, 14, 1630
followed by XGBoost with an AUC of 0.809. Several ensemble techniques such as Random
Forest (AUC = 0.807) and Extra Trees (AUC = 0.803) showed significant discriminatory
ability, closely following the top-performing algorithm. Meanwhile, conventional methods
such as Gradient-Boosted Trees (AUC = 0.799) and Single-Layer Perceptron (AUC = 0.776)
demonstrated comparable but slightly lower AUC scores. Nevertheless, the utilisation of K
Nearest Neighbors (grid) resulted in a relatively reduced prediction accuracy, as indicated
by an AUC of 0.670. Interestingly, models employing Deep Neural Network architecture ex-
hibited the least satisfactory performance among the investigated algorithms, achieving an
AUC of 0.631. These findings highlight the superiority of the proposed ensemble-based ap-
proach in achieving higher AUC values and therefore more successful binary classification
in this experimental environment.
Table 2. Performance matrices for each algorithm applied for binary classification in the test set
(N = 26,368).
Table 3 provides the mean and standard deviation (SD) for each metric across the
different models. Based on these findings, the “super learning” model emerged as the
better option, as it consistently demonstrated high performance across various parameters
with low variability.
Table 3. Mean and standard deviation of key metrics across the evaluated models.
Figure 2 provides a visual representation of the AUC in the test set (n = 26,368) for
the super learning model. The super learning model exhibited the highest AUC among all
models, boasting an AUC of 0.817, a score that is typically considered a strong performance
for prediction models. Other matrices for the super learning model include accuracy (0.751),
precision (0.743), recall (0.926), and F1 score (0.825). This outcome underscores the model’s
strong ability to distinguish women with co-occurring mental and substance use disorders
from those without.
8
Appl. Sci. 2024, 14, 1630
Figure 2. Distribution of the performance metric (AUC) of the super learning model.
4. Discussion
This section offers a discussion on the application, limitations, and future prospects
of the research findings. It highlights the practical implications, potential challenges, and
opportunities for further progress in the field of co-occurring mental health and substance
use disorders among women.
4.1. Application
In ML, classification emerges as a vital task, involving the nuanced prediction of
target classes for individual data instances [28]. Achieving optimal performance on diverse
datasets necessitates the careful selection of suitable individual classifiers. The challenge
lies in pinpointing the most suitable data mining or machine learning model tailored
to a specific problem. To tackle this complexity, researchers often deploy an array of
models to ascertain the utmost performance for a given scenario. AutoML platforms
are appealing pre-packaged tools for constructing predictive models using healthcare
data [47]. In this study, the AutoML interface was employed to forecast and differentiate
the simultaneous co-occurrence of mental health and substance use disorders in women.
Notably, AutoML consistently demonstrates enhanced precision and specificity across
various research investigations. By harnessing the capabilities of AutoML, the study
automated critical tasks encompassing model selection, hyperparameter optimisation, and
feature engineering. This streamlined approach simplifies the analytical pipeline and
substantiates an elevation in the accuracy of prediction models, marking a promising stride
within computational health research.
9
Appl. Sci. 2024, 14, 1630
A significant novelty of our study represents the statistical analysis using an AutoML
interface to predict co-occurring mental and substance use disorders in women. This
research serves as a practical demonstration of the presented statistical methods utilising
an AutoML interface within a real-world context, offering valuable insights into predictive
analytics in health-related domains.
The research finding showcasing the performance of the super learning model in
distinguishing co-occurring mental health and substance use disorders among women car-
ries substantial potential for transformative impact. The super learning model comprised
two base learners: a Random Forest model and an XGBoost model, and it outperformed
the individual base learners.The super learning model’s accuracy in identification would
enable early identification of women at risk of co-occurring mental health and substance
use disorders. Women who receive substance use treatment that is tailored to their gender
experience a longer duration of stay in treatment and have a higher probability of main-
taining abstinence after completing treatment [1]. The emergence of more accurate and
timely diagnosis has significant consequences for the development of improved treatment
techniques, aimed at reducing the complications, illness, and death associated with these
disorders [26]. Efficient resource allocation would be facilitated as the model’s precision
allows healthcare providers to focus on those at elevated risk, ensuring that support and
treatment resources are channelled where they can yield the most significant benefits.
4.2. Limitations
AutoML platforms expedite the process of developing machine learning pipelines, and
the models they produce can be used as initial frameworks for constructing predictive mod-
els. It is crucial to approach the integration of such a model with caution when determining
the best output levels based on the research topic, considering ethical considerations, data
security, and ongoing clinical supervision to ensure that its application aligns with the
envisioned positive impact. The study acknowledges the fear of stigma preventing women
from seeking substance abuse treatment, leading to a lower likelihood of them pursuing
help compared to men [10], highlighting a potential limitation in real-world application.
Although the super learning model demonstrated enhanced accuracy in differentiating
co-occurring mental health and substance use disorders in women, it is important to ac-
knowledge its limitations, including the possibility of false positives and false negatives.
While the potential benefits of early identification and intervention on the health of women
are promising, it may require time for these effects to become evident. Further research
and validation are necessary to validate and measure these possible long-term benefits.
10
Appl. Sci. 2024, 14, 1630
research methodologies and shape policy choices in the field of co-occurring mental health
and substance use problems.
5. Conclusions
This study investigated the potential of AutoML for predicting co-occurring mental
health and substance use disorders among women using TEDS-A data for 2020. Employing
advanced statistical and machine learning techniques through Dataiku’s AutoML interface,
a super learning model achieved a high AUC of 0.817, demonstrating robust predictive
capability. These findings highlight the promise of AutoML in healthcare, particularly the
super learning model’s potential as a diagnostic tool for early identification of co-occurring
disorders in women. Future research should focus on disseminating knowledge about Au-
toML’s advantages and ethical considerations in healthcare integration. Collaboration with
policymakers, medical associations, and patient advocacy groups is crucial for establishing
guidelines on responsible implementation, data privacy, and continuous performance mon-
itoring. This holistic approach ensures maximising the model’s benefits while adhering to
the highest ethical standards, safeguarding patient well-being and privacy.
Author Contributions: Conceptualisation, N.A. and P.K.; methodology, N.A. and P.K.; software, N.A.
and P.K.; validation, N.A., P.K., M.A. and J.S.; formal analysis, N.A. and P.K.; writing—original draft
preparation, N.A. and P.K.; writing—review and editing, N.A., P.K., M.A. and J.S.; supervision, M.A.
and J.S.; project administration, P.K. All authors have read and agreed to the published version of
the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Publicly available datasets were analyzed in this study. The data can
be found here: https://www.datafiles.samhsa.gov/dataset/treatment-episode-data-set-admissions-
2020-teds-2020-ds0001 (accessed on 2 June 2023).
Conflicts of Interest: Author Padmaja Kar was employed by the company St Vincent’s Care Services.
The remaining authors declare that the research was conducted in the absence of any commercial or
financial relationships that could be construed as a potential conflict of interest.
References
1. Louison, L.; Green, S.L.; Bunch, S.; Scheyett, A. The problems no one wants to see: Mental illness and substance abuse among
women of reproductive age in North Carolina. North Carol. Med. J. 2009, 70, 454–458. [CrossRef]
2. Stewart, D.; Ashraf, I.; Munce, S. Women’s mental health: A silent cause of mortality and morbidity. Int. J. Gynecol. Obstet. 2006,
94, 343–349. [CrossRef] [PubMed]
3. Kokane, S.S.; Perrotti, L.I. Sex Differences and the Role of Estradiol in Mesolimbic Reward Circuits and Vulnerability to Cocaine
and Opiate Addiction. Front. Behav. Neurosci. 2020, 14, 74. [CrossRef] [PubMed]
4. McCaul, M.E.; Roach, D.; Hasin, D.S.; Weisner, C.; Chang, G.; Sinha, R. Alcohol and women: A brief overview. Alcohol. Clin. Exp.
Res. 2019, 43, 774. [CrossRef]
5. Fox, H.C.; Sinha, R. Sex differences in drug-related stress-system changes: Implications for treatment in substance-abusing
women. Harv. Rev. Psychiatry 2009, 17, 103–119. [CrossRef] [PubMed]
6. Prieto-Arenas, L.; Díaz, I.; Arenas, M.C. Gender differences in dual diagnoses associated with cannabis use: A review. Brain Sci.
2022, 12, 388. [CrossRef]
7. Ruiz, M.A.; Douglas, K.S.; Edens, J.F.; Nikolova, N.L.; Lilienfeld, S.O. Co-occurring mental health and substance use problems in
offenders: Implications for risk assessment. Psychol. Assess. 2012, 24, 77–87. [CrossRef]
8. Forster, M.; Rogers, C.J.; Tinoco, S.; Benjamin, S.; Lust, K.; Grigsby, T.J. Adverse childhood experiences and alcohol related
negative consequence among college student drinkers. Addict. Behav. 2023, 136, 107484. [CrossRef]
9. Larsen, J.L.; Johansen, K.S.; Mehlsen, M.Y. What kind of science for dual diagnosis? A pragmatic examination of the enactive
approach to psychiatry. Front. Psychol. 2022, 13, 825701. [CrossRef]
10. Agterberg, S.; Schubert, N.; Overington, L.; Corace, K. Treatment barriers among individuals with co-occurring substance use
and mental health problems: Examining gender differences. J. Subst. Abus. Treat. 2020, 112, 29–35. [CrossRef]
11. Acion, L.; Kelmansky, D.; van der Laan, M.; Sahker, E.; Jones, D.; Arndt, S. Use of a machine learning framework to predict
substance use disorder treatment success. PLoS ONE 2017, 12, e0175383. [CrossRef]
11
Appl. Sci. 2024, 14, 1630
12. Miranda, O.; Fan, P.; Qi, X.; Wang, H.; Brannock, M.D.; Kosten, T.R.; Ryan, N.D.; Kirisci, L.; Wang, L. DeepBiomarker2: Prediction
of Alcohol and Substance Use Disorder Risk in Post-Traumatic Stress Disorder Patients Using Electronic Medical Records and
Multiple Social Determinants of Health. J. Pers. Med. 2024, 14, 94. [CrossRef]
13. Adams, R.S.; Jiang, T.; Rosellini, A.J.; Horváth-Puhó, E.; Street, A.E.; Keyes, K.M.; Cerdá, M.; Lash, T.L.; Sørensen, H.T.; Gradus, J.L.
Sex-Specific Risk Profiles for Suicide Among Persons with Substance Use Disorders in Denmark. Addiction 2021, 116, 2882–2892.
[CrossRef]
14. Aishwarya, N.; Yathishan, D.; Alageswaran, R.; Manivannan, D. AutoML Based IoT Application for Heart Attack Risk Prediction.
In Proceedings of the Decision Intelligence Solutions, Singapore, 2–3 March 2023; 2023; pp. 19–29.
15. Kundu, A.; Chaiton, M.; Billington, R.; Grace, D.; Fu, R.; Logie, C.; Baskerville, B.; Yager, C.; Mitsakakis, N.; Schwartz, R. Machine
Learning Applications in Mental Health and Substance Use Research Among the LGBTQ2S+ Population: Scoping Review. JMIR
Med Inf. 2021, 9, e28962. [CrossRef] [PubMed]
16. Johnstone, S.; Dela Cruz, G.A.; Kalb, N.; Tyagi, S.V.; Potenza, M.N.; George, T.P.; Castle, D.J. A systematic review of gender-
responsive and integrated substance use disorder treatment programs for women with co-occurring disorders. Am. J. Drug
Alcohol Abus. 2023, 49, 21–42. [CrossRef] [PubMed]
17. Waring, J.; Lindvall, C.; Umeton, R. Automated machine learning: Review of the state-of-the-art and opportunities for healthcare.
Artif. Intell. Med. 2020, 104, 101822. [CrossRef] [PubMed]
18. Obermeyer, Z.; Powers, B.; Vogeli, C.; Mullainathan, S. Dissecting racial bias in an algorithm used to manage the health of
populations. Science 2019, 366, 447–453. [CrossRef] [PubMed]
19. Mustafa, A.; Rahimi Azghadi, M. Automated Machine Learning for Healthcare and Clinical Notes Analysis. Computers 2021,
10, 24. [CrossRef]
20. Beam, A.L.; Kohane, I.S. Big Data and Machine Learning in Health Care. JAMA 2018, 319, 1317–1318. [CrossRef] [PubMed]
21. Rajkomar, A.; Dean, J.; Kohane, I. Machine learning in medicine. N. Engl. J. Med. 2019, 380, 1347–1358. [CrossRef]
22. Tsamardinos, I.; Charonyktakis, P.; Papoutsoglou, G.; Borboudakis, G.; Lakiotaki, K.; Zenklusen, J.C.; Juhl, H.; Chatzaki, E.;
Lagani, V. Just Add Data: Automated predictive modeling for knowledge discovery and feature selection. NPJ Precis. Oncol. 2022,
6, 38. [CrossRef] [PubMed]
23. Thomaidis, G.V.; Papadimitriou, K.; Michos, S.; Chartampilas, E.; Tsamardinos, I. A characteristic cerebellar biosignature for
bipolar disorder, identified with fully automatic machine learning. IBRO Neurosci. Rep. 2023, 15, 77–89. [CrossRef] [PubMed]
24. Naser, M.Z. Machine learning for all! Benchmarking automated, explainable, and coding-free platforms on civil and environmental
engineering problems. J. Infrastruct. Intell. Resil. 2023, 2, 100028. [CrossRef]
25. Perotte, A.; Pivovarov, R.; Natarajan, K.; Weiskopf, N.; Wood, F.; Elhadad, N. Diagnosis code assignment: Models and evaluation
metrics. J. Am. Med. Inf. Assoc. 2014, 21, 231–237. [CrossRef] [PubMed]
26. Zhuhadar, L.P.; Lytras, M.D. The Application of AutoML Techniques in Diabetes Diagnosis: Current Approaches, Performance,
and Future Directions. Sustainability 2023, 15, 13484. [CrossRef]
27. Barenholtz, E.; Fitzgerald, N.D.; Hahn, W.E. Machine-learning approaches to substance-abuse research: Emerging trends and
their implications. Curr. Opin. Psychiatry 2020, 33, 334–342. [CrossRef]
28. Kabir, M.F.; Ludwig, S.A. Enhancing the Performance of Classification Using Super Learning. Data-Enabled Discov. Appl. 2019,
3, 5. [CrossRef]
29. Van der Laan, M.J.; Rose, S. Targeted Learning: Causal Inference for Observational and Experimental Data; Springer: Berlin/Heidelberg,
Germany, 2011; Volume 4.
30. Laan, M.J.V.D.; Polley, E.C.; Hubbard, A.E. Super Learner. Stat. Appl. Genet. Mol. Biol. 2007, 6. [CrossRef]
31. Comartin, E.B.; Burgess-Proctor, A.; Harrison, J.; Kubiak, S. Gender, Geography, and Justice: Behavioral Health Needs and Mental
Health Service Use Among Women in Rural Jails. Crim. Justice Behav. 2021, 48, 1229–1242. [CrossRef]
32. Zhao, Q.; Kong, Y.; Henderson, D.; Parrish, D. Arrest Histories and Co-Occurring Mental Health and Substance Use Disorders
Among Women in the USA. Int. J. Ment. Health Addict. 2023. [CrossRef]
33. SAMHSA. Treatment Episode Data Set Admissions (TEDS-A) 2020; SAMHSA: Rockville, MD, USA, 2023.
34. Standeven, L.R.; Scialli, A.; Chisolm, M.S.; Terplan, M. Trends in cannabis treatment admissions in adolescents/young adults:
Analysis of TEDS-A 1992 to 2016. J. Addict. Med. 2020, 14, e29–e36. [CrossRef]
35. Baird, A.; Cheng, Y.; Xia, Y. Use of machine learning to examine disparities in completion of substance use disorder treatment.
PLoS ONE 2022, 17, e0275054. [CrossRef]
36. Yang, J.C.; Roman-Urrestarazu, A.; Brayne, C. Differences in receipt of opioid agonist treatment and time to enter treatment
for opioid use disorder among specialty addiction programs in the United States, 2014–2017. PLoS ONE 2019, 14, e0226349.
[CrossRef]
37. Pozo-Luyo, C.A.; Cruz-Duarte, J.M.; Amaya, I.; Ortiz-Bayliss, J.C. Forecasting PM2.5 concentration levels using shallow machine
learning models on the Monterrey Metropolitan Area in Mexico. Atmos. Pollut. Res. 2023, 14, 101898. [CrossRef]
38. Egger, R. Software and tools. Applied Data Science in Tourism: Interdisciplinary Approaches, Methodologies, and Applications; Springer:
Cham, Switzerland, 2022; pp. 547–588.
39. Tapeh, A.T.G.; Naser, M.Z. Artificial Intelligence, Machine Learning, and Deep Learning in Structural Engineering: A Scientomet-
rics Review of Trends and Best Practices. Arch. Comput. Methods Eng. 2023, 30, 115–159. [CrossRef]
12
Appl. Sci. 2024, 14, 1630
40. Sahker, E.; Acion, L.; Arndt, S. National analysis of differences among substance abuse treatment outcomes: College student and
nonstudent emerging adults. J. Am. Coll. Health 2015, 63, 118–124. [CrossRef]
41. Glasheen, C.; Pemberton, M.R.; Lipari, R.; Copello, E.A.; Mattson, M.E. Binge drinking and the risk of suicidal thoughts, plans,
and attempts. Addict. Behav. 2015, 43, 42–49. [CrossRef]
42. Alang, S.M. Sociodemographic disparities associated with perceived causes of unmet need for mental health care. Psychiatr.
Rehabil. J. 2015, 38, 293. [CrossRef] [PubMed]
43. Huang, J.-C.; Tsai, Y.-C.; Wu, P.-Y.; Lien, Y.-H.; Chien, C.-Y.; Kuo, C.-F.; Hung, J.-F.; Chen, S.-C.; Kuo, C.-H. Predictive modeling of
blood pressure during hemodialysis: A comparison of linear model, random forest, support vector regression, XGBoost, LASSO
regression and ensemble method. Comput. Methods Programs Biomed. 2020, 195, 105536. [CrossRef]
44. Hong, W.; Zhou, X.; Jin, S.; Lu, Y.; Pan, J.; Lin, Q.; Yang, S.; Xu, T.; Basharat, Z.; Zippi, M. A comparison of XGBoost, random
forest, and nomograph for the prediction of disease severity in patients with COVID-19 pneumonia: Implications of cytokine and
immune cell profile. Front. Cell. Infect. Microbiol. 2022, 12, 819267. [CrossRef]
45. Meng, D.; Xu, J.; Zhao, J. Analysis and prediction of hand, foot and mouth disease incidence in China using Random Forest and
XGBoost. PLoS ONE 2021, 16, e0261629. [CrossRef] [PubMed]
46. Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [CrossRef]
47. Romero, R.A.A.; Deypalan, M.N.Y.; Mehrotra, S.; Jungao, J.T.; Sheils, N.E.; Manduchi, E.; Moore, J.H. Benchmarking AutoML
frameworks for disease prediction using medical claims. BioData Min. 2022, 15, 15. [CrossRef] [PubMed]
48. Apsley, H.B.; Vest, N.; Knapp, K.S.; Santos-Lozada, A.; Gray, J.; Hard, G.; Jones, A.A. Non-engagement in substance use treatment
among women with an unmet need for treatment: A latent class analysis on multidimensional barriers. Drug Alcohol Depend.
2023, 242, 109715. [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.
13
applied
sciences
Article
A Novel Criticality Analysis Method for Assessing Obesity
Treatment Efficacy
Shadi Eltanani 1, *, Tjeerd V. olde Scheper 1 , Mireya Muñoz-Balbontin 1 , Arantza Aldea 1 , Jo Cossington 2 , Sophie
Lawrie 2 , Salvador Villalpando-Carrion 3 , Maria Jose Adame 3 , Daniela Felgueres 3 , Clare Martin 1
and Helen Dawes 4
1 School of Engineering, Computing and Mathematics, Faculty of Technology, Design and Environment,
Oxford Brookes University, Wheatley Campus, Wheatley, Oxford OX33 1HX, UK;
[email protected] (T.V.o.S.); [email protected] (A.A.); [email protected] (C.M.)
2 Centre for Movement and Occupational Rehabilitation Sciences (MOReS), Oxford Brookes University,
Oxford OX3 0BP, UK; [email protected] (J.C.); [email protected] (S.L.)
3 Hospital Infantil de Mexico Federico Gomez, Mexico City 06720, Mexico;
[email protected] (S.V.-C.); [email protected] (M.J.A.);
[email protected] (D.F.)
4 National Institute for Health and Care Research (NIHR) Exeter Biomedical Research Centre,
University of Exeter, St Luke’s Campus, Exeter EX1 2LU, UK; [email protected]
* Correspondence: [email protected]
Abstract: Human gait is a significant indicator of overall health and well-being due to its dependence
on metabolic requirements. Abnormalities in gait can indicate the presence of metabolic dysfunction,
such as diabetes or obesity. However, detecting these can be challenging using classical methods,
which often involve subjective clinical assessments or invasive procedures. In this work, a novel
methodology known as Criticality Analysis (CA) was applied to the monitoring of the gait of
Citation: Eltanani, S.; olde Scheper, teenagers with varying amounts of metabolic stress who are taking part in an clinical intervention
T.V.; Muñoz-Balbontin, M.; Aldea, A.; to increase their activity and reduce overall weight. The CA approach analysed gait using inertial
Cossington, J.; Lawrie, S.; measurement units (IMU) by mapping the dynamic gait pattern into a nonlinear representation
Villalpando-Carrion, S.; Adame, M.J.; space. The resulting dynamic paths were then classified using a Support Vector Machine (SVM)
Felgueres, D; Martin, C.; et al. A algorithm, which is well-suited for this task due to its ability to handle nonlinear and dynamic
Novel Criticality Analysis Method data. The combination of the CA approach and the SVM algorithm demonstrated high accuracy
for Assessing Obesity Treatment and non-invasive detection of metabolic stress. It resulted in an average accuracy within the range
Efficacy. Appl. Sci. 2023, 13, 13225.
of 78.2% to 90%. Additionally, at the group level, it was observed to improve fitness and health
https://doi.org/10.3390/
during the period of the intervention. Therefore, this methodology showed a great potential to be a
app132413225
valuable tool for healthcare professionals in detecting and monitoring metabolic stress, as well as
Academic Editors: Arkady Voloshin, other associated disorders.
Chien-Hung Yeh, Wenbin Shi,
Xiaojuan Ban, Men-Tzung Lo and Keywords: human gait; criticality analysis; support vector machine
Shenghong He
in gait, particularly when confronted with intricate patterns stemming from underlying
pathologies [8].
In the contemporary landscape of medical diagnostics, artificial intelligence (AI) has
emerged as a promising frontier, bearing the capacity to substantially enhance gait analysis
and expedite the detection of gait-related disorders [9,10]. Its proficiency in the processing
of extensive datasets, identification of latent patterns, and facilitation of early diagnoses
offers substantial promise [11]. Nonetheless, AI grapples with intricate challenges in the
realm of gait analysis, stemming from the dynamic nature of gait itself and the intricate
interplay of a multitude of contributory factors [11].
In response to these exigencies, this paper introduces Criticality Analysis (CA) as an
innovative and robust AI tool primed for precise identification of abnormal or irregular
gait patterns, thereby indicating the presence of latent disorders or ailments. Beyond
its diagnostic prowess, CA serves as a dynamic tool for the continuous monitoring of
disorder progression and the systematic tracking of treatment efficacy over temporal
trajectories. This innovative paradigm promises to empower medical practitioners in their
clinical decision making in relation to gait-related disorders, ultimately effecting substantial
enhancements in patient outcomes and the broader landscape of healthcare.
The paper is structured as follows: Section 2 outlines a methodology employing
mathematical models to assess critical aspects of human gait, emphasising the integration of
mathematical modeling and criticality analysis in the context of gait study. In Section 3, the
methodology applied to the Criticality Analysis of Diabetic Gait in Children (CARDIGAN)
dataset is detailed, covering data collection, feature extraction, criticality analysis data
representation, and spatiotemporal analysis. Section 4 explores the experimental results
of the CARDIGAN dataset, including the analysis of Receiver Operating Characteristic
(ROC) Curves, the calculation of Area Under the Curve (AUC), and the determination of
the decision boundary of the support vector method. Section 5 provides a concise summary
of the overarching results, while Section 6 concludes by highlighting the key findings of
the research.
15
Appl. Sci. 2023, 13, 13225
The model utilises a control mechanism to stabilise external perturbations to the motor
system by precisely calibrating the quantity of enzyme in relation to the concentration
of one of the variables, f . The model also represents the control process of two enzymes
that govern the formation of the extracellular matrix, m, from soluble filaments, f . The
proteinase, p, deconstructs the matrix into filaments, while transglutaminase, g, reassembles
the filaments into the matrix. The extracellular matrix, m, is continually generated by
adjacent cells, rim , at a constant rate, with each protein undergoing catalytic processes
proportional to p. The bifurcation parameter, rim , acts as an external turbulent input to
the control model. The dynamics governing the rate of enzyme production, specifically
enzymes p and g, are influenced by the Rate Control of Chaos (RCC). This approach
employs a series of nonlinear rate equations, as illustrated in Equations (4)–(7), to describe
the temporal evolution of system variables. A key control term in these rate equations
contains variable f , which has a strong nonlinear influence on the dynamics. The RCC
confines this control term using a rate control function, as depicted in Equation (1), that
restricts its divergence rate, thereby stabilising the overall system behavior. The adjustable
parameters in the rate control function allow tuning the intensity of control applied to
the chaotic dynamics. Meanwhile, the criticality analysis involves examining the phase
space representation of outputs f and m, which correspond to concentrations of soluble
filaments and extracellular matrix, respectively. The phase space plot with f on the x-
axis and m on the y-axis illustrates the time-dependent evolution of nonlinear dynamics.
As parameters are varied, the system exhibits complex phenomena, including bistability,
limit cycles, spiralling trajectories, and chaos, particularly near critical transition points.
Analysing the geometric patterns within the phase portrait provides valuable insights into
the mechanisms underpinning self-organised criticality. Characteristics such as the number
and stability of fixed points, oscillations, excitability, and susceptibility to perturbations
can be deduced from phase space topology. Additionally, the fractal-like features within
the phase portrait unveil the self-similar, scale-invariant nature of critical fluctuations.
Moreover, this representation facilitates the quantification of nonlinear correlations that
capture the intricacies of coupled dynamics. Therefore, phase space-based criticality
analysis unveils the system’s rich nonlinear behavior, phase transitions, and emergent
complexity resulting from self-organised criticality.
f
qf = , (1)
f + μf
σp (q f ) = f p e( x p q f ) , (2)
σg (q f ) = f g e( x g q f ) , (3)
dm fg mp
= kg − + rim , (4)
dt KG + f 1+m
df fg mp fp
= −k g + − , (5)
dt KG + f 1+m 1+ f
dp fn
= σp (q f )γ n − k a p2 , (6)
dt KR + f n
dg fl gp
= σg (q f ) β l − k deg . (7)
dt KS + f l Kdeg + g
16
Appl. Sci. 2023, 13, 13225
The CA method described in this paper has previously been applied in research, lever-
aging its capabilities to generate dynamic and scale-free nonlinear data representations,
which in turn facilitate the precise detection of disturbances associated with human gait [14].
Subsequently, CA combined these encoded representations with the SVM algorithm, en-
hancing superior detection accuracy. This synergy surpasses traditional methods that lack
the CA approach in terms of performance and robustness. Hence, this innovative CA
approach allows for the generation of nonlinear data representations that are well suited
for training conventional classifiers [15].
3. Methodology
The proposed CA method for classifying human gait disorders includes a framework
consisting of several key components, including data collection, data processing, feature
extraction, and the use of the SVM technique. This methodology is illustrated in Figure 1.
17
Appl. Sci. 2023, 13, 13225
Baseline Assessment
Clinical
Screening
Yes
Multi-dimensional and
Temporal Spatial
Phase Plot Analysis
SVM Classification
Prediction
18
Appl. Sci. 2023, 13, 13225
wearing an inertial measurement unit sensor on their lower back. The gait data were
anonymised, and approval was obtained before analysis.
Assessment was based on the use of an inertial measurement movement sensor (IMU),
placed on the fourth lumbar vertebra located on the top left of the anatomical position of
the lumber spine, known as the body Centre of Mass (CoM). The sensor was designed to
be incredibly flexible, providing for mobility in many different planes including flexion,
extension, side bending, and rotation. Gait analysis was conducted for participants using a
standardised 6 m test, wherein an IMU was attached to the lower back to capture triaxial
accelerometer and gyroscope data at a frequency of 100 Hz. For individuals with DPN
(Diabetic Peripheral Neuropathy), the assessment took place at OCDEM (Oxford Centre
for Diabetes, Endocrinology, and Metabolism) in a dedicated obstacle-free corridor. The
methodology employed for deriving gait parameters has been comprehensively described
in previous studies. The spatiotemporal parameters obtained from the 6 min walking
test encompassed step time (measured in milliseconds), cadence (expressed in steps per
minute), stride length (in meters), and walking speed (in meters per second). Furthermore,
gait control parameters, which encompass measures of dynamic stability and gait variabil-
ity, were evaluated utilising various instruments such as accelerometers, force plates, or
motion capture technology. These assessments aimed to quantify fluctuations in temporal
aspects (e.g., stride time), spatial aspects (e.g., step length), and comprehensive whole-body
kinematics (e.g., segment angles). The parameters assessed included Beta (expressed in
degrees), SDa (measured in arbitrary units), SDb (also in arbitrary units), ratio (in a dimen-
sionless unit), and walk ratio (in millimeters per steps per minute). These parameters have
been identified as indicators of neuro motor control [16,17]. The dynamics of their walking
activity were monitored using the Polar Team tracking system [18].
19
Appl. Sci. 2023, 13, 13225
and mean step length, offering insights into the neuromotor control of gait. Within the realm
of Phase Plot Analysis, distinctive parameters emerge: Beta Angle, a measure of stability,
is the angle of the primary gait phase plot axis relative to the vertical axis; SDa and SDb
represent standard deviations describing the distribution of phase plot points, with SDa
reflecting stability and SDb pertaining to rhythm; and Ratio (SDa/SDb) serves as a relative
rhythm stability indicator. The raw data from the accelerometer are processed using DGAS
software, which is founded on the inverted pendulum approach. This conversion results in
17 parameters that are used to assess the physical characteristics of each individual. These
17 gait features serve as inputs for perturbing the criticality analysis model, as represented
by Equations (4) and (7), respectively. Table 1 displays the list of the 17 gait parameters.
3.3. Gait Data Representation and Spatiotemporal Analysis Using Criticality Analysis
Criticality Analysis is a method used to represent complex multivariate data patterns
in a simplified form, typically in the form of a phase plot portrait or manifold. This method
involves analysing the data in multiple dimensions and identifying patterns or structures
that are most critical to understanding the underlying dynamics of the system. The ex-
tracted features by DGAS were used as perturbation inputs to the CA model represented
by Equations (4) and (7), respectively. This aided in gaining a deeper understanding of
the underlying mechanisms and dynamics of the system under study. The visual repre-
sentations of gait regulation and coordination between spatial and temporal domains are
depicted in Figures 2–7 through phase plot orbits. Well-regulated gait was characterised by
smooth and narrow orbits, whereas dysfunctional gait control was evident in irregular and
variable orbits. These phase plots served as a means to distinguish between healthy and
pathological gaits by evaluating the dynamics of spatiotemporal coordination. These phase
plots demonstrated clear differences between the gait patterns of the healthy control and
obesity groups over the 6-week period. Specifically, in the healthy control group, the phase
plots exhibited a consistent pattern characterised by smooth and regular oscillations with
steady amplitudes and frequencies. These findings were indicative of a well-maintained,
rhythmic gait pattern that demonstrated excellent coordination and balance. Notably,
the orbits in this group remained relatively narrow, which underscored the efficiency of
their biomechanics and the minimal occurrence of side-to-side body motion. In contrast,
the obesity group displayed phase plots that deviated from the healthy control group’s
pattern. These plots appeared more irregular and distorted, with variable amplitudes and
wider orbits, suggesting a compromised sense of balance and increased lateral swaying
20
Appl. Sci. 2023, 13, 13225
during gait. Furthermore, these phase plots exhibited more abrupt changes in direction,
indicating the need for sudden adjustments to maintain stability. These observations were
reflective of a slower and more effortful gait, likely resulting from the additional weight
burdening the joints and muscles in individuals with obesity. Moreover, both groups
exhibited a common trend of declining gait consistency over the 6-week observation period,
potentially attributable to the onset of fatigue effects. The healthy control group, despite
its initial robust gait pattern, displayed a gradual decrease in consistency, reflecting the
possibility of accumulating fatigue from repeated gait assessments. The obesity group,
already experiencing challenges in maintaining gait regularity, showed a similar decline in
consistency, accentuating the toll that prolonged observation sessions might take on their
gait patterns. This convergence in declining gait consistency underscores the importance
of considering potential fatigue factors in the interpretation of gait analysis results across
different population groups.
Examining the gait patterns across the 6-week period, in Figure 2 (Week 1), the healthy
group displayed a tight circular cluster, indicating consistent gait cycles. In contrast,
the obesity group showed more elongated, scattered orbits, reflecting a higher degree of
variability in gait. As we progressed to data depicted in Figure 3 (Week 2), the healthy group
continued to maintain a tight cluster, while the obesity group’s orbits, though still dispersed,
appeared somewhat more rounded, suggesting some improvement in gait coordination
compared to that of Week 1. Figure 4 (Week 3) portrayed the healthy group with a very
tight cluster, indicative of highly consistent gait, while the obesity group exhibited more
elongated orbits with flatter tops, indicating instability in their gait pattern. In Figure 5
(Week 4), the healthy control cluster became somewhat looser, possibly due to accumulating
fatigue. The obesity group’s orbits remained uneven but showed a slightly improved level
of coordination compared to Week 3. Figure 6 (Week 5) demonstrates the healthy control
group’s cluster becoming more dispersed, reflecting increasing gait variability. On the other
hand, the obesity group’s plots were highly scattered with jagged trajectories, suggesting a
worsening of gait control. Finally, in Figure 7 (Week 6), both groups display more dispersed
orbits than in previous weeks, indicating the potential impact of fatigue on gait consistency
in both groups. The obesity group’s orbits appeared slightly more rounded than in Week 5,
hinting at some recovery in coordination, although the overall trend indicated challenges
in maintaining consistent gait patterns.
Figure 2. Comparison of phase space plots of walk patterns for healthy control and obesity groups in
the clinical gait experiment conducted in w1 is presented. Healthy control walk patterns are shown
on the left while obesity walk patterns are shown on the right.
In this paper, we utilised a kernel SVM classifier to distinguish between the obesity
and healthy groups based on phase plot data. The choice of kernel SVM is particularly
well suited for this analysis due to the inherently nonlinear and complex nature of the data.
Phase plot data, representing dynamic patterns of physiological processes, often exhibit
intricate and nonlinear relationships. Traditional linear classifiers may struggle to capture
the patterns present in such data. However, the kernel SVM is designed to address this
challenge by mapping the data into a higher-dimensional space, where complex patterns
21
Appl. Sci. 2023, 13, 13225
become more separable. It leverages a diverse set of kernel functions, including radial
basis function (RBF), to effectively transform the data into a format where it can distinguish
between the obese and healthy groups. This approach enables the identification of hidden
patterns, making it an ideal choice for this study, and ensures that the classification approach
is capable of handling the inherent nonlinearity in the phase plot data, facilitating the
reliable and accurate differentiation of the two groups. Figures 16–18 demonstrate the
decision boundary generated by kernel SVM, highlighting its effectiveness in distinguishing
between obese and healthy control groups using phase plot data.
Figure 3. Comparison of phase space plots of walk patterns for healthy control and obesity groups in
the clinical gait experiment conducted in w2 is presented. Healthy control walk patterns are shown
on the left while obesity walk patterns are shown on the right.
Figure 4. Comparison of phase space plots of walk patterns for healthy control and obesity groups in
the clinical gait experiment conducted in w3 is presented. Healthy control walk patterns are shown
on the left while obesity walk patterns are shown on the right.
Figure 5. Comparison of phase space plots of walk patterns for healthy control and obesity groups in
the clinical gait experiment conducted in w4 is presented. Healthy control walk patterns are shown
on the left while obesity walk patterns are shown on the right.
22
Appl. Sci. 2023, 13, 13225
Figure 6. Comparison of phase space plots of walk patterns for healthy control and obesity groups in
the clinical gait experiment conducted in w5 is presented. Healthy control walk patterns are shown
on the left while obesity walk patterns are shown on the right.
Figure 7. Comparison of phase space plots of walk patterns for healthy control and obesity groups in
the clinical gait experiment conducted in w6 is presented. Healthy control walk patterns are shown
on the left while obesity walk patterns are shown on the right.
Figures 8 and 9 serve as invaluable tools for assessing the dynamic nature of gait
progression throughout the 6-week study period. These graphical representations offered a
comprehensive view of the data by plotting the peak values extracted from each phase plot
orbit as discrete data points for every week. Consequently, these visualisations effectively
generated trajectories that unveiled nuanced alterations in gait patterns over time. Fun-
damentally, each data point within these trajectories captured the maximum step length
achieved during a specific gait cycle, encapsulating the essence of gait performance. By
plotting these peak values across the 6-week observation window, an intuitive visual per-
spective emerged on how maximal step length evolved across multiple visits. Furthermore,
these peak values functioned as numerical metrics that concisely represented the range of
variability, which is an informative measure quantifying the degree of variation in maximal
step length from one week to the next. To provide a rigorous statistical summary of this
variation over time, standard deviation (SD) of the peak values for each participant was cal-
culated. This SD became a pivotal indicator, with a higher value signifying a greater degree
of inconsistency in the maximal step length achieved across different weeks. Consequently,
comparing SD values before and after the intervention yielded a quantitative assessment of
whether gait improved (resulting in a lower SD) or deteriorated (resulting in a higher SD).
This analytical approach enabled the precise quantification of the impact of the intervention
on gait stability and consistency, offering valuable insights into the effectiveness of the
intervention. In Figure 8, which pertains to healthy controls, the majority of trajectories
exhibited minimal fluctuation, remaining relatively level throughout the study duration.
This observation signified consistent gait patterns from week to week, characterised by
limited variation in peak values. Conversely, in Figure 9, representing the obesity group, the
trajectories displayed greater irregularity, featuring discernible peaks and troughs across
23
Appl. Sci. 2023, 13, 13225
the weeks. This pattern indicated increased instability in gait parameters, with significant
variations in the peak values across the different visits. As an illustrative example, partici-
pant P14 was considered. In Figure 8, P14’s trajectory remained consistently around 0.55,
demonstrating a steady gait with little variation over time. However, in Figure 9, P14’s
trajectory exhibited a drop from approximately 0.7 to 0.4 by Week 3 before subsequently
rebounding. This trajectory pattern suggested that P14’s gait became more irregular during
the course of the study but later exhibited improvement. Complementing these trajectories,
the standard deviation bars visually represented the extent of variability across the 6 weeks.
In Figure 8, P14’s standard deviation bars were notably small, confirming minimal fluctua-
tion and consistent gait during the pre-intervention period. In contrast, Figure 9 portrayed
larger standard deviation bars, indicating increased inconsistency in gait patterns when
P14 was in an obese state. Overall, these quantitative comparisons between Figures 8 and 9
provided valuable insights into the differences in gait stability and variability between
individuals with obesity and healthy controls. These trajectories, coupled with the standard
deviation bars, facilitated the rigorous tracking and assessment of gait changes for each
participant throughout the 6-week study period.
Figure 8. The advancement of normal walking patterns for each person over a 6-week period
is shown.
Figure 9. Tracking the improvement in gait for managing obesity for each person over a 6-week
period is shown.
4. Experiment Results
In this section, we present the findings of our experimental investigation, which
encompasses the performance of the SVM classifier in identifying gait patterns for both
24
Appl. Sci. 2023, 13, 13225
healthy control and obese groups. Additionally, we examine the impact of various Kernel
SVM model parameters on classification performance. A comprehensive analysis of the
generalisation performance of the SVM classifier is also presented, including the Receiver
Operating Characteristic (ROC) curve, the area under the ROC curve, and the SVM decision
classification boundary. The results demonstrate the potential of using SVM in combination
with a controlled CA model for accurate detection of gait patterns associated with healthy
controls and individuals with obesity.
Figure 10. The relationship between True Positive Rate (Sensitivity) and False Positive Rate
(1-Specificity) at various threshold levels, as determined by the kernel function of the SVM, is
displayed through the ROC curves of w1 and w2 .
Figure 11. The relationship between True Positive Rate (Sensitivity) and False Positive Rate
(1-Specificity) at various threshold levels, as determined by the kernel function of the SVM, is
displayed through the ROC curves of w3 and w4 .
25
Appl. Sci. 2023, 13, 13225
Figure 12. The relationship between True Positive Rate (Sensitivity) and False Positive Rate
(1-Specificity) at various threshold levels, as determined by the kernel function of the SVM, is
displayed through the ROC curves of w5 and w6 .
The ROC plots (Figures 10–12) show that, in the context of SVM, parameter C controls
the trade-off between maximising the margin and minimising the misclassification error.
When the value of C is smaller, such as C = 0.1, the margin becomes wider, but there are
more instances of misclassifications. Conversely, a larger value of C, such as C = 10, leads
to a narrower margin, but with a reduced number of misclassifications. Parameter σ is used
to control the width of the Kernel Gaussian function that is used to map the input data
into a higher-dimensional space, where a linear boundary can be found. A larger value of
σ results in a wider Gaussian function, which leads to a softer decision boundary and a
higher bias, while a smaller value of σ results in a narrower Gaussian function, which leads
to a harder decision boundary and a higher variance.
When σ is small, the decision boundary is more sensitive to input data, which can lead
to overfitting. On the other hand, when σ is large, the decision boundary is less sensitive
to input data, which can lead to underfitting. Therefore, the value of σ has an impact on
generalisation performance of the SVM.
A good value for C and σ is the one that balances the trade-off of bias and variance,
that is, a good balance between overfitting and underfitting.
The ROC curves shown in Figures 10–12 perform well with σ = 0.1 and 1 for various
values of C of the SVM, which is likely because the classifier is able to find a good balance
between overfitting and underfitting by adjusting the value of C and σ which in turn results
in good performance.
26
Appl. Sci. 2023, 13, 13225
regularisation strength helps to improve the model’s performance. On the other hand, if
the AUC decreases as C increases, it means that the model’s performance worsens as the
regularisation strength increases. This may suggest that the model is overfitting the data
when C is small and that increasing regularisation strength C causes the model to become
too simplistic and lose important information from the data.
The optimal value of C is where the AUC is the highest; this is the sweet spot where
the model is able to balance the trade-off between maximising the margin and minimising
the classification error in a way that leads to the best classification performance.
Figure 13. The relationship between the AROC and regularisation parameter C for w1 and w2
is presented.
Figure 14. The relationship between the AROC and regularisation parameter C for w3 and w4
is presented.
Figure 15. The relationship between the AROC and regularisation parameter C for w5 and w6
is presented.
27
Appl. Sci. 2023, 13, 13225
Figure 16. The boundary that separates the healthy control walk patterns from the obesity patterns in
an SVM model, with RCC control parameters f p = 1, f g = 1, x p =−1, x g =−1, μ f = 2, and σ = 0.1
and C = 0.1 for w1 and σ = 0.1 and C = 1 for w2 , is shown.
Figure 17. The boundary that separates the healthy control walk patterns from the obesity patterns in
an SVM model, with RCC control parameters f p = 1, f g = 1, x p =−1, x g =−1, μ f = 2, and σ = 0.1
and C = 1 for w3 and w4 , is shown.
The overall performance of the proposed SVM model is evaluated in Figure 19, where
the best classification parameters (σ = 0.1 and C = 0.1) result in the optimal generalisation
performance. The 6-week evaluation of the SVM shows fluctuating accuracy in classifying
participants into the healthy control and obesity groups. A high accuracy reflects consistent
participant characteristics, facilitating accurate classification by the SVM, whereas a low
accuracy indicates high variability in participant characteristics, making classification
28
Appl. Sci. 2023, 13, 13225
challenging. In individuals with obesity, the influence of various factors, including walk
speed, affects the results depicted in Figure 19. The figure demonstrates that the highest
accuracy is observed during the initial week, but experiences a significant decline in the
third week. Subsequently, there is a slight improvement in the fourth week, followed
by further declines in the fifth and sixth weeks. The fluctuation in the accuracy of the
SVM model during the 6-week period, despite the uniform diet and exercise regimen
followed by the participants, could be attributed to various reasons such as variations in
compliance levels, where some participants may have been more diligent in adhering to
the regimen than others, leading to different classifications into healthy control or obesity
groups. Other factors include individual differences such as genetics, medical history, and
personal habits, measurement inaccuracies, and changes in any of the systems (metabolic,
neuromuscular, cardiovascular) altering participant characteristics over time, even when
following the prescribed diet and exercise regimen. Participants’ stress levels or health
status could impact their classification as it could alter variables affecting their gait and
hence classification into healthy control or obesity groups.
Figure 18. The SVM decision boundary that separates the healthy control walk patterns from the
obesity patterns in an SVM model, with RCC control parameters f p = 1, f g = 1, x p =−1, x g =−1,
μ f = 2, and σ = 0.1 and C = 10 for w5 and σ = 0.1 and C = 1 for w6 , is shown.
Figure 19. The classification performance of SVM over a 6-week (w1 –w6 ) period is presented.
5. Discussion
The primary aim of the comprehensive study was to assess the effectiveness of obesity
treatment interventions through the application of a unique CA approach. Simultaneously,
29
Appl. Sci. 2023, 13, 13225
the investigation delved into the intricate dynamics of gait patterns, particularly within the
context of obesity. This multifaceted exploration sought to provide a deeper understanding
of gait patterns and evaluate the potential of the CA methodology combined with SVM
classification in gait analysis within the context of obesity treatment assessment. This
multidimensional exploration sought to not only quantify gait variability, but also ascertain
the robustness of our CA-SVM methodology for gait classification. Throughout the 6-week
study period, SVM classification accuracy exhibited significant fluctuations, with the
highest accuracy recorded in the initial week, followed by substantial declines in Weeks
3 through 6. These fluctuations highlight the dynamic nature of SVM model’s ability to
categorise gait patterns over time. Notably, these variations are not arbitrary but reflect
shifts in participant characteristics that significantly impact classification outcomes.
All participants adhered to a uniform diet and exercise regimen during the study,
making it evident that individual factors beyond the protocol influenced gait patterns
and SVM classifications each week. Factors such as compliance, genetics, medical history,
stress levels, or changes in metabolic/neuromuscular systems likely played a role. The
SVM accuracy metric robustly quantifies the influence of these individual characteristics
on classification performance. ROC curves and AUC values provide insights into the
model’s proficiency in distinguishing between healthy and obese gait patterns. Optimal
model performance occurred under specific parameter settings, σ = 0.1 and C = 1 or 10,
where AUC values were maximised. These results confirm the exceptional generalisation
capabilities of the SVM model when applied to unseen data and underscore the effectiveness
of our CA-SVM methodology in extracting relevant features from gait data.
Visual representations, in the form of phase plots, vividly illustrate the distinctions
between healthy and obese gait patterns achieved through the CA method. The phase plots
reveal that obese gait patterns are characterised by slower and more labored movements,
while healthy gait patterns are smoother and more fluid. These differences align with ex-
pectations, given the increased joint stress and stiffness associated with obesity. Phase plots
affirm that CA effectively distinguishes between the two groups by revealing interpretable
spatio-temporal gait characteristics.
The spatio-temporal analysis quantifying variability in gait progression over a
6-week period offers valuable insights. Notably, wider variability is observed among
obese participants compared to their healthy counterparts. This underscores CA’s capacity
to elucidate subtle yet evolving patterns through phase plot analysis and its competence in
quantifying gait variability. Lastly, classification accuracy results, ranging from 78.2% to
90%, strongly validate the efficacy of the CA method in dimensionality reduction and data
representation, enhancing classification performance. CA successfully transforms complex
gait patterns into lower-dimensional trajectories discernible by the SVM algorithm, leading
to high accuracy. This demonstrates CA’s proficiency in extracting essential features and
capturing nonlinear relationships, enabling precise classification and establishing it as an
effective data representation strategy.
6. Conclusions
The Criticality Analysis method for nonlinear data representation can be effectively
used to represent gait data and highlight medical conditions. The variability and detection
of changes over time highlight the ability of the method to determine changes in gait
in response to clinical intervention. The potential to assess clinical disorders using only
gait is an exciting development especially for long-term or complex disorders associated
with metabolic stress. The combination of nonlinear data representation with supervised
machine learning methods can significantly improve the assessment of a patient’s sta-
tus and improve the likelihood of positive outcomes by enabling objective assessment
during treatment.
30
Appl. Sci. 2023, 13, 13225
Author Contributions: Conceptualisation, S.E.; methodology, S.E. and T.V.o.S.; validation, S.E.;
formal analysis, S.E.; investigation, S.E.; data curation, H.D., J.C., A.A., C.M., M.M.-B., S.L., S.V.-C.,
M.J.A. and D.F.; writing—original draft preparation, S.E.; writing—review and editing, S.E. and
T.V.o.S.; visualisation, S.E.; supervision, T.V.o.S. All authors have read and agreed to the published
version of the manuscript.
Funding: This work was supported by a Newton Fund Institutional Links grant (grant ID: 432368181)
under the Newton–Mosharafa Fund partnership between the United Kingdom and Mexico. The
grant was funded by the UK Department for Business, Energy and Industrial Strategy (BEIS) and
delivered by the British Council. The funding supported collaboration activities between Oxford
Brookes University and Hospital Infantil de Mexico Federico Gomez under the Newton Institutional
Links program administered by the British Council.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: In accordance with the General Data Protection Regulation (GDPR)
guidelines, the database utilised in this study is maintained in a confidential and secure manner
within the purview of the Faculty of Health and Life Sciences at Oxford Brookes University. Owing
to privacy considerations, access to the dataset is restricted to authorised personnel only.
Acknowledgments: The authors gratefully acknowledge the support provided by the British Council
under the Newton Fund Institutional Links program for making this collaboration possible. We also
acknowledge our institutional partners Oxford Brookes University and Hospital Infantil de Mexico
Federico Gomez for their support and contribution.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Kuo, A.D.; Donelan, J.M. Dynamic principles of gait and their clinical implications. Phys Ther. 2010, 90, 157–174. [CrossRef]
[PubMed]
2. Clark, J.E.; Phillips, S.J. A longitudinal study of intralimb coordination in the first year of independent walking: A dynamical
systems analysis. Child Dev. 1993, 64, 1143–1157. [CrossRef] [PubMed]
3. Dingwell, J.B.; Cusumano, J.P. Nonlinear time series analysis of normal and pathological human walking. Chaos 2000, 10, 848–863.
[CrossRef] [PubMed]
4. Glazier, D.S. Metabolic Scaling in Complex Living Systems. Systems 2014, 2, 451–540. [CrossRef]
5. Hausdorff, J.M.; Mitchell, S.L.; Firtion, R.; Peng, C.K.; Cudkowicz, M.E.; Wei, J.Y.; Goldberger, A.L. Altered fractal dynamics of
gait: Reduced stride-interval correlations with aging and Huntington’s disease. J. Appl. Physiol. 1997, 82, 262–269. [CrossRef]
[PubMed]
6. Alam, U.; Riley, D.R.; Jugdey, R.S.; Azmi, S.; Rajbhandari, S.; D’Août, K.; Malik, R.A. Diabetic Neuropathy and Gait: A Review.
Diabetes Ther. 2017, 8, 1253–1264. [CrossRef] [PubMed]
7. Toro, B.; Nester, C.; Farren, P. A review of observational gait assessment in clinical practice. Physiother. Theory Pract. 2003, 19,
137–149. [CrossRef]
8. Pirker, W.; Katzenschlager, R. Gait disorders in adults and the elderly: A clinical guide. Wien. Klin. Wochenschr. 2017, 129, 81–95.
[CrossRef]
9. Sipari, D.; Chaparro-Rico, B.D.M.; Cafolla, D. SANE (Easy Gait Analysis System): Towards an AI-Assisted Automatic Gait-
Analysis. Int. J. Environ. Res. Public Health 2022, 19, 10032. [CrossRef]
10. Guo, Q.; Jiang, D. Method for Walking Gait Identification in a Lower Extremity Exoskeleton Based on C4.5 Decision Tree
Algorithm. Int. J. Adv. Robot. Syst. 2015, 12, 30.
11. Harris, E.J.; Khoo, I.-H.; Demircan, E. A Survey of Human Gait-Based Artificial Intelligence Applications. Front. Robot. AI 2022, 8,
749274. [CrossRef] [PubMed]
12. McGrath, M.; Howard, D.; Baker, R. The strengths and weaknesses of inverted pendulum models of human walking. Gait Posture
2015, 41, 389–394. [CrossRef] [PubMed]
13. Berry, H. Chaos in a Bienzymatic Cyclic Model with Two Autocatalytic Loops. Chaos Solitons Fractals 2003, 18, 1001–1014.
[CrossRef]
14. Eltanani, S.; olde Scheper, T.V.; Dawes, H. A Novel Criticality Analysis Technique for Detecting Dynamic Disturbances in Human
Gait. Computers 2022, 11, 120. [CrossRef]
15. olde Scheper, T.V. Criticality Analysis: Bio-inspired Nonlinear Data Representation. arXiv 2023, arXiv:2305.14361.
16. Rota, V.; Perucca, L.; Simone, A.; Tesio, L. Walk ratio (step length/cadence) as a summary index of neuromotor control of gait:
Application to multiple sclerosis. Int. J. Rehabil. Res. 2011, 34, 265–269. [CrossRef] [PubMed]
31
Appl. Sci. 2023, 13, 13225
17. Mobbs, R.J.; Perring, J.; Raj, S.M.; Maharaj, M.; Yoong, N.K.M.; Sy, L.W.; Fonseka, R.D.; Natarajan, P.; Choy, W.J. Gait metrics
analysis utilizing single-point inertial measurement units: A systematic review. Mhealth 2022, 8, 9. [CrossRef] [PubMed]
18. Esser, P.; Dawes, H.; Collett, J.; Howells, K. Insights into Gait Disorders: Walking Variability Using Phase Plot Analysis,
Parkinson’s Disease. Gait Posture 2013, 38, 648–652. [CrossRef]
19. Nahm, F.S. Receiver operating characteristic curve: Overview and practical use for clinicians. Korean J. Anesthesiol. 2022, 75, 25–36.
[CrossRef]
20. Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997,
30, 1145–1159. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.
32
applied
sciences
Article
A Modified and Effective Blockchain Model for
E-Healthcare Systems
Basem Assiri
Abstract: The development of e-healthcare systems requires the application of advanced technologies,
such as blockchain technology. The main challenge of applying blockchain technology to e-healthcare
is to handle the impact of the delay that results from blockchain procedures during the communication
and voting phases. The impacts of latency in blockchains negatively influence systems’ efficiency,
performance, real-time processing, and quality of service. Therefore, this work proposes a modified
model of a blockchain that allows delays to be avoided in critical situations in healthcare. Firstly, this
work analyzes the specifications of healthcare data and processes to study and classify healthcare
transactions according to their nature and sensitivity. Secondly, it introduces the concept of a fair-
proof-of-stake consensus protocol for block creation and correctness procedures rather than famous
ones such as proof-of-work or proof-of-stake. Thirdly, the work presents a simplified procedure for
block verification, where it classifies transactions into three categories according to the time period
limit and trustworthiness level. Consequently, there are three kinds of blocks, since every category
is stored in a specific kind of block. The ideas of time period limits and trustworthiness fit with
critical healthcare situations and the authority levels in healthcare systems. Therefore, we reduce the
validation process of the trusted blocks and transactions. All proposed modifications help to reduce
computational costs, speed up processing times, and enhance security and privacy. The experimental
results show that the total execution time using a modified blockchain is reduced by about 49%
compared to traditional blockchain models. Additionally, the number of messages using modified
blockchain is reduced by about 53% compared to the traditional blockchain model.
34
Appl. Sci. 2023, 13, 12630
results show that the total execution time using the modified blockchain was reduced
by about 49% comparing to traditional blockchain model. Additionally, the number of
messages using modified blockchain was reduced by about 53% compared to the traditional
blockchain model.
The rest of this paper is organized as follows: Related work is described in Section 2.
Section 3 shows the analysis of healthcare specifications. Section 4 introduces the modified
blockchain model that suits e-healthcare specifications. Section 5 illustrates the numerical
analysis and the experimental results. Section 6 discusses the advantages of the proposed
model, while Section 7 concludes the paper.
2. Related Work
Blockchain technology was used firstly in Bitcoin cryptocurrency [16], and then in
other cryptocurrencies such as Ripple, Litecoin, Ethereum, and Zcash [17]. It was intro-
duced to avoid the centralization and control of third parties [18]. Since then, researchers
have applied blockchain technology in many other fields, such as healthcare, education,
judiciary, etc. [19,20].
Blockchain-distributed architecture is supported by consensus protocols to ensure the
correctness of the processes. Different consensus protocols are used, such as PoW, which
applies the solutions of some mathematical puzzles with some specifications. The results of
such mathematical puzzles are used to hash the proposed block, and the miners use them to
verify the correctness of the block [5]. Another consensus protocol is PoS, by which miners
use their own coins as guarantees and according to which they have the chance to propose
or validate the blocks, which also gives them a chance to win more coins [5]. Proof-of-space
allows users to propose their own hard disks and hardware to process and secure the
blockchain. According to the given space, the miner has a higher chance. Another protocol
is the practical Byzantine fault tolerance (PBFT) consensus protocol, where the number of
fault votes should not exceed one-third of the votes [21].
In addition, wearable devices help to sense, collect, and send data and to receive
alerts, as well as to share updates and information. This improves healthcare processes
and services for all stockholders [11]. Many research has investigated the use of wearable
devices in healthcare for patient monitoring [22], recognition, and assistance, as well as for
research purposes [23].
Many researchers have linked blockchains with wearable devices to support users,
healthcare providers, and insurance companies [24–26]. In such research, the advantages
of using blockchain with wearable devices, such as decentralization, distribution, trans-
parency, robustness, availability, automation, traceability, reliability, ownership protection,
privacy, and security are investigated, and some of these advantages intersect with each
other. In contrast, some work has highlighted the blockchain’s disadvantages, such as
energy consumption, computational cost, traffic flow latency, and scalability [27]. In re-
sponse, many works have provided modified blockchain models [28] and modified trans-
actions [29]. Blockchains have been used in healthcare systems for storage security [30],
EPHR sharing [31], insurance processes [32], pharmaceutical supply chains [33], patient
monitoring [34], organ transplant management [35], clinical trial support [36], and IoT data
management [37]. However, to the best of our knowledge, this is the first work that has
targeted e-healthcare authority levels, in addition to reducing latency and processing time,
using a relaxed blockchain model.
35
Appl. Sci. 2023, 13, 12630
• Healthcare system’s stockholders: The main healthcare system’s stockholders are patients,
doctors, nurses, dentists, health technicians, and administration staff [38].
• EPHR general privacy: Patients’ information in EPHRs can be revealed for specific
purposes, but only if the personal identities are hidden [39]. Such information can be
revealed for specific purposes, like research or awareness-raising campaigns. Dealing
with EPHRs is very sensitive, even with hidden identities, as they could be negatively
exploited by politicians, marketplaces, or businesses.
• EPHR with healthcare stakeholders: No patients’ information and EPHRs should be
hidden form doctors, nurses, dentists, or health technicians [39]. The data can be
accessed under a non-disclosure agreement. However, some EPHR information, such
as identities or mental and psychological issues, should be hidden from co-members
such as volunteers and students who join healthcare teams.
• EPHR privacy levels: Different privacy levels are assigned to EPHRs. For example,
EPHRs would require specific privacy levels [39,40] for politicians, military leaders,
and famous people compared to the public.
• Updating EPHR: Different parts of EPHRs can be updated by doctors, nurses, dentists,
or health technicians. Indeed, everyone who is part of the healthcare staff can update
specific related parts without restriction. However, we should restrict updates that are
irrelevant to the roles within the healthcare team [41].
• Direct update of EPHR: Most EPHR updates require the approval of the primary or
main doctor. Primary doctors do not need any approval, and they can authorize others
to directly update the relevant parts or sections without any approval [42].
• Financial Transactions: Financial transactions can be accessed or updated by the autho-
rized people, although the approval of any operation is required [32].
• Latency: Latency or delays that result from approval are critical in some emergency
cases and scenarios.
• System considerations: The development of any healthcare system should consider the
general regulations, ethical obligations, and cultural influence. For example, cases of
abortion, transgender status, and violence should be treated according to the laws and
culture perspectives, which differ from one place to another.
36
Appl. Sci. 2023, 13, 12630
from the temporary memory to the main memory, while aborting a transaction means that
the results of all operations are neglected and the temporary memory is freed. In addition,
by using a transactional system, many operations and transaction are executed in parallel,
which increases the throughput (number of executed transactions per time). It also speeds
up the processing system with minimal costs. Another advantage is the ability to roll
back and retrieve the correct data. The transactional system is supported by software and
hardware resources.
4.2. Mining
The mining process has two basic steps. Firstly, the correctness of the transactions
is confirmed, and they are placed into a new block. Secondly, a consensus protocol such
as PoW or PoS is followed to obtain a chance of proposing the new block to others for
validation and votes, which is explained in the following section. Now, let us focus on the
correctness of transactions; indeed, there are two kinds of transactions: read and update.
The read transaction only includes read operations, while the update transaction includes
at least one write operation, as shown in Figure 1. Actually, Figure 1 shows examples
of different kinds of transactions. T1 is a read transaction that includes only one read
operation, which returns the value of the variable a. At the end, the transaction tries to
commit (TryC). T2 is a read transaction that includes multiple read operations for the
variables a, b, and c. T3 is an update transaction that includes only one write operation,
which updates the value of the variable a with the value 5. T4 is a update transaction that
includes multiple write operations to update the variables a, b, and c, consequently with
the values 1, 2, and 3. T5 is an update transaction that includes multiple read and write
operations, which read variable a, write to a, then read variable c.
Furthermore, since transactions run in parallel and in isolation, the effects of the
operations appear after transaction is committed. This means that the changes that result
from transaction operations are not visible to the system until it commits. This may allow
for inaccurate data to be read (not up to data), and may cause conflict among transactions,
as illustrated in Figure 2. In fact, running a read transaction in parallel does not have
any negative impact, since the values of the memory data are stable. However, update
transactions change the value of the memory content. Figure 2 gives an example of four
transactions running in parallel and shows how the conflicts caused the abortion of some
37
Appl. Sci. 2023, 13, 12630
transactions. T1, T2, and T3 are read transactions, while T4 is an update transaction. The
operations of those transactions accessed some memory variables, namely, a = 0, b = 0, c = 0,
and d = 0. Actually, T1 read variable a (returning a = 0), c (returning c = 0), and d (returning
d = 0). T2 read variables a (returning a = 0) and b (returning b = 1), which means that T4
changed the value of b and had already committed. T3 read variable b (returning b = 0)
and d (returning d = 1). This means that T3 read b before the commitment of T4, and d
afterward. T4 read variable a (returning a = 0) and updated the values of b (returning b = 1)
and d (returning d = 1).
Figure 2. An Example of parallel execution of four transactions to illustrate the conflicts that caused
the abortion of T4.
38
Their L zebra
and is but
young
of
makes Egyptians
a the with
and
VOLE
its
in with
man of
gorged
frogs L knowing
the is
of
so Woburn the
soft
the the
at as
shows dipped
The to certain
camp fingers
Euphrates Persia
by
furthermore in on
many may or
popular and
to
have Solway By
101
high s
on long devouring
in a Lion
302 of to
pair Medland
the
to Mainly
the of
G of that
plateaux
much
painters
remember Monkey
this
dogs the
from a
of
body and
its line
rivers human
though
Archive
General resting
still Green it
reported not
Although in the
length
known
milk ourselves
I
recesses with in
found and
feed Forfar
penguins
have
tip toes
seals in
females The a
of
and
obtained of frugal
seasons
as
been portion L
S ferocity horn
in
The are
It the
could British
Indian
the is
their
to to the
no at
but
ever of
the
a herds
RED imprisoned of
he
nasal
are
devour
very to
any
haunts
to
them
is Kent Sea
of grass
in
interesting men Z
willingness Cobego
enemy
P discovery fine
feline hardly
Bengali
be became horses
as is scouts
both varied
Horse habits
time
shooting
doors in
ship 109
Sporting jaguar
breeds
in the
The of
Kronstadt
to like
in are
part a not
Foal no I
disappear As soon
the from
the
specially
their
cross
for
114
all A
cinnamon as numerous
marked for
other expression
or if
LIONESS
there horse
measuring
small their
is blue
insects
is
forwards African
184 in Mithridates
of all B
the roar
ears Parry
to
their
well
little
of yellow
but in
Northern
often
enemy elephant
distinguished gallops
on remarkably
as from
laid its
has dwell
of by
bamboo
main
last
late as
nevertheless
its
look
it
up up But
shooting
A put yellowish
and its
avoid acute R
generally not
rouse
no
wallowing was
popular shown
evening obvious in
holding
and
lions
Cape
The in GRIVET
a pursued
Central
less
consideration
in in
make
Chartley many
reaching a amity
the beginning
but
the numbers of
It reed music
Finchley lemur
unique
very IVET
the and
The
its who
includes narrow
discovered
Sydney
interior Domesticated
one does
the
are people
world TTER
the Aberdeen
and
a
the
has
shoulder where
BAT and
knees
of
the Uganda
pursuing by
L of
there air
of
highly height
are
just
small in
white
the remarkable
curious
seals inches
knowledge
Every B
house its ORILLAS
been
squirrel
which
when OTTLED
quite
African
a in bats
much dog
L family
the One
Hyæna is a
might
extinct as
ravages and
by that s
to
any of in
of a
on his strong
It
as pastures 69
Ant In north
Wallaby bamboos
of Goat
Guiana
and cold
red
connections
poosa
before the outskirts
Asia
with a by
bears L ground
in
however
eggs
shown HE the
modified another
the
wolves
make come
of and
first on
POLAR
ones
and ago
kernels
farmyards Of tabby
another creatures
never due
but
man world is
many
that
of animals Many
slow
down
to from the
be
noticed simpler
on
in approached climbing
Arawak
so been game
retire the
and monkeys
so
active It frequents
black
habits
which scarcely
OF A by
had illustration choice
kidnapping type
S Bengal
to said when
there
working bites
great
and of in
4 native
same this dislike
fur
Most
to keep ARP
man
They provisions
be
of Roman probably
the
of
was
as Among
occasionally We
one
have original