Key research themes
1. How can human-centered process modeling improve the effectiveness of knowledge discovery in databases (KDD)?
This research area focuses on understanding and characterizing the iterative, knowledge-intensive interactions between humans and data during the KDD process. Recognizing that learning algorithms are only a segment of the overall KDD workflow, it emphasizes the design of systems that assist human analysts across all phases, from data selection and cleaning to interpretation and deployment. The goal is to create integrated environments that support users’ tasks holistically, rather than solely providing standalone data mining algorithms. This emphasis on user and task-centered perspectives addresses the complexity and real-world application challenges in knowledge discovery.
2. What are effective methodologies and process models for applying data mining and knowledge discovery in healthcare settings?
This theme investigates domain-specific adaptations of KDD and data mining tailored for healthcare’s complex, voluminous, and heterogeneous data. It explores methodological considerations like integrating clinical, financial, demographic, and socioeconomic data, along with handling data types such as patient records and genomic data. The literature evaluates application areas including diagnosis, treatment optimization, patient risk prediction, and healthcare management, alongside discussing challenges around data quality, system integration, and domain knowledge incorporation to improve decision making and patient outcomes.
3. How can data mining methods address challenges of continuously arriving data streams and temporal event analysis?
This theme examines methodological frameworks and algorithms designed to extract knowledge from data streams characterized by high volume, velocity, and continuous generation. It addresses the constraints of conventional static data mining on streaming data, necessitating one-pass, real-time, and resource-efficient processing. Furthermore, it delves into mining temporal relationships between interval events and detection of associations that unfold over time, providing insights into dynamic behaviors in applications like network monitoring and financial transactions.