Making Absence Visible: The Roles of Reference and Prompting in Recognizing Missing Information

Hagit Ben Shoshan (ORCID 0009-0007-5945-695X), University of Haifa, Haifa, Israel; Joel Lanir (ORCID 0000-0002-9838-5142), University of Haifa, Haifa, Israel; Pavel Goldstein (ORCID 0000-0002-5224-1725), University of Haifa, Haifa, Israel; and Osnat Mokryn (ORCID 0000-0002-1241-9015), University of Haifa, Haifa, Israel

(5 June 2009)

Abstract.

Interactive systems that explain data or support decision making often emphasize what is present while overlooking what is expected but missing. This presence bias limits users’ ability to form complete mental models of a dataset or situation. Detecting absence depends on expectations about what should be there, yet interfaces rarely help users form such expectations. We present an experimental study examining how reference framing and prompting influence people’s ability to recognize expected but missing categories in datasets. Participants compared distributions across three domains (energy, wealth, and regime) under two reference conditions: Global, presenting a unified population baseline, and Partial, showing several concrete exemplars. Results indicate that absence detection was higher with Partial reference than with Global reference, suggesting that partial, sample-based framing can support expectation formation and absence detection. When participants were prompted to look for what was missing, absence detection rose sharply. We discuss implications for interactive user interfaces and expectation-based visualization design, while considering cognitive trade-offs of reference structures and guided attention.

Absence, Reference framing, Data visualization, Expectations, Learning via surprisability, Missing commonalities

CCS Concepts: Human-centered computing → Visualization design and evaluation methods; Human-centered computing → User studies; Computing methodologies → Artificial intelligence

1. Introduction

Gregory: “Is there any other point to which you would wish to draw my attention?”
Holmes: “To the curious incident of the dog in the night-time.”
Gregory: “The dog did nothing in the night-time.”
Holmes: “That was the curious incident.”
—Arthur Conan Doyle, “Silver Blaze” (1892) (Doyle, 2024)

Absences are often as revealing as presences. Arthur Conan Doyle’s Silver Blaze dramatizes this through Sherlock Holmes’s attention to the dog that did not bark. Under ordinary circumstances, a watchdog is expected to bark; its silence was therefore not emptiness but a violation of expectation—a missing event that became a decisive clue. The case illustrates a general principle: absence is meaningful only against a model of what should have been present. It is not a signal on its own, but a deviation from expectation (Farennikova, 2013; Martin and Dokic, 2013; Cavedon-Taylor, 2017).

Expectation frames govern both everyday inference and data interpretation. We notice a friend who fails to arrive, or a skipped heartbeat, precisely because we anticipated them. In data analysis and visualization, insight arises not only from observed values but also from those that are expected yet missing. A product review that omits any mention of “price” or a national dataset lacking “healthcare” may reveal more by what it leaves out than by what it shows, provided the observer has a reference of what is typical.

Human cognition, however, rarely performs this comparison automatically. The well-documented feature-positive effect (Newman et al., 1980) shows that people detect present cues more readily than absent ones and often overlook missing information unless explicitly guided (Hearst, 1989; Mazor et al., 2020). Judgments of absence are slower, less confident, and strongly dependent on contextual support (Tversky and Kahneman, 1974; Legrenzi et al., 1993). In cognitive terms, absence is perceived when an internal prediction is violated. In interface terms, it becomes visible only when expectations are made explicit.

This expectation-based perspective reframes a classic visualization problem. Standard charts excel at showing what exists, but provide few cues for what is missing and should exist. A missing bar may be indistinguishable from an unmeasured one; a gap may simply disappear into the background. Prior work in visualization and decision-making shows that comparative structures, such as baselines, benchmarks, and base rates, make expectations explicit and enable more accurate reasoning (Attfield et al., 2010; Margoni et al., 2024; Gigerenzer et al., 1988). From this standpoint, reference frames function as externalized expectations that allow viewers to detect deviations, including absences. Computational methods such as Latent Personal Analysis (LPA) and Learning via Surprisability (LvS) formalize this idea by building population-level reference models that expose deviations from expectation (Mokryn and Ben-Shoshan, 2021; Mokryn et al., 2025). LvS then identifies missing commonalities as features prevalent in the population but absent in a specific instance, thus turning absence into structured information.

Here, we examine how reference framing and guidance jointly influence the recognition of missing information in data visualizations. We hypothesized that partial reference frames would encourage detection of absence by prompting viewers to internally complete missing context. We further hypothesized that explicit prompting, which provides guided attentional support, would substantially increase detection across both reference conditions.

Using open-ended responses to comparative bar-chart visualizations, we measure how people notice missing categories under (1) partial exemplar reference and (2) global aggregate reference, both before and after guided prompting.

Our contributions are threefold:

(1)

We introduce the concept of expectation-based visualization, which embeds explicit reference models to externalize what “should be” present.
(2)

We provide empirical evidence that reference framing meaningfully shapes how people recognize missing information: partial frames prompt viewers to reconstruct expectations, whereas global frames support direct comparison. Guided prompting dramatically amplifies detection across both conditions, showing that attentional guidance can overcome the inherent presence bias in spontaneous interpretation.
(3)

We outline design implications for intelligent visualization systems that make expectations explicit and guide users’ attention to missing information, balancing cognitive effort with insight on how absences are presented.

By connecting cognitive bias (the feature-positive effect) with interactive design, we show how visualization systems can help humans “see what is not there”. Absences, when framed through explicit expectations and guided attention, become perceptible, interpretable, and actionable, but only when users are prompted to adopt an expectation-oriented stance.

2. Related Work

We review key cognitive principles and empirical findings that demonstrate how human perception and reasoning are systematically biased towards detecting presence rather than absence, highlighting the crucial role of contextual expectations and internal models in enabling the detection of missing features.

Philosophy offers a complementary view. Mumford (Mumford, 2021) argues that absence is neither directly perceived nor merely inferred. Instead, it is a hybrid phenomenon: our senses register what is present, our minds compare this against what is expected, and the result is a felt experience of absence, a kind of “user illusion”. This account suggests that absence perception is itself a reference-framing process: without an expectation against which the present is compared, no absence would be experienced.

2.1. Presence bias

Presence bias is the systematic tendency to detect, learn, and reason from things that are present more readily than from things that are absent. In learning and judgment, this asymmetry is well known as the feature-positive effect (FPE): organisms more easily associate “A and B” than “A and not B”, and people process present cues more fluently than missing ones (Newman et al., 1980). Evidence for this bias appears early in development. Infants attend more to added than to removed features, revealing a deep perceptual asymmetry (Coldren and Haaf, 2000). As Kahneman (Kahneman, 2011) notes, people construct coherent stories from available cues and tend to ignore what is missing unless it is made salient.

In data visualization, this cognitive bias has concrete implications. Human vision is tuned to presence over absence: targets defined by an added feature “pop out”, whereas those defined by a missing feature are systematically harder to detect—the classic search asymmetry (Treisman and Gormican, 1988). In quantitative displays, zeros are particularly meaningful because they represent true absences rather than missing or incomplete data. Making such zeros explicit is therefore crucial to counteract presence bias: otherwise, attention focuses only on where something happens, not where it does not. When data values are absent, viewers often fail to perceive that feature at all, interpreting the omission as missing information rather than a legitimate zero. Research on visualizing missing data shows that explicit markers or gaps improve comprehension and decision-making (Song and Szafir, 2018).

2.2. Absence and missing data

Absences are not merely missing data but a missing part of a cognitive reference frame: features that are expected yet not present. Absence is thus an expectation-dependent cognitive phenomenon, whereas missing values are a data-quality issue. In prior work on visualizing missing data, Kandel et al. (Kandel et al., 2011) address how systems can surface missing values such as nulls, incomplete rows, or gaps arising from data collection. In contrast, our study examines cases where the data themselves are complete, but the viewer fails to notice a category that is semantically expected, given a reference frame. Our focus is on how reference structures and attentional guidance influence the recognition of absent expected categories that “should have been present”, a capability fundamentally different from detecting missing entries in the data. The experimental study reported here applies this reasoning to data visualization, testing how different forms of reference framing (global versus partial) shape human detection of missing commonalities in graphical data.

2.3. Perceiving and Detecting Absence

Cognitive and neural perspectives of absence.

Experiences of absence are common, such as noticing that a car is missing from its usual place. Yet, the neural and cognitive mechanisms underlying how the mind “perceives nothing” remain debated (Mumford, 2021; Barnett and Fleming, 2024). Some theories argue that absence can be directly perceived, while others maintain it is inferred from contextual cues (Cavedon-Taylor, 2017; Mumford, 2021). The distinction between perceiving and inferring absence is subtle but fundamental: it determines whether absence is experienced as a percept or as a cognitive reconstruction.

The role of internal models.

Recognizing absence requires prior expectations. Without anticipating a particular feature, one cannot register its lack. Human perception continually compares incoming sensory input against internal predictive models; absence is detected when expected signals fail to appear. Thus, the perception of absence arises from a mismatch between prediction and observation. As Mazor (Mazor, 2025) notes, such inference depends on a self-model capable of transforming the “absence of evidence” into “evidence of absence”.

Research using the violation-of-expectation (VOE) paradigm supports this model: when expected objects or events are missing, infants exhibit surprise, prompting learning and model revision (Margoni et al., 2024). This demonstrates that absence detection is not a passive process but an active one: triggering attention, expectation adjustment, and cognitive updating.

Absence detection as cognitive inference.

Detecting absence involves comparing current input with internalized expectations and reconciling discrepancies between them. When the observed and the expected diverge, attention shifts and higher-order inference processes engage to explain the gap. Recognition of absence thus depends on the same predictive mechanisms that support surprise and learning. Empirical work confirms that judgments of absence differ systematically from those of presence: people are slower and less confident when deciding that a stimulus is absent, even when correct (Meuwese et al., 2014; Mazor et al., 2020). Absence detection is therefore a dynamic and cognitively demanding process—less fluent, slower, and less confident than presence detection, yet essential for adaptive learning and reasoning about what is missing.

2.4. Reference Framing and the Role of Expectations

A recurring insight across cognitive psychology and visual analytics is that the ability to detect absence depends on the presence of an explicit or internalized reference frame. Gigerenzer et al. (Gigerenzer et al., 1988) demonstrated that base-rate use is not a fixed heuristic bias but a function of problem representation: when participants directly observed the random-sampling process rather than merely hearing about it, their probability judgments approached Bayesian reasoning. Presentation, therefore, shapes the internal model against which new evidence is evaluated. The same logic underlies the violation-of-expectation paradigm in developmental research (Margoni et al., 2024), where infants’ surprise at impossible or missing events reveals that perception operates through predictive mental models: an absence is noticed only when it violates an expectation of presence. In visual reasoning, Attfield et al. (Attfield et al., 2010) extend this principle to analytic contexts, describing sensemaking as an iterative loop between foraging for data and updating explanatory hypotheses. Effective visualization systems, they argue, are those that externalize expectations, enabling users to recognize both anomalies and absences through visible discrepancies between what is observed and what is assumed. Neisser’s perceptual cycle account offers a complementary cognitive mechanism for how reference frames support absence detection: perception is guided by anticipatory schemata that direct attention and are updated through sampling. When the available input is partial, viewers must actively construct and refine an internal model of “what should be there” (Neisser, 1976).

Together, these traditions converge on the idea that reference framing provides the cognitive substrate for absence detection: it anchors perception to an explicit baseline, making deviations—positive or negative—computationally and perceptually tractable. From a computational perspective, this logic is mirrored in methods that formalize reference-based comparison as a means of expectation modeling. Approaches such as predictive coding, surprisal-based modeling, and distributional reference frameworks treat data interpretation as a process of quantifying deviation from an expected baseline (Mokryn and Ben-Shoshan, 2021; Feng et al., 2024). In visual analytics, population-level summaries (averages, density plots, or global baselines) often function as reference frames, contextualizing local data points and exposing deviations or absences. This design principle is consistent with approaches in recent work that compare individual observations to model or aggregate expectations (Suschnigg et al., 2025; Mokryn et al., 2025). Whether implemented through probabilistic inference, divergence metrics, or learned priors, these methods instantiate the same principle that underlies human reasoning about absence: the identification of what is missing emerges only through explicit contrast with what was expected to be there. In the context of data visualization, reference framing thus functions as a bridge between cognitive and computational reasoning about expectation, clarifying how context shapes the salience of absence.
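As a rough illustration of reference-based comparison, the sketch below flags categories that are prevalent in a reference distribution but absent (or nearly so) in a focal one. It is not the LPA or LvS algorithm; the function name `missing_commonalities`, both thresholds, and the example shares are assumptions made purely for illustration.

```python
from typing import Dict, List


def missing_commonalities(
    focal: Dict[str, float],
    reference: Dict[str, float],
    prevalent_at: float = 10.0,
    absent_below: float = 1.0,
) -> List[str]:
    """Return categories prevalent in the reference but (near-)absent in the focal entity.

    Shares are percentages. Both thresholds are hypothetical illustration values.
    """
    flagged = []
    for category, expected_share in reference.items():
        # A category that the focal entity does not list counts as a share of zero.
        observed_share = focal.get(category, 0.0)
        if expected_share >= prevalent_at and observed_share < absent_below:
            flagged.append(category)
    return flagged


# Made-up shares (percent); these are NOT the study's stimuli.
reference = {"Coal": 35.0, "Gas": 25.0, "Hydro": 15.0, "Nuclear": 10.0, "Solar": 5.0, "Wind": 10.0}
focal = {"Coal": 20.0, "Gas": 70.0, "Solar": 10.0}

print(missing_commonalities(focal, reference))  # ['Hydro', 'Nuclear', 'Wind']
```

The reference dictionary plays the role of the externalized expectation: with no reference, the function has nothing to flag, which mirrors the argument that absence is only defined relative to a baseline.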

3. Hypotheses

The detection of missing information depends on expectations: people notice absences only relative to what they believe should be present. The experiment tests two complementary factors that influence this process: the form of reference framing, which determines how expectations are anchored, and attentional guidance, which determines whether participants are prompted to search for absences.

3.1. Reference Framing

Building on Gigerenzer et al. (Gigerenzer et al., 1988), we propose that the perception of information surplus or absence depends on how population-level context is represented and experienced. In their experiments, participants correctly incorporated base-rate information only when they observed the sampling process: they could see an urn containing, for example, 70 engineer cards and 30 lawyer cards and observe one being drawn. This visual experience created an internal model of the population. When the same numerical information was only verbally asserted, participants no longer treated it as evidence about the population. Instead, they focused on the descriptive traits of each case (for instance, whether a person sounded like an engineer or a lawyer) and ignored the stated base rates.

We apply this principle to the domain of reference-based data interpretation. The experiment manipulates how reference information, that is, the information used to form expectations about the focal entity, is presented. A partial reference displays several concrete exemplars from a comparison set. In the energy domain, for example, this corresponds to showing the energy production of several other countries. This format parallels Gigerenzer et al.’s “observable sampling” condition, where population structure becomes accessible through concrete examples. A global reference, by contrast, summarizes the entire population through an aggregate visualization such as a mean or range. In the energy example, this corresponds to displaying the world’s average energy production profile.

Two types of reference frames are compared:

Partial reference: In this condition, people are presented with several concrete examples from the population as a reference. This framing invites reasoning and active expectation building. This prediction is also consistent with Neisser’s view that partial input initiates an anticipatory schema and recruits exploratory processing to complete what is expected. By requiring participants to infer the broader pattern from exemplars, partial framing should strengthen the internal reference model against which absences can be registered (Neisser, 1976).
Global reference: In this condition, people are presented with framing that summarizes the population, supporting abstract, norm-based comparison.

Similar to Gigerenzer et al. and Neisser, we expect that constructing expectations from partial samples will heighten sensitivity to what is missing. When viewers must infer the population pattern from limited examples, they engage more deeply in building a mental model of what should be present, making deviations and absences more noticeable.

In contrast, a global statistical reference may promote passive comparison to an abstract average, reducing the salience of specific missing elements.

H1 (Reference Framing). Absence detection will be higher under a partial reference than under a global reference, because inferring expectations from a few concrete examples makes missing information more salient.

3.2. Guided Attention

People naturally attend to what is present. This tendency, known as the feature-positive effect (Newman et al., 1980), is widely documented. Explicit prompts that direct attention to what might be missing should counter this bias and elicit a more systematic search for absence (Tversky and Kahneman, 1974).

H2 (Guided Attention). Guided prompts will markedly increase absence detection compared with spontaneous descriptions, regardless of reference framing.

4. Methodology

To explore absence detection, we conducted a remote study in which 100 participants interpreted bar-chart visualizations comparing a focal entity to either global or exemplar-based references, first spontaneously and then under guided prompting. The study involving human participants was reviewed and approved [Omitted for anonymity]. Participants were recruited via the Prolific platform, where they reviewed an online consent form and provided informed consent prior to participation. All participants received monetary compensation and could exit at any time without penalty. No personally identifying information was collected.

4.1. Participants and Design

Figure 1. Overview of the experimental design. The study employed a 2×2 mixed factorial design with Reference Framing (Partial vs. Global) as between-subjects and Prompting Mode (Spontaneous vs. Guided) as within-subjects. Participants (N = 100) were randomly assigned to one of two reference conditions: Partial Reference, where the focal entity (E1) was compared to multiple concrete exemplars, or Global Reference, where E1 was compared to a single aggregate baseline. All participants responded to both prompting conditions across three datasets (political regimes, income brackets, and energy sources), resulting in six open-ended responses per participant.

As illustrated in Figure 1, the study followed a mixed factorial design combining between- and within-subject factors. We recruited 100 participants through the Prolific online platform to complete a web-based visualization interpretation task. Participants were randomly assigned to one of two reference framing conditions: a Partial Reference condition presenting several concrete exemplars and a Global Reference condition presenting an aggregated population baseline. Reference framing was manipulated between subjects, while prompting mode (Guided vs. Spontaneous) and dataset domain (Regime, Wealth, Energy) varied within subjects. Each participant viewed three datasets under both prompting conditions and produced a total of six open-ended text responses.

Table 1. Participant demographics

Demographics	Values
Age	Range: 22–80; Mean: 42; SD: 13.1
Gender	Female: 41%; Male: 59%
Education	High School: 26%; Bachelor’s: 41%; Master’s: 20%; Other: 13%
Country of Residence	United States: 62%; United Kingdom: 34%; Other: 4%
Ethnicity	White: 67%; Asian: 15%; Black: 4%; Mixed/Other: 8%; Unspecified: 6%
Employment Status	Full-time: 41%; Part-time: 7%; Not in paid work: 7%; Unemployed: 5%; Other/No answer: 40%

Table 1 summarizes the demographic composition of the sample. Participants represented a diverse adult population with balanced gender and educational backgrounds. Most held higher education degrees and resided primarily in the United States and the United Kingdom. The two experimental conditions were comparable in demographic makeup.

4.2. Data and Conditions

Three datasets captured distinct categorical domains (Table 2): political regimes, income distribution, and energy production. The visualizations were presented as vertical bar charts showing percentage distributions across categories. Each chart included a focal entity labeled E1 (highlighted in yellow) compared with one or more reference entities. E1 was constructed as an extreme case concentrated in a single category (a strong presence of one feature and a notable absence of others), while reference entities displayed more balanced distributions.

Table 2. Datasets and variables used in the experiment.

Dataset	Variable Description	Categories
Regime Type Distribution	Percentage of population living under each political regime type	Closed Autocracy, Electoral Autocracy, Democracy, Liberal, No Regime
Wealth Distribution	Percentage of population in each daily per-capita income bracket	Up to $3, $3–$4.20, $4.20–$8.30, $8.30–$10, Above $10
Energy Production	Percentage of annual energy production by source (TWh)	Bio, Coal, Gas, Hydro, Nuclear, Oil, Other, Solar, Wind

Partial Reference (Concrete Exemplars).

E1 appeared alongside two or three specific entities, providing visible samples from the broader population and enabling reasoning by analogy to observed variation. Example comparisons included Asia, North America, and Europe in the Regime Type dataset; China, Indonesia, and Ireland in the Wealth dataset; and Costa Rica, Cuba, and France in the Energy dataset.

Figure 2 illustrates this condition, where the focal entity (E1) is compared with several concrete instances representing distinct distributions.

Global Reference (Summary Aggregate).

E1 was displayed alongside a single entity labeled World, representing the aggregated population distribution for the same variable. This reference provided an overall statistical norm rather than individual examples. For instance, in the Regime dataset, the World bar showed the global percentage of populations under each regime type (approximately 45% electoral autocracy, 35% democracy, 20% other). In the Wealth dataset, the global income distribution was centered on the $4.20–$8.30 range, and in the Energy dataset, the global energy mix was dominated by coal and gas, with smaller shares of hydro, oil, and renewables.

Figure 3 shows this condition, in which E1 is evaluated against a unified global baseline that encapsulates the population-wide expectation.

4.3. Procedure

Participants viewed visualizations from the three datasets under their assigned reference condition. In the spontaneous phase, they answered the open-ended question: “How does E1 compare to what you observe in this visualization?” These initial responses captured natural, unprompted comparative reasoning. In the subsequent guided phase, the same visualizations were shown again with the targeted prompt: “What would you expect to see in E1 that appears to be missing or notably minimal?” Separating the two phases ensured that the first responses reflected intuitive impressions unshaped by explicit direction to search for absences.

4.4. Data Analysis

To analyze the open-ended responses, we developed two large language model (LLM)-based modules that were integrated into the research workflow: the Absence Detector and the Surplus Detector. These modules automatically identified linguistic signals of absence or surplus within participants’ textual explanations. The modules’ prompts and code are available in our repository: https://anonymous.4open.science/r/IUI_anonymized-DD71/README.md

Automated analysis pipeline.

All responses were exported from Google Sheets, lowercased, and tokenized using the tidytext and stringr packages in R. Stopwords, punctuation, and HTML artefacts were removed prior to model input. Each cleaned response was then passed to the Gemini-2.5-flash engine using a standardized API call (temperature = 0.2, top-$p$ = 0.9) to promote stable, near-deterministic behavior across runs. The model was prompted with the following fixed instruction:

“You are an expert in literary and linguistic analysis, specializing in identifying themes of absence, loss, and omission. Analyze the given text and determine whether it conveys the idea of ‘missing’ or ‘absence’. Respond only with a JSON object containing a Boolean field ("absence": true/false) and a brief justification string.”

Each response returned a structured JSON object containing: (i) a binary indicator of whether absence was expressed, and (ii) a short textual explanation of the model’s reasoning. For example, the input “E1 has no closed autocracy” was classified as "absence": true with the justification “explicit negation of a category indicates missingness”. All outputs were logged and version-controlled to ensure full reproducibility.
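A minimal sketch of how such a reply could be validated before logging is given below. It makes no API call; the helper name `parse_absence_response` and the field name `justification` are assumptions, since the prompt only specifies a Boolean `absence` field and "a brief justification string".

```python
import json
from typing import NamedTuple


class AbsenceLabel(NamedTuple):
    absence: bool
    justification: str


def parse_absence_response(raw: str) -> AbsenceLabel:
    """Parse one classifier reply; raise ValueError if it lacks the expected shape."""
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc
    # Require a real JSON Boolean, so that a string such as "true" is rejected.
    if not isinstance(payload, dict) or not isinstance(payload.get("absence"), bool):
        raise ValueError("reply lacks a Boolean 'absence' field")
    # Assumed field name; an absent justification falls back to an empty string.
    justification = str(payload.get("justification", ""))
    return AbsenceLabel(payload["absence"], justification)


reply = '{"absence": true, "justification": "explicit negation of a category indicates missingness"}'
label = parse_absence_response(reply)
print(label.absence, "-", label.justification)
```

Rejecting malformed replies up front, rather than coercing them, keeps ambiguous outputs out of the binary detection counts.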

This approach provided a reproducible and linguistically grounded method for classifying nuanced semantic expressions of absence in participants’ narratives, aligning closely with human expert judgments (Tang et al., 2024).

Validation.

To verify reliability, a random $20\%$ subset of responses ($n=120$) was independently coded by a trained linguist specializing in English semantics. Inter-annotator agreement between the human coder and the LLM output reached $\kappa=0.84$ (strong agreement; values above 0.80 are conventionally read as almost perfect). Based on this validation, the model achieved an overall accuracy of 0.92, precision of 0.94, recall of 0.89, and $F_{1}$ score of 0.92. Discrepancies were reviewed manually to identify systematic errors, which primarily involved ambiguous cases (e.g., implied absence without explicit negation).
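The validation measures reported above all derive from a 2×2 confusion table between the human codes and the model labels. The sketch below shows the standard formulas; the counts are hypothetical (chosen only to sum to 120), because the confusion table itself is not reported, and the function name `agreement_metrics` is an assumption.

```python
from typing import Dict


def agreement_metrics(tp: int, fp: int, fn: int, tn: int) -> Dict[str, float]:
    """Accuracy, precision, recall, F1, and Cohen's kappa for a 2x2 confusion table.

    Human codes are treated as the reference; 'positive' means absence was expressed.
    Assumes each denominator is non-zero.
    """
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    # Chance agreement from the marginals: both say "yes" plus both say "no".
    p_yes = ((tp + fp) / n) * ((tp + fn) / n)
    p_no = ((fn + tn) / n) * ((fp + tn) / n)
    p_chance = p_yes + p_no
    kappa = (accuracy - p_chance) / (1 - p_chance)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "kappa": kappa,
    }


# Hypothetical counts for n = 120; these are NOT the study's actual confusion table.
metrics = agreement_metrics(tp=50, fp=4, fn=6, tn=60)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Kappa is lower than raw accuracy because it discounts the agreement two raters would reach by chance given their marginal rates.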

All source code, prompts, and validation procedures are archived in the project’s open repository which will be provided upon acceptance.

Statistical analysis.

Because the dependent measures are binary (detection vs. no detection), categorical comparisons between conditions were analyzed using Pearson’s $\chi^{2}$ tests with Yates’ continuity correction. Separate tests evaluated (a) the effect of reference frame within each prompting mode, and (b) the effect of prompting mode within each reference frame. This approach aligns with the factorial design, where both manipulations are nominal variables with two levels each. $\chi^{2}$ tests were chosen because they assess differences in proportions without assuming normality, making them suitable for frequency-based comparisons with discrete outcomes.
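For a 2×2 table, the test can be reproduced in a few lines. The sketch below is in Python rather than the R used for the study, and it assumes 50 participants per reference condition, which the percentages in Table 3 suggest but the text does not state. For the spontaneous Energy comparison, the uncorrected statistic comes out at 9.18, the value listed in Table 3, while the Yates-corrected statistic is lower (about 8.00).

```python
import math
from typing import Tuple


def chi_square_2x2(a: int, b: int, c: int, d: int, yates: bool = False) -> Tuple[float, float]:
    """Pearson chi-square (df = 1) for the table [[a, b], [c, d]].

    Returns (statistic, p_value). With yates=True, applies Yates' continuity correction.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Shrink the deviation by n/2, never letting it drop below zero.
        diff = max(0.0, diff - n / 2)
    stat = n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With one degree of freedom, the upper-tail probability is erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Energy, spontaneous phase: 28% of 50 (Global) and 58% of 50 (Partial) detected absence.
# Rows are conditions; columns are detected / not detected. Group sizes are assumed.
plain = chi_square_2x2(14, 36, 29, 21)
corrected = chi_square_2x2(14, 36, 29, 21, yates=True)

print(f"uncorrected:     chi2 = {plain[0]:.2f}, p = {plain[1]:.4f}")
print(f"Yates-corrected: chi2 = {corrected[0]:.2f}, p = {corrected[1]:.4f}")
```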

Effect sizes were computed using Cramér’s $V$ (Cramér, 1999), which provides a normalized measure of association strength for contingency tables (ranging from $0$ to $1$). Condition-wise means and standard deviations of detection proportions were calculated using the group_by() and summarise() functions in dplyr. All quantitative analyses were conducted in R (version 4.2.0) using the dplyr, tidyr, and ggplot2 packages (Wickham et al., 2019).
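For reference, the standard definition of Cramér’s $V$ for an $r \times c$ table with $N$ observations reduces, for the $2\times 2$ tables used here, to the square root of $\chi^{2}/N$. As a worked check using the spontaneous Energy comparison from Table 3, and assuming $N=100$ (one observation per participant):

```latex
\[
V = \sqrt{\frac{\chi^{2}}{N \cdot \min(r-1,\; c-1)}}
\qquad\Longrightarrow\qquad
V_{2\times 2} = \sqrt{\frac{\chi^{2}}{N}} = \sqrt{\frac{9.18}{100}} \approx 0.30
\]
```

The same calculation for the combined spontaneous comparison, assuming $N=300$ (three domains per participant), gives $\sqrt{11.65/300}\approx 0.20$.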

5. Results

5.1. Absence Detection by Reference Framing and Prompting Mode

Domain	Spontaneous: Global (%)	Spontaneous: Partial (%)	Spontaneous: $\chi^{2}(1)$	Spontaneous: $p$	Spontaneous: $V$	Guided: Global (%)	Guided: Partial (%)	Guided: $\chi^{2}(1)$	Guided: $p$	Guided: $V$
Energy	28.0	58.0	9.18	$<.01$	0.30	88.0	94.0	1.10	$\geq.10$	0.10
Wealth	28.0	42.0	2.15	$\geq.10$	0.15	84.0	90.0	0.80	$\geq.10$	0.09
Regime	36.0	50.0	2.00	$\geq.10$	0.14	92.0	90.0	0.12	$\geq.10$	0.03
Combined	30.7	50.0	11.65	$<.001$	0.20	88.0	91.3	0.90	$\geq.10$	0.05

Table 3. Framing Effect Summary: Global vs. Partial Reference. Detection rates and statistical comparisons across domains under Spontaneous and Guided conditions.

Framing effects.

Table 3 summarizes how reference framing (Global vs. Partial) influenced absence detection under both spontaneous and guided conditions. Across domains, reference framing produced a clear pattern in the spontaneous condition but not in the guided one. When participants were not explicitly prompted to look for what was missing, absence detection was substantially higher under the Partial Reference framing (mean = 50.0%) than under the Global Reference framing (mean = 30.7%), $\chi^{2}(1)=11.65$, $p<.001$, $V=.20$. The difference was strongest in the Energy dataset ($\chi^{2}(1)=9.18$, $p<.01$, $V=.30$); Wealth and Regime showed the same direction, but the per-domain differences were not significant ($p\geq.10$). When attention was guided toward absences, detection rates converged across framings (88.0% vs. 91.3% combined, $p\geq.10$).

Prompting effect.

Prompting Effect Summary: Spontaneous vs. Guided
Global Reference
Domain	Spontaneous (%)	Guided (%)	Difference (pp)	$\chi^{2}(1)$	$p$	$V$
Energy	28.0	88.0	+60.0	36.92	$<.001$	0.61
Wealth	28.0	84.0	+56.0	32.08	$<.001$	0.57
Regime	36.0	92.0	+56.0	33.60	$<.001$	0.58
Combined	30.7	88.0	+57.3	102.14	$<.001$	0.58
Partial Reference
Domain	Spontaneous (%)	Guided (%)	Difference (pp)	$\chi^{2}(1)$	$p$	$V$
Energy	58.0	94.0	+36.0	18.69	$<.001$	0.43
Wealth	42.0	90.0	+48.0	25.53	$<.001$	0.51
Regime	50.0	90.0	+40.0	19.05	$<.001$	0.44
Combined	50.0	91.3	+41.3	63.11	$<.001$	0.46

Table 4. Prompting Effect Summary: Spontaneous vs. Guided. Improvements in detection rate and statistical comparisons within each reference frame across domains.

Across all domains and reference conditions, prompting produced a large and highly significant increase in absence detection (Table 4). When participants were explicitly asked to look for what might be missing, combined detection rates rose from 30.7% (Global) and 50.0% (Partial) in the spontaneous phase to about nine in ten under guided prompting (88.0% and 91.3%, respectively). Gains ranged from $+36$ to $+60$ pp across the individual tests, and the corresponding Cramér’s $V$ values ( $.43$ – $.61$ ) indicate a strong and robust prompting effect.

5.1.1. Qualitative Example

Table 5. Illustrative participant responses across spontaneous and guided conditions.

Condition	Participant’s Answer	Absence Detected?
Spontaneous	E1 is very democratic	False
Spontaneous	Gas represents 70% of the production, which is a lot higher than any other country.	False
Spontaneous	E1 is predominantly democratic (80%), well above the world average. It has almost no autocracies or authoritarian electoral regimes, suggesting a more stable and participatory political environment.	True
Spontaneous	E1 produces the most gas, coal and solar, respectively, of all the countries represented. E1 is also the only country that does not produce hydro energy.	True
Spontaneous	The daily per capita income shown in E1 is mostly over $10, showing a wealthy country compared to China and Indonesia, but not dissimilar to Ireland, although E1 has a small number of lower earners that Ireland does not	True
Guided	Data labels, explanation of terms, overall population number	False
Guided	Nothing appears to be missing. The totals appear to sum to 100%. It just contrasts with the other countries given the dependence on gas	False
Guided	A lack of eco fuels, namely nuclear, hydro, bio, and wind.	True
Guided	E1 has no one making $4.20 or below.	True
Guided	E1 has far less hydropower but much more gas use than the others	True

Table 5 provides illustrative examples of participant responses across spontaneous and guided conditions, with the engine’s evaluation. Generally, responses shifted from descriptive commentary in the spontaneous phase to explicit absence language once attention was guided, with markers such as lack, no one, and doesn’t have any.

5.2. Patterns Across Datasets

Figure 4 presents absence detection patterns across experimental conditions. Panel (a) shows detection rates for the three domains (Energy, Wealth, and Regime) under Spontaneous and Guided prompting, each compared across Partial and Global reference frames. Panel (b) summarizes the overall interaction between reference framing and prompting mode. Error bars in both panels represent 95% Wilson score confidence intervals. Across datasets and domains, spontaneous absence detection was low but rose sharply when participants were guided to search for missing information.
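The Wilson score interval used for the error bars can be sketched as follows. This is a minimal illustration of the standard formula rather than the authors’ plotting code; the function name and the example count (14 detections out of 50 participants) are assumptions.

```python
import math


def wilson_interval(successes, n, z=1.959964):
    """Wilson score confidence interval for a binomial proportion.

    z = 1.959964 corresponds to a two-sided 95% interval.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p_hat = successes / n
    denom = 1 + z ** 2 / n
    center = (p_hat + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n + z ** 2 / (4 * n ** 2)) / denom
    return center - half_width, center + half_width


# Hypothetical cell (assumption): 14 detections among 50 participants (28%).
low, high = wilson_interval(14, 50)
print(f"95% Wilson CI: [{low:.3f}, {high:.3f}]")
```

Unlike the simple normal-approximation interval, the Wilson interval stays within $[0, 1]$ and behaves well for proportions near the boundaries, which matters for guided-condition rates close to 90%.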

These results confirm that absence recognition depends strongly on attentional direction rather than perceptual availability.

5.3. Summary of Main Findings

We summarize the results in the following key findings.

(1) Low spontaneous detection of absence. In the spontaneous phase, when participants freely described what stood out in the visualizations, absence detection was generally low ( $\approx$ 40% overall), showing that most participants focused on visible features even when cues were available.

(2) Prompting as the dominant driver of absence recognition. When guided, detection rose by roughly 50 pp across domains, more than doubling the overall spontaneous rate, a large, highly significant effect ( $V\approx.5$ ).

(3) Reference framing: small but consistent influence. Partial references modestly improved detection compared to Global references overall ( $\Delta\approx 10–15$ pp; $V\approx.12$ ), with the gain concentrated in the spontaneous phase ( $+19.3$ pp, $V=.20$ ) rather than the guided phase ( $+3.3$ pp, $V=.05$ ), indicating small but consistent contextual facilitation.

(4) No strong interaction between framing and prompting.

(5) Cognitive guidance over visual availability. The clear difference between spontaneous and guided phases shows that the difficulty of perceiving absence lies not in visual salience but in cognitive framing. Identical visualizations elicited qualitatively different interpretations depending solely on whether participants were prompted to search for what was missing. Guidance effectively redirected attention from presence-based description to absence-based reasoning, revealing that absence detection in visualization is governed primarily by how attention is directed, not by what is visibly present.

Statistical Note.

We conducted four planned comparisons (Partial vs. Global within each prompting phase, and Spontaneous vs. Guided within each reference type), all defined a priori, so no multiple-testing correction was applied in the primary analysis (Perneger, 1998).

Chi-square tests revealed large, highly significant prompting effects for both reference types (Global: $\chi^{2}(1)=102.17$ , $p<.001$ , $V=0.58$ ; Partial: $\chi^{2}(1)=61.81$ , $p<.001$ , $V=0.45$ ). Reference frame effects were smaller: a modest but reliable difference in the spontaneous phase ( $\chi^{2}(1)=11.65$ , $p=.0006$ , $V=0.20$ ) and a non-significant difference in the guided phase ( $\chi^{2}(1)=0.90$ , $p=.343$ , $V=0.06$ ). As a robustness check, applying a conservative Bonferroni correction for the four comparisons ( $\alpha=.05/4=.0125$ ) does not alter these conclusions: both prompting effects remain highly significant, and only the spontaneous framing effect meets the adjusted threshold. Fisher’s exact tests yielded equivalent results, confirming robustness to small expected cell sizes.
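The Bonferroni check described above amounts to dividing the nominal $\alpha$ by the number of comparisons and re-evaluating each $p$-value against the adjusted threshold. The short Python sketch below illustrates this; the dictionary labels are hypothetical, and the two prompting effects, reported only as $p<.001$ , are represented by the upper bound $.001$ as an assumption.

```python
# Nominal family-wise alpha and the reported p-values for the four comparisons.
alpha = 0.05
p_values = {
    "prompting, Global reference": 0.001,   # reported as p < .001; upper bound (assumption)
    "prompting, Partial reference": 0.001,  # reported as p < .001; upper bound (assumption)
    "framing, spontaneous phase": 0.0006,
    "framing, guided phase": 0.343,
}

# Bonferroni: each test is evaluated at alpha divided by the number of tests.
adjusted_alpha = alpha / len(p_values)
significant = {name: p < adjusted_alpha for name, p in p_values.items()}

print(f"adjusted alpha = {adjusted_alpha:.4f}")
for name, is_sig in significant.items():
    print(f"{name}: {'significant' if is_sig else 'not significant'}")
```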

6. Discussion

6.1. Theoretical Implications

The present study advances the understanding of how reference framing and cognitive guidance shape human recognition of absence. Consistent with theories of expectation-based perception (Margoni et al., 2024), our findings demonstrate that detecting missing information depends on the availability and explicitness of a reference model.

Across domains, spontaneous absence detection varied substantially by reference condition. While many participants initially focused on present features, a notable proportion, especially under the partial reference, identified missing categories even without explicit prompting. This shows that the limitation is not an inability to perceive absence but a tendency for attention to default toward visible information unless expectations are actively engaged.

Reference effect.

The results show that the kind of reference frame provided influences the way people form expectations: a global frame supplies an explicit baseline, while a partial frame prompts them to infer it from examples. This distinction appears to correspond to two cognitive pathways for comparison. In the global condition, expectation is externally provided, and the viewer’s task is to align observations with a ready-made standard. In the partial condition, expectation must be internally constructed from limited cues, requiring participants to abstract a pattern that generalizes beyond the given examples. These internally generated expectations are effortful but more cognitively engaging: they invite hypothesis formation and promote active comparison between what is observed and what is missing. Participants noticed absence more often in the partial condition when unprompted, suggesting that encouraging viewers to construct their own expectations enhances awareness of absence.

Reference framing is therefore not a neutral visualization choice but an active determinant of cognitive access to expectation: it determines whether expectation is supplied by the system or built by the user.

Guidance effect.

Guidance amplified absence detection across all conditions: when participants were asked to consider “what should be there but isn’t”, detection rates rose sharply. This contrast shows that recognition of absence depends less on the visibility of the data and more on whether viewers are encouraged to think about what should be present and notice when it is missing.

This result aligns with the long-documented feature-positive effect (Newman et al., 1980; Allan, 1993): animals and humans detect and learn from presences more readily than from absences. Neuroscientific evidence further suggests that perceiving “nothing” recruits distinct neural systems from those encoding positive features (Nieder, 2016). In our task, this asymmetry manifested behaviorally: participants naturally described what they saw rather than what they failed to see. Yet, the rapid shift following an explicit absence-oriented prompt indicates that absence reasoning is cognitively available but typically dormant, requiring attentional reframing to surface. Notably, even when directly prompted, a few participants claimed that nothing was missing, or looked for missing details in the graph itself, e.g., “E1 has no country name”.

Cognitive load.

Absence detection is shaped by heuristics beyond the feature-positive effect and is highly sensitive to cognitive load. Under higher load, viewers are more likely to rely on fast, memory-driven strategies rather than systematically validating which categories should appear. The availability heuristic may bias attention toward categories most salient in memory (or visually prominent), making less familiar or less salient absences harder to notice. Similarly, representativeness may lead viewers to treat distributions that resemble familiar prototypes as “complete,” prematurely terminating search even when semantically important categories are missing. In addition, partial references require viewers to synthesize expectations from a small set of exemplars. When users possess relevant domain knowledge, such a synthesis enables them to infer what “should” be present and to identify when it is not. However, when domain familiarity is low, this inferential process may be more fragile (Gigerenzer et al., 1988). This highlights an important design trade-off: partial framing promotes active expectation construction, yet it may place greater cognitive demands on novice users. These heuristics help explain variability in spontaneous absence detection and illustrate why targeted prompting can be so effective.

Extending Neisser’s perceptual-cycle account to the domain of absence detection, we propose that partial input can strengthen schema construction by initiating an anticipatory model while still requiring exploratory sampling to “fill in” what is expected. In this view, the additional effort induced by partial references is not merely a cost: it can deepen the internal reference structure that makes absences meaningful and detectable (Neisser, 1976).

Partial and global frames may impose different cognitive loads. Partial framing increases intrinsic load by requiring users to abstract across exemplars, yet this effort appears to deepen expectation formation and sharpen sensitivity to absences. Global framing reduces cognitive load by providing an explicit baseline, but may encourage more surface-level comparisons. Guided prompting introduces an attentional shift that can reduce the cognitive burden of searching for relevant absences. These results suggest that visualization systems should balance cognitive demands with the benefits of expectation construction, adjusting framing strategies to the user’s expertise and task context.

Taken together, the data support a layered cognitive account. While spontaneous viewing is largely guided by presence-oriented attention, some absence recognition still emerges when a reference frame provides enough context to evoke implicit expectations. Guided viewing, in turn, amplifies this latent sensitivity by explicitly directing attention toward what should be present but is not. Reference framing modulates this interplay: a global reference supplies a clear external model for comparison, whereas a partial reference invites participants to construct one internally—both enabling recognition, though through different routes. The ability to “see what is not there” therefore depends not only on explicit guidance, but also on how strongly the visualization itself supports the activation of expectations that make absence perceptible.

6.2. Practical and Societal Implications

The results highlight framing as a supporting factor in how users recognize and engage with what is missing. This finding has direct implications for the design of visualization systems, analytic dashboards, and intelligent interfaces that aim to support reasoning about incomplete, uncertain, or biased data. Rather than treating reference information as a fixed background, designers can use it as a cognitive lever that shapes how expectations are formed and compared.

These implications extend to intelligent user interfaces and AI-supported analytic systems. Many AI tools, such as summarization systems, recommender systems, and LLM-based assistants, provide selective information that can implicitly narrow user expectations. Interfaces that surface what is expected but absent can help mitigate over-trust and reveal blind spots in model outputs. Expectation-oriented framing can be integrated into explanation interfaces through exemplar-based comparisons (partial frames) or aggregate baselines (global frames). Guided prompting, as shown in our study, may further support users in identifying omissions in AI-generated summaries or decision-support dashboards. These mechanisms align with broader goals in explainable AI to help users understand not only what a model shows, but also what it leaves out.

Framing as a design parameter.

A global frame, which makes the reference explicit and comprehensive, provides users with a clear normative context for rapid evaluation. Such framing is effective in domains where consistency and efficiency are needed, such as monitoring dashboards in healthcare or finance, where users must quickly detect deviations from established baselines. By showing overall distributions, averages, or benchmark indicators alongside individual cases, global framing helps users spot large-scale gaps or anomalies with minimal effort. However, it may also encourage surface-level verification, e.g., checking whether something conforms, rather than reflective questioning about what might be absent. In contrast, a partial frame encourages the construction of expectations from a limited set of examples. This design strategy engages users more actively: they must infer “what should be there” based on partial cues, leading to deeper cognitive processing and increased sensitivity to omissions. Partial framing may therefore be advantageous in exploratory or educational contexts, such as scientific data analysis, public dashboards explaining social indicators, or learning environments, where engagement and hypothesis formation are more important than speed. In such cases, withholding some reference information (for instance, showing representative samples instead of complete aggregates) can prompt users to actively reconstruct the missing structure, which our data show enhances absence detection even without explicit guidance.

Adaptive and mixed framing.

Because global and partial references serve different cognitive purposes, visualization systems can treat framing as an interactive design parameter, rather than a static choice. Interactive interfaces can allow users to shift dynamically between global and partial references, adjusting the level of explicitness as their understanding evolves. For example, a climate-change dashboard could begin with region-level global baselines for temperature and emissions, then allow users to explore smaller subsets (partial frames) to infer where data are sparse or missing. Similarly, a hospital analytics interface might start with global performance benchmarks across departments, then switch to partial contextual views that encourage clinicians to identify overlooked risk factors or unrecorded conditions.

Implications for AI and intelligent user interfaces.

Although our empirical stimuli are data visualizations, the core idea we present is the detection of expected but absent categories relative to a reference frame. This idea generalizes to AI-mediated sense-making. LLM-based systems often produce selective summaries or analyses that omit semantically expected elements (e.g., key topics, entities, or subgroups), and users may fail to notice these omissions without an explicit reference. Our results motivate expectation-aware IUI strategies, such as making the underlying reference frame explicit (global or partial) and integrating guided prompting as an explanation mechanism to steer attention toward what should be checked.
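To make the notion of “expected but absent relative to a reference frame” concrete, the following Python sketch shows one way an interface might flag such categories. It is an illustrative assumption, not the procedure used in the study: the function names, the thresholds (min_share, min_support), and all category shares are hypothetical. A global frame is modeled as a single baseline distribution and a partial frame as a small set of exemplar distributions.

```python
def expected_categories(reference_distributions, min_share=0.05, min_support=0.5):
    """Categories whose share is >= min_share in at least a min_support
    fraction of the reference distributions (hypothetical rule)."""
    counts = {}
    for dist in reference_distributions:
        for category, share in dist.items():
            if share >= min_share:
                counts[category] = counts.get(category, 0) + 1
    needed = min_support * len(reference_distributions)
    return {category for category, k in counts.items() if k >= needed}


def absent_expected(target, reference_distributions, **kwargs):
    """Expected categories that have zero share (or are missing) in the target."""
    expected = expected_categories(reference_distributions, **kwargs)
    return sorted(c for c in expected if target.get(c, 0.0) == 0.0)


# Hypothetical energy mixes (assumption; not the study stimuli).
target = {"gas": 0.70, "coal": 0.20, "solar": 0.10}

# Global frame: one aggregate baseline.
global_frame = [{"gas": 0.25, "coal": 0.35, "hydro": 0.15,
                 "nuclear": 0.10, "wind": 0.08, "solar": 0.07}]

# Partial frame: a few concrete exemplars.
partial_frame = [
    {"gas": 0.30, "coal": 0.30, "hydro": 0.40},
    {"coal": 0.50, "hydro": 0.30, "wind": 0.20},
    {"gas": 0.20, "hydro": 0.60, "nuclear": 0.20},
]

missing_vs_global = absent_expected(target, global_frame)
missing_vs_partial = absent_expected(target, partial_frame)
print("Absent relative to global frame:", missing_vs_global)
print("Absent relative to partial frame:", missing_vs_partial)
```

In this toy setup the two frames imply different expectations: the global baseline marks every sufficiently large category as expected, whereas the partial frame only marks categories shared by most exemplars. A guided prompt could then be built from the resulting list.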

Societal implications.

Through the lenses of framing and expectations, absence detection can inform the design of intelligent interfaces that promote awareness, accountability, and critical engagement with data. By embedding mechanisms that surface missing categories, silent model features, or unreported metrics, designers can turn absences into signals for reflection rather than blind spots. Partial references, as shown in our study, may serve as a useful design strategy to stimulate cognitive attention, inviting users to actively reconstruct what might be missing instead of passively accepting what is shown. Such expectation-oriented framing can help users recognize incompleteness in algorithmic explanations, dashboards, or public data displays, supporting more cautious and equitable interpretations. In this way, interfaces that make absences perceptible do not merely fill information gaps. They help cultivate a mindset of mindful inquiry, strengthening trust and accountability in human–AI collaboration.

6.3. Limitations and Future Work

This study has several limitations. The guided prompt, presented in the second phase, focused only on absences; future work should test symmetric prompts for surpluses. The task used static bar charts and written responses, unlike real interactive settings. Automated text analysis may have missed subtle expressions of omission, and the sample of English-speaking adults viewing categorical data limits generalization across cultures and data types.

Future research should further test how expectation framing interacts with domain familiarity, data sparsity, and cognitive load measurement. A key question emerging from both human and algorithmic evidence is whether systems should aim to help users construct expectations or provide them explicitly. Bridging these two modes of internal inference and external reference might advance the design of visualization systems and intelligent interfaces that not only display data, but also help people perceive and reason about what is missing.

7. Conclusion

This study shows that people’s ability to recognize what is missing in data depends on how reference information is framed. When expectations had to be inferred from a few examples (partial reference), participants detected absences as well as, or even better than, those viewing an explicit global baseline. This suggests that constructing expectations internally can enhance engagement and sharpen sensitivity to missing information. Under guided prompting, absence detection rose sharply across all conditions, eliminating differences between reference types and indicating that explicit attentional direction can override structural framing effects. Together, these results indicate that recognizing absence is not a perceptual limitation but an attentional and contextual process shaped by how expectations are built and applied. Designing visualizations that help users “see what is not there” therefore requires cognitive and interaction designs that balance explicit guidance with opportunities for active expectation construction, which demands deeper attention.

Generative AI Usage Disclosure

Generative AI tools (ChatGPT, GPT-5) were used exclusively for language refinement of this manuscript, including minor edits to improve grammar, clarity, and readability. All conceptual development, data analysis, and methodological decisions were made by the authors.

References

L. G. Allan (1993) Human contingency judgments: rule based or associative? Psychological Bulletin 114 (3), pp. 435. Cited by: §6.1.
S. Attfield, S. Hara, and B. Wong (2010) Sense-making in visual analytics: processes and challenges. In EuroVAST 2010: The 1st European Symposium on Visual Analytics Science and Technology. Cited by: §1, §2.4.
B. Barnett and S. M. Fleming (2024) Symbolic and non-symbolic representations of numerical zero in the human brain. Current Biology 34 (16), pp. 3804–3811. Cited by: §2.3.
D. Cavedon-Taylor (2017) Touching voids: on the varieties of absence perception. Review of Philosophy and Psychology 8 (2), pp. 355–366. Cited by: §1, §2.3.
J. T. Coldren and R. A. Haaf (2000) Asymmetries in infants’ attention to the presence or absence of features. The Journal of Genetic Psychology 161 (4), pp. 420–434. Cited by: §2.1.
H. Cramér (1999) Mathematical methods of statistics. Vol. 9, Princeton University Press. Cited by: §4.4.
A. C. Doyle (2024) The adventure of Silver Blaze. Modernista. ISBN 9789180945653. Cited by: §1.
A. Farennikova (2013) Seeing absence. Philosophical Studies 166 (3), pp. 429–454. Cited by: §1.
Z. Feng, F. Zhu, H. Wang, J. Hao, S. Yang, W. Zeng, and H. Qu (2024) HoLens: a visual analytics design for higher-order movement modeling and visualization. Computational Visual Media 10 (6), pp. 1079–1100. Cited by: §2.4.
G. Gigerenzer, W. Hell, and H. Blank (1988) Presentation and content: the use of base rates as a continuous variable. Journal of Experimental Psychology: Human Perception and Performance 14 (3), pp. 513. Cited by: §1, §2.4, §3.1, §6.1.
E. Hearst (1989) Backward associations: differential learning about stimuli that follow the presence versus the absence of food in pigeons. Animal Learning & Behavior 17 (3), pp. 280–290. Cited by: §1.
D. Kahneman (2011) Thinking, fast and slow. Macmillan. Cited by: §2.1.
S. Kandel, A. Paepcke, J. Hellerstein, and J. Heer (2011) Wrangler: interactive visual specification of data transformation scripts. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 3363–3372. Cited by: §2.2.
P. Legrenzi, V. Girotto, and P. N. Johnson-Laird (1993) Focussing in reasoning and decision making. Cognition 49 (1-2), pp. 37–66. Cited by: §1.
F. Margoni, L. Surian, and R. Baillargeon (2024) The violation-of-expectation paradigm: a conceptual overview. Psychological Review 131 (3), pp. 716. Cited by: §1, §2.3, §2.4, §6.1.
J. Martin and J. Dokic (2013) Seeing absence or absence of seeing?. Thought: A Journal of Philosophy 2 (2), pp. 117–125. Cited by: §1.
M. Mazor, K. J. Friston, and S. M. Fleming (2020) Distinct neural contributions to metacognition for detecting, but not discriminating visual stimuli. eLife 9, pp. e53900. Cited by: §1, §2.3.
M. Mazor (2025) Inference about absence as a window into the mental self-model. Open Mind 9, pp. 635–651. Cited by: §2.3.
J. D. Meuwese, A. M. van Loon, V. A. Lamme, and J. J. Fahrenfort (2014) The subjective experience of object recognition: comparing metacognition for object detection and object categorization. Attention, Perception, & Psychophysics 76 (4), pp. 1057–1068. Cited by: §2.3.
O. Mokryn and H. Ben-Shoshan (2021) Domain-based latent personal analysis and its use for impersonation detection in social media. User Modeling and User-Adapted Interaction 31 (4), pp. 785–828. Cited by: §1, §2.4.
O. Mokryn, T. Lazebnik, and H. Ben-Shoshan (2025) Interpretable transformation and analysis of timelines through learning via surprisability. Chaos: An Interdisciplinary Journal of Nonlinear Science 35 (7). Cited by: §1, §2.4.
S. Mumford (2021) Absence and nothing: the philosophy of what there is not. Oxford University Press. Cited by: §2.3, §2.
U. Neisser (1976) Cognition and reality: principles and implications of cognitive psychology. Cited by: §2.4, item Partial reference, §6.1.
J. P. Newman, W. T. Wolff, and E. Hearst (1980) The feature-positive effect in adult human subjects. Journal of Experimental Psychology: Human Learning and Memory 6 (5), pp. 630. Cited by: §1, §2.1, §3.2, §6.1.
A. Nieder (2016) Representing something out of nothing: the dawning of zero. Trends in Cognitive Sciences 20 (11), pp. 830–842. Cited by: §6.1.
T. V. Perneger (1998) What’s wrong with Bonferroni adjustments. BMJ 316 (7139), pp. 1236–1238. Cited by: §5.3.
H. Song and D. A. Szafir (2018) Where’s my data? evaluating visualizations with missing data. IEEE Transactions on Visualization and Computer Graphics 25 (1), pp. 914–924. Cited by: §2.1.
J. Suschnigg, B. Mutlu, G. Koutroulis, H. Hussain, and T. Schreck (2025) MANDALA—visual exploration of anomalies in industrial multivariate time series data. In Computer Graphics Forum, Vol. 44, pp. e70000. Cited by: §2.4.
X. Tang, H. Chen, D. Lin, and K. Li (2024) Harnessing LLMs for multi-dimensional writing assessment: reliability and alignment with human judgments. Heliyon 10 (14). Cited by: §4.4.
A. Treisman and S. Gormican (1988) Feature analysis in early vision: evidence from search asymmetries. Psychological Review 95 (1), pp. 15. Cited by: §2.1.
A. Tversky and D. Kahneman (1974) Judgment under uncertainty: heuristics and biases: biases in judgments reveal some heuristics of thinking under uncertainty. Science 185 (4157), pp. 1124–1131. Cited by: §1, §3.2.
H. Wickham, M. Averick, J. Bryan, W. Chang, L. D. McGowan, R. François, G. Grolemund, A. Hayes, L. Henry, J. Hester, et al. (2019) Welcome to the tidyverse. Journal of Open Source Software 4 (43), pp. 1686. Cited by: §4.4.