Theeuwes (2022) argued that salient distractors generally have the inherent power to capture attention against our will, at least when they are more salient than the target. To explain why salient distractors sometimes fail to produce capture effects, Theeuwes proposed that capture can be avoided when people engage in a difficult, serial search, yielding a small attentional window. The theory seems unlikely as, even in a difficult search, one still presumably first searches broadly to decide where to best begin the search. More importantly, there is a simple alternative explanation for when capture occurs.
The studies cited by Theeuwes (2022) as demonstrating capture by color singletons all involved targets that are themselves singletons (or “near singletons”). When the target is a singleton, participants tend to search for singletons (singleton detection mode), making the color singleton task-relevant (e.g., Bacon & Egeth, 1994; Lamy & Egeth, 2003; Ruthruff et al., 2020). When the target is not a singleton, participants instead search for a specific target feature (feature search mode). Search mode (singleton vs. feature) provides an elegant and compelling explanation of the existing data, making it unnecessary to invoke attentional windows (see also Kerzel & Huynh Cong, 2022).
To illustrate our point, Table 1 shows four possible categories of search, defined by whether search is easy/parallel or difficult/serial and whether the target is a singleton or not. The attentional window account predicts that the critical factor determining capture is search ease, whereas the target search mode account predicts that the critical factor is whether the target is a singleton. Both accounts predict capture in the upper left quadrant (easy search with singleton targets) and both predict no capture in the lower right quadrant (difficult search with nonsingleton targets). The studies described by Theeuwes (2022) all fall into the two quadrants for which both accounts agree. What is missing from his review are studies in the other two cells, for which the accounts actually disagree. Below we review these findings.
Table 1
Capture studies can be divided into four categories based on whether the search is easy (parallel) vs. difficult (serial) and whether the target is a singleton or a nonsingleton. According to the attentional window account, capture occurs with an easy, parallel search but not with a difficult, serial search. The target search mode account, however, predicts that capture occurs with a singleton target but not with a nonsingleton target. The shaded cells are the only ones where the two models make conflicting predictions. Yes: capture; No: No capture.
| EASY/PARALLEL SEARCH | DIFFICULT/SERIAL SEARCH | |
|---|---|---|
| Singleton Target Search | Target Search Mode: Yes Attentional Window: Yes | Target Search Mode: Yes Attentional Window: No |
| Nonsingleton Target Search | Target Search Mode: No Attentional Window: Yes | Target Search Mode: No Attentional Window: No |
Easy Search with a Nonsingleton Target
Under easy search with a nonsingleton target, the attentional window account predicts capture but the target search mode account predicts no capture. Unfortunately, such studies seem rare. Theeuwes (2022) cited Theeuwes (2004; see Figure 1 for an example display) that reported capture in this cell (but see Wienrich and Jancyz, 2011, who failed to replicate the findings). As the original Theeuwes (2004) paper states, “observers searched… for a shape singleton (a diamond among circles)” (p. 67). But because there were two distractor singletons (a square and a triangle) in addition to the target singleton (a diamond), Theeuwes (2022) argued that “feature search mode had to be used”. However, this does not imply the absence of singleton detection mode. Searching for a singleton would help eliminate 17 of 20 search elements, reducing the effective setsize down to only 3 items.

Figure 1
An example of the search display (setsize 20) in Theeuwes (2004).
To test between the two competing accounts, what is needed is an easy search with a target that is clearly a nonsingleton. One such example is provided by Lien et al. (2010). Their displays had one letter in the target color (which changed from trial to trial), one in a distractor color, and two white letters (see Figure 2). Although their search was easy (a search slope of only about 6 ms/item), a color singleton precue produced little capture effect. A critic could reasonably note that the setsize was only four and it is logically possible that participants rapidly disengaged from the precue, before the target arrived. But note that when Folk et al. (1992; Experiment 4) used these same conditions – 4-item displays with precuing – combined with a singleton target, they observed large capture effects (40+ ms cue validity effect).

Figure 2
An example event sequence for the valid condition in Lien et al. (2010).
Difficult Search with a Singleton Target
Under difficult search with a singleton target, the target search mode account predicts capture but the attentional window account predicts no capture. Studies meeting this requirement are also relatively rare. However, one potential example is Ruthruff et al. (2020; Experiments 1 and 2) where participants searched for a circle (a singleton target) amongst a homogeneous background of ovals. In some conditions the distractor ovals were nearly circular, creating a very difficult search; mean response time (RT) was 1,041 ms (Experiment 1), nearly double that of the easy search (610 ms). Nevertheless, the color singletons produced unusually large cue validity effects (121 ms in Experiment 1 and 70 ms in Experiment 2). Clearly, difficult search does not eliminate capture effects, as suggested by the attentional window account. In Experiment 3, making the search display distractors heterogenous (discouraging singleton search mode) virtually eliminated cue validity effects (1 ms). Search RT was nearly unchanged. Thus, the dramatic decrease of the cue validity effect cannot be explained by search difficulty, but it can easily be explained by target search mode.
Capture under difficult search is even more clear for the case of abrupt onsets. In fact, Gaspelin et al. (2016) found that increasing search difficulty reliably increased capture effects by abrupt onsets (see also Gaspelin et al., 2012). Their most difficult visual search produced an enormous capture effect of 141 ms (see also Ruthruff et al., 2020, Experiment 4). This search is much more difficult, with longer RTs, than the visual searches that Theeuwes (2022) claims to have stifled capture by color singletons.
Concluding Remarks
The view that capture is determined by salience has been questioned by numerous findings of no capture by color singletons. Theeuwes (2022) proposed that this is because the search was too difficult, promoting serial search and a small attentional window that excludes salient distractors. We have reviewed evidence that search difficulty is not the key factor. For abrupt onsets, capture occurs even with very difficult searches. For color singletons, capture does not occur even for the easiest of searches, unless the color singleton is made task relevant by promoting singleton detection mode. The literature on capture by salient color singletons is predominated by studies promoting singleton detection mode and we believe that is unfortunate.
Ethics and Consent
Ethical approval and/or consent was not required.
Competing Interests
The authors have no competing interests to declare.
