The Flexibility of Working Memory in Drawing on Episodic Long-Term Memory Representations in Serial Recall

Ana Rodriguez; Philipp Musfeld; Lea M. Bartsch

doi:10.5334/joc.451

Full Article

When we are required to retain information for a brief period of time in order to accomplish current goals, we make use of our working memory (WM) system. Compared to long-term memory (LTM) – a system that can store extensive amount of information over long periods of time (Cowan, 2008; Squire, 2004) – WM is very limited in the amount of information that it can hold (Cowan, 2008; Oberauer, 2009).

Despite the functional differences between WM and LTM, both systems interact (Atkinson & Shiffrin, 1968; Cowan, 2008; Unsworth & Engle, 2007). In fact, current theories of WM (Cowan, 1999; Oberauer, 2009) propose that the WM system comprises activated representations in LTM that are actively bound to the current context, and one core function of WM would be to hold those bindings for task resolution (Oberauer, 2019). For instance, processing information in WM has been proposed to lead to the creation of new LTM representations (Atkinson & Shiffrin, n.d.; Bartsch et al., 2018; Cotton & Ricker, 2021, 2022), and LTM representations could help in using the limited capacity of WM more efficiently (Bartsch & Oberauer, 2023; Bartsch & Shepherdson, 2022; Chen & Cowan, 2005).

Contributions of LTM to WM

Contributions from LTM to WM are manifold and can result from many different aspects of our knowledge system (Bartsch, Fukuda, et al., 2024). Among others, this can include contributions from our lexical and phonological system (Hulme et al., 1991; Jefferies et al., 2006; Kowialiewski & Majerus, 2018; Saint-Aubin & Poirier, 2000), which are fundamental components of language, enabling the perception and comprehension of verbal information and thus supporting its encoding and retrieval; semantic knowledge about specific facts (Ericsson & Kintsch, 1995) that allows us to comprehend complex contents, or episodic knowledge, resulting from our previous experiences (Tulving, 1993, 2002).

Here, we want to focus on the contribution of episodic memory to WM, as this is an area which, so far, has received limited attention.

Episodic memory refers to our memory for specific events, which include context information about when (temporal), and where (spatial) these events have occurred. Classically, episodic memory has been distinguished from semantic memory, which is supposed to be independent of specific contextual information (Tulving, 1972; Renoult et al., 2019). Two ways have been proposed in which episodic knowledge can contribute to WM.

First, episodic LTM representations have been argued to be immediately formed within WM, serving as the basis of new rapid learning (Cowan, 2019). These representations can be accessed instantly and remain active throughout an immediate test of WM, but their contributions to performance are more likely to occur when WM capacity has been exceeded (Bartsch & Oberauer, 2023; Oberauer & Awh, 2022).

Second, another way in which episodic LTM can contribute to WM performance is through prior knowledge. Episodic prior knowledge represents the memory of events that participants have experienced in the past. In an experimental context, it is typically studied and manipulated through a learning phase prior to the WM task (e.g. Chen & Cowan, 2005). Here, participants are required to learn new associations that later on are included in WM trials.

Proposed mechanisms underlying the contributions of pre-learnt episodic LTM to WM performance

Including pre-learnt LTM information – which participants learn at the beginning of an experiment – within a test of WM has been demonstrated to benefit WM performance (Bartsch & Shepherdson, 2022; Chen & Cowan, 2005, 2009; Norris & Kalm, 2021; Thalmann et al., 2019), especially when WM load is high (Bartsch, Frischkorn, et al., 2024). Such a contribution of episodic LTM to WM has been shown to benefit immediate memory performance in two ways: First, immediate memory for information presented in a task that matches a stored episodic representation is better compared to novel information (Bartsch, Frischkorn, et al., 2024; Bartsch & Shepherdson, 2022, 2023; Chen & Cowan, 2005, 2009; Norris et al., 2020; Thalmann et al., 2019). Second, apart from better memory for the pre-learnt information itself, the presence of such pre-learnt information can free-up capacity for novel information (Bartsch & Shepherdson, 2022, 2023; Musfeld et al., 2024; Norris et al., 2020; Thalmann et al., 2019).

It is assumed that when pre-learnt LTM representations are included in trials of a WM task, benefits on novel information are due to a reduction in WM load at encoding (Norris & Kalm, 2021). Two main mechanisms have been discussed in the literature to account for this reduction in WM load: chunking and offloading. When pre-learnt information is recognized, chunking strategies could arise, allowing the information to be re-encoded or compressed into smaller units (Brady et al., 2009; Chekaf et al., 2016; Chen & Cowan, 2005; Norris & Kalm, 2021; Thalmann et al., 2019). Specifically, the multiple elements of an LTM representation could be compressed and integrated in a unified representation (Brady et al., 2009; Chekaf et al., 2016), or individuals could selectively encode just pieces of the information and retrieve the associated information directly from LTM at test (Chen & Cowan, 2005; Musfeld et al., 2024; Thalmann et al., 2019). This results in a reduced necessity for encoding the information in its original presentation in WM, hence freeing-up resources. Alternatively, the information could be offloaded to LTM (Bartsch & Shepherdson, 2022, 2023; Cowan, 2008; Huang & Awh, 2018; Ngiam et al., 2019; Schurgin et al., 2018). Here, pre-learnt and accessible information in LTM would not be encoded into WM at all, but is only retrieved at test given a recall cue, thereby freeing-up WM resources for encoding other information into WM.¹

In contrast, in case benefits arise solely for information in WM that matches pre-learnt knowledge stored in LTM, but no freeing of capacity is observed, a third alternative explanation has been proposed: redintegration. Redintegration is a process occurring at retrieval (rather than during encoding), where available LTM representations help to reconstruct degraded information held in WM (Hulme et al., 1991, 1997; Jones & Farrell, 2018). According to this account, the presence of LTM information does not alter WM representations during encoding (like it is the case for chunking or offloading), but it helps to rebuild a degraded WM representation during recall. This is more likely to be successful, when the LTM representation is strong and easily accessible (Schweickert, 1993; Thorn et al., 2005). Thus, redintegration entails that LTM information is encoded and maintained as originally presented (Norris et al., 2020), and does not predict a freeing-up of capacity in WM. Rather, the immediate memory benefit is limited to the pre-learnt LTM over novel information because it is more likely to be reconstructed during testing.

The role of structure and representations – how flexible can WM draw on LTM?

In previous studies on the topic using verbal representations, both, the information that participants learnt prior to the WM task, as well as the stimuli presented in the WM task itself, had the same encoding structure (i.e. simultaneous presented item-item associations in form of word pairs; Bartsch & Shepherdson, 2022, 2023; Chen & Cowan, 2005, 2009).

The use of the same encoding structure might have facilitated the recognition of LTM information, making it easier to engage in strategies that ultimately reduced the amount of resources required for encoding LTM word pairs (Bartsch & Shepherdson, 2022, 2023; Chen & Cowan, 2005, 2009). This, in turn, could have caused the observed freeing of capacity (the benefits on novel information), when the lists were a mix of pre-learnt and new representations (Bartsch & Shepherdson, 2022, 2023).

While the aforementioned research has provided insight into when and how WM draws on LTM when WM and LTM representations are matched in structure, a question remains: How does WM flexibly draw on prior knowledge stored in episodic LTM when the stored representations do not match the structure of representations required for a WM task? These circumstances can be created by breaking up the representational structure of pre-learnt information when used in a WM task. Specifically, when episodic LTM representations of word pairs comprise an item-item structure (e.g., Chen & Cowan, 2005, 2009), and the information presented in serial recall tasks leads to the creation of representations of item-positional bindings (Burgess & Hitch, 1999; Henson, 1999). So far only two studies have investigated the benefit of pre-learnt item-item associations in such a case – on immediate serial recall.

On the one hand, Norris et al., (2020) used a serial recall task with mixed lists that included words that matched pre-learnt associations in LTM and newly encountered singletons (i.e., novel words, not paired with another word). They found that superiority in performance of lists including pre-learnt word pairs was driven predominantly by better memory for the LTM information itself, consistent with a redintegration account. At the same time, there was no clear advantage for singletons within the same list, except for the condition in which the lists included three word pairs. This advantage was attributed to the constrained serial positions in which singletons could appear when more word pairs were included within the list.

On the other hand, Thalmann et al. (2019, Experiment 1) presented participants with WM trials of two independent word lists (of 2 or 4 items), each presented sequentially, followed by serial recall of each list in random order. When one list consisted of pre-learnt associations of either two or four elements, there was evidence for a reduction in WM load – namely, the other list comprising of novel stimuli was remembered better compared to a condition where both lists where novel. Hence, freeing-up of WM capacity was observed.

Although converging on evidence that at least the pre-learnt LTM information is remembered better than new words in an immediate serial recall task, these previous studies leave open the question of what information WM is drawing on when utilizing pre-learnt item-item episodic LTM representations, and under which circumstances this allows to free-up capacity for other items in WM.

Referring back to conceptualizations of the interaction of WM and LTM (such as Cowan, 1999; Oberauer, 2009), and in the context of serial recall, where pre-learnt item-item episodes are disrupted by the sequential presentation of the single elements, WM could draw on LTM in two ways: 1) via the activation of item-level information in LTM, and 2) through the binding of the activated representation to the correct serial position within the list. This means, that the WM system could draw on two types of information: (1) the item activation of each word in LTM individually or (2) the entire episode entailing the associations between the items once the words have been presented.

Depending on which of these WM can actually draw on, the processing of the information will have different implications that ultimately underly the benefit of pre-learnt episodic LTM on WM performance. If WM relies primarily on the activation of the individual LTM items, regardless of whether they were part of pre-learnt item-item representations, these items will have a higher level of activation compared to novel items, making them easier to redintegrate them at recall. However, if WM primarily draws on the entire episode (the elements including their episodic binding) the representation could be submitted to chunking or offloading, which in consequence will free-up WM capacity for processing novel information.

Locus of the benefits on novel information within WM

Lastly, past research has shown that benefits of contribution of LTM to WM are dependent on the position of the LTM information within the list (Mizrak & Oberauer, 2022, Thalmann et al., 2019, but see Bartsch & Shepherdson, 2023): In case WM can draw on the full episodic representation and the information can be chunked or offloaded, it has been observed that a reduction in WM load occurs for new information being presented following – not preceding – LTM information. These so-called proactive benefits particularly occur in case the LTM information is presented at the very beginning of the list. It has been proposed that this effect occurs because information already stored in WM experiences interference from the individually encoded LTM elements before the latter can be chunked or offloaded from WM.

In summary, it is unclear whether WM relies on the entire pre-learnt episode for small item-item representations, such as word pairs in intermixed lists for serial recall, after each word is presented sequentially, or if it draws from the activation of each associated word independently.

Additionally, if WM can flexibly draw on the pre-learnt episodes for serial recall, the information can be subjected to encoding strategies such as chunking or offloading. The use of these strategies can effectively free-up WM capacity, a benefit not observed with redintegration. However, it is unclear in this context, if a reduction in WM load is influenced by the exact position of the episode within the list, as offloading and recoding are likely more effective when LTM information is presented at the beginning of the list. To address this, it is necessary to control and counterbalance the serial positions of sequentially presented LTM words within the lists.

The present study

The present study aimed to investigate the extent to which WM can flexibly and effectively utilize prior episodic item-item associations to benefit WM performance, particularly when the WM task requires to remember item-positional bindings.

Our first goal was to investigate whether WM benefits from item activation in LTM, which would show a general benefit for words studied during a learning phase independent of the binding between them; or whether participants benefit from pre-learnt associations by drawing on a representation containing the episodic binding – even when the WM task requires the formation of new item-positional bindings.

Our second goal pertains to examining whether the inclusion of LTM representations that consist of pre-learnt item-item association can free-up WM capacity, depending on the position in which LTM word pairs were introduced within the list.

Across three experiments, participants underwent a LTM learning phase in which they were asked to memorize word pairs (item-item associations) before performing an immediate serial recall task. The serial recall task included lists of words presented individually in sequence. Critically, these lists included words presented in succession matching specific word pairs from the LTM learning phase (word pairs condition; item-item associations available), or two words in succession from different word pairs (singletons condition; high familiarity, but no item-item associations available).

If WM can draw on LTM representations encompassing a bound episode (i.e. a word pair), then immediate memory performance in which two words from a pre-learnt word pair are presented in succession should lead to higher overall recall accuracy compared to new words of the same trials (within the condition), and compared to new words presented in the same serial positions of trials in which only new words are presented (between conditions). If, however, WM flexibly benefits via the activation of single words based on their prior familiarity, then both conditions containing pre-learnt words should show improved performance of LTM words compared to new words within the same condition and compared to the condition with just new words at matching serial positions.

We expect a proactive benefit for new words, in case participants are indeed able to draw on the episodic representation during the WM task. This means that the benefit of LTM to WM should be predominant for conditions in which the LTM -word pairs are presented at the very beginning of the list. At this point, encoding of the individual LTM elements does not interfere with any previously encoded information, and chunking or offloading the information should reduce WM load and free-up capacity for upcoming new information. Alternatively, if the benefit of including LTM information (word pairs or singletons) merely manifests as improved performance for that information alone, we would not expect to observe a corresponding benefit for novel information, and the benefit would be best explained by a redintegration account occurring at recall.

Experiment 1

The goal of this Experiment was to investigate how flexible WM can rely on prior LTM information to enhance serial recall performance, either through item activation or by retrieving the entire episodic representation. Furthermore, by manipulating the positions in which LTM words were presented in succession within lists, we aimed to test if including LTM information consistent with pre-learnt episodes can free-up WM capacity selectively pro- or retroactively; or whether including LTM representations only improves immediate performance of those representations themselves, irrespective of the position in which they were presented within the list.

Method

Open Practices Statement

This study was not preregistered. Materials, data, and analysis scripts for the experiments are available on the Open Science Framework at: https://osf.io/r8tyc/.

Participants

We recruited 93 participants online via Prolific (M_age = 28.46 years), who indicated English as their first language, were from English speaking countries and had an approval rate between 90-100. However, only 62 participants met the inclusion criteria for analysis, which required that they successfully learnt at least 70% of the word pairs presented during the initial phase of the experiment. Participants of this and the following Experiments gave informed consent prior to the study. We chose the initial sample size of n = 60 for this experiment because it was sufficient to detect the effects of interest in a similarly complex previous within- subject design. Due to the use of Bayesian Statistics, the sample size could have been increased in case the evidence was ambiguous (Rouder, 2014). We considered a Bayes Factor (BF) > 3 as sufficiently informative to distinguish between the main hypothesis of interest (Kass & Raftery, 1995). The experiments were carried out in agreement with the rules of the Ethics Committee of the Faculty of Arts and Sciences of the University of Zurich and did not require special approval.

Materials and procedure

All experiments were programmed in jsPsych (de Leeuw et al., 2023) using the jspsych-psychophysics plugin (Kuroki, 2021). Figure 1 provides an overview of the general procedure of Experiment 1. It consisted of two main phases: an LTM learning phase and a WM task phase. Stimuli consisted of 320 English nouns chosen randomly for each participant from a set of 1192 nouns. Words were concrete, had a minimum and maximum length of 3 and 7 letters, and had an average frequency of 48.57 (SD = 94.11) according to the recommended Hyperspace Analogue to Language -HAL -criterion (Balota et al., 2007). Out of the 320 words, 40 were used to form 20 LTM word pairs.

Events representing the Procedure of Experiment 1. Panel A: LTM learning Phase. Panel B: WM encoding Phase; three different Conditions from Left to Right: **(a)** LTM Word Pair **(b)** LTM Singletons from different Word Pairs **(c)** New Words. Panel C: WM testing Phase: Typed Recall
*Note*. Doted frames represent the LTM words.

As depicted in Figure 1A, the experiment started with the LTM learning phase, which consisted of the presentation of 20 word pairs divided into 4 mini-blocks. Each mini-block sequentially displayed 5 word pairs at the centre of the screen for 3500 ms (e.g., jeans-cat) with an inter-stimulus interval (ISI) of 500 ms. This was followed by a test of each word pair in random order, in which the first word of each pair was presented as a probe, and participants were required to type in the associated word. After participants pressed the “enter” key, feedback was provided. If the response was correct, a message displaying “Correct!” in green appeared immediately. If the response was incorrect, an “X” with the correct word shown in red appeared instead. Feedback lasted 2000 ms. This process was repeated a second time, with a new random order of the word pairs across the mini-blocks. Finally, participants completed a final test for all the 20 word pairs in random order following the same test procedure described earlier. These testing phases interspersed with the study were intended to boost memory through the testing effect (see Sutterer & Awh, 2016 for a similar approach), as testing has shown to improve the retention of information on the long-term (Roediger III & Karpicke, 2006).

The WM task phase consisted of the immediate serial recall of 6 words presented sequentially, each for 1000 ms with an ISI of 500 ms. This phase was divided into 3 blocks, each consisting of 20 trials. Each block represented one of three conditions: The word pairs, the singletons and the new condition. Blocks and thereby the order of conditions were counterbalanced across participants. In the word pairs condition an LTM word pair was presented in succession within the list, retaining the order in which it was originally learnt (e.g. jeans – cat). The position of the word pair was manipulated across trials, so that across the entire experiment, it appeared twice at each possible position within the list (e.g., at position “1,2”, the first word of the pair was presented at serial position 1, and the second associated word at serial position 2). In the singletons condition two single words of two different episodes (i.e., LTM word pairs) were presented in succession within the list (e.g. jeans – rope). As in the word pairs condition, the position in which the words appeared within the list was counterbalanced across trials. In the new condition, only new words were presented. This served as our baseline condition to measure any beneficial effects of episodic LTM to the WM task (see Figure 1B).

After the presentation of a word list, immediate memory for serial order was tested. For half of the trials, immediate memory was tested via a typed recall test, while for the other half, a 12-alternative forced choice (12-AFC) recognition test was employed. The results of the 12-AFC recognition can be found in the supplementary materials. In brief, results are consistent with the findings of the typed recall test that are described below. The tests were inter-mixed within a block. For the typed recall test, participants were required to type each word in a box at the centre of the screen, in the same order as they were presented. Each box provided a cue indicating the word to type (e.g., “Word 1”), and after typing the word and pressing “enter”, a new empty box with the proper following cue appeared (e.g., “Word 2”). Recall requires retrieval of specific and contextual details (Yonelinas, 2002), which makes the test specifically sensitive to detecting benefits of information presented as an intact episode (i.e., in the word pairs condition).

Data Analysis

All analyses were conducted in a Bayesian framework using R, version 4.3.2 (R Core Team, 2023 with the following main R packages: tidyverse (Wickham et al., 2019), brms (Bürkner, 2017), bayestestR (Makowski et al., 2019). Data was analysed on a trial by trial basis, using the number of correct responses out of the total number of responses within a trial as the dependant variable. Correct responses were defined as accurately recalling a word in the position it was originally presented in –strict recall.

To account for typos in responses, we used the stringdist package (Van der Loo, 2014) in R with the Damerau-Levenshtein distance method. This allowed us to measure the similarity between a participant’s response and the correct answer by computing string distances. Responses with a distance of less than 2 were classified as correct.

To quantify the relative evidence in favour or against our hypotheses of interest, we computed BFs for nested models. If one model includes a parameter reflecting an effect of interest (M1), and a competing model omits this parameter, the BF can be used to quantify the relative evidence in favour or against this effect given the observed data. For instance, a BF₁₀ = 10 for a model including an effect of interest against a model omitting this effect of interest, would mean that the data are 10 times more likely under the model including the effect, compared to a model omitting the effect. Conversely, we can calculate the evidence against the presence of an effect, by computing the reciprocal of BF₁₀. Hereby, BF₀₁ =1/ BF₁₀ would indicate that the data are ten times more likely under the model omitting the effect over the model including the effect. Here, we used the Savage-Dickey density ratio method as an approximation of a nested model comparison for estimating the BF (Wagenmakers et al., 2010). The Savage-Dickey density ratio is calculated as the ratio between the prior and the posterior density of a parameter of interest (i.e., the effect of interest) at a theoretically interesting value. Here, this ratio is calculated for each parameter of interest at the value of 0, as this allows to quantify the evidence that the estimated parameter is different from 0, thereby indicating the presence or absence of an effect.

We implemented Bayesian hierarchical logistic regression models in brms (Bürkner, 2017) to estimate the binary accuracy of strict recall (correct/incorrect). We assumed a Binomial distribution predicted by the model through a logit link function. For all the analyses across the experiments, we included the maximal random-effects structure including random intercepts, as well as random participant effects for all the fixed-effects and their interactions in the model. Cauchy priors with a location parameter of 0 were assigned to all effect parameters. To assess the robustness of our results, we varied the scale of the Cauchy prior (.25, .5, .75, & 1) and re-estimated the BFs five times to guarantee results’ stability. From this, we report the median of the BFs along with the minimum and maximum values.

To test whether there were credible differences between the LTM and new words within both LTM conditions (word pairs and singletons), we fit a model where condition [word pairs, singletons and new] and word type [LTM, new] were included as fixed-effects and then computed the conditional effects for the pairwise comparisons of interest (LTM vs. new words within both the word pairs and singletons conditions). This allowed us to test whether there was credible evidence in favor of an immediate performance benefit for LTM words over new words within trials and whether this was due to a local advantage for episodic representations (benefits only for LTM words that matched pre-learnt word pairs) or to item activation of the words independent of their pre-learnt binding (benefits for both LTM words in the word pairs and singletons conditions).

To test whether there were differences for new words between conditions at each serial position of interest (i.e. proactive and/or retroactive benefits of LTM on WM), we fit three independent models using serial positions [1,2; 2,3; 3,4; 4,5; 5,6]; and condition [word pairs, singletons and new] as fixed-effects. We then computed the conditional effects for the pairwise comparisons of interest that will be described next.

The first model aimed to compare performance for LTM words of the word pairs and singletons conditions to performance of words of the new condition at the same serial positions in which the LTM words were introduced. For instance, immediate memory performance for remembering the LTM words presented at serial positions 1 and 2 to words presented at matching serial positions 1 and 2 in the new condition. This allowed us to test whether an immediate performance benefit for LTM words over new words between conditions was due to a global advantage for episodic representations (an advantage for LTM words that matched pre-learnt word pairs over words from the new condition at the same serial positions) or to item activation of the words independent of their pre-learnt binding (an advantage for both LTM words in the word pairs and singletons conditions over words from the new condition at the same serial positions).

The second and third model aimed to test whether there was a reduction in WM load by including LTM words in WM trials, depending on where the LTM words were introduced within the list. Specifically, the second model tested whether there was a proactive benefit for words introduced after the presentation of LTM words. For example, if LTM words were presented at positions 1 and 2, we analyzed performance differences of words at positions 3, 4, 5, and 6 across all conditions, instead of comparing performance differences to the aggregated performance of all words of the new condition. This allowed for an unbiased comparison focusing only on the serial positions of interest between conditions. The third model aimed to test whether there was a retroactive benefit for words introduced before the presentation of LTM words. Similarly to the second model, we analyzed performance differences across conditions at the serial positions of interest. Hence, if LTM words were presented at serial positions 5 and 6, we analyzed performance differences of words at positions 1, 2, 3 and 4 across all conditions.

Results

Figure 2 shows the proportion of correct responses as a function of (A) condition and (B) for LTM words and new words separately, within the word pairs and singletons conditions. As can be seen in Figure 2B, and supported by the analysis, we observed that immediate memory for LTM versus new information within trials was superior for LTM words that match a pre-learnt word pair in the word pairs condition (BF₁₀ = 1.18 × 10⁵ [1.92 × 10⁴ – 1.15 × 10¹¹]). However, in the singletons condition, when LTM information consisted of two words from different episodes, immediate memory for LTM information was not better compared to new words of the same trials (BF₁₀ = 0.12 [0.07–0.28]).

Proportion of immediate Recall Performance as a Function of Condition. **Panel A:** Overall Performance. **Panel B:** Performance for LTM and new Words for LTM conditions.

This means that LTM information included in WM trials benefited immediate memory performance only when the two successive words matched an entire episode stored in LTM. In other words, LTM does not benefit serial recall performance through item activation of each singleton, but rather via the stored binding of the LTM episode.

Next, we examined performance between conditions for LTM, as well as for new information, in order to determine global effects of LTM information, as proactive and retroactive effects depending on the position in which LTM information was presented within the list. Figure 3A shows the serial positions curves for all conditions based on the position of the LTM words within the lists. Figure 3B depicts the comparisons of interest: LTM information of words pairs and singletons conditions compared to words matching the same serial positions of the new condition, and performance of new words following (proactive effect) or preceding (retroactive effect) LTM information (or the matching serial positions of the new condition) within the list.

Experiment 1. **Panel A:** Serial Position Curves depending on the Serial Positions in which the LTM words were introduced within the List. **Panel B:** Performance of Words at the Serial positions of interest (LTM Words in the word pairs and singletons Conditions, Words at matching Positions of the new Condition); proactive and retroactive Performance.
*Note*. The numbers (e.g., 1,2) denote the serial positions at which the LTM words were introduced within the list in both LTM conditions, and correspond to the same positions of words in the new condition. Proactive and retroactive effects indicate the aggregated performance of words presented in serial positions following or preceding the introduction of LTM words, respectively. These specific serial positions are the same for words in the new condition to ensure an unbiased comparison.

Supported by the statistical results presented in Table 1, we found a consistent pattern of superior performance for LTM words (in the word pairs condition) at any position within the list compared to words of the new condition at the same serial positions, except at positions 4 and 5. When comparing the word pairs to the singletons condition, there was anecdotal evidence of superior performance for LTM words in the word pairs condition when introduced at the beginning of the list (at positions 1 and 2, 2 and 3) and credible differences at the very end of the list.

Table 1

Experiment 1. Bayes factors (BF₁₀) of the pairwise comparisons between conditions for each serial position in which the LTM words were introduced in the lists.

	WORD PAIRS VS SINGLETONS	WORDS PAIRS VS NEW	SINGLETONS VS NEW
Serial positions 1,2	1.44 [0.89–3.39]	2041.03 [730.03–7191.33]	4.51 [3.18–5.09]
Proactive	0.08 [0.05–0.18]	0.06 [0.03–0.15]	0.09 [0.06–0.22]
Retroactive	–	–	–
Serial positions 2,3	2.51 [1.66–6.01]	776.12 [394.52–1860.73]	0.53 [0.34–0.76]
Proactive	0.06 [0.04–0.15]	0.07 [0.04–0.18]	0.09 [0.05–0.21]
Retroactive	0.17 [0.11–0.32]	0.07 [0.14–0.36]	0.35 [0.23–0.58]
Serial positions 3,4	0.82 [0.45–2.35]	24.06 [17.96–48.80]	.24 [0.13–0.47]
Proactive	0.08 [0.05–0.18]	0.06 [0.03–0.14]	0.09 [0.05–0.20]
Retroactive	0.16 [0.10–0.28]	0.12 [0.07–0.21]	0.54 [0.41–0.66]
Serial positions 4,5	0.21 [0.10–0.83]	0.34 [0.17–1.40]	0.08 [0.04–0.19]
Proactive	0.11 [0.06–0.22]	0.09 [0.06–0.19]	0.11 0.07–0.24]
Retroactive	0.11 [0.07–0.22]	0.10 [0.06–0.23]	0.43 [0.27–0.73]
Serial positions 5,6	27.25 [15.27–41.91]	18.65 [9.01– 37]	0.13 [0.07–0.27]
Proactive	–	–	–
Retroactive	0.08 [0.05–0.18]	0.06 [0.03–0.15]	0.09 [0.05–0.23]

[i] Note. Serial positions represent the positions in which the LTM words were introduced in the word pairs and singletons conditions, which are compared to the words presented at the same serial positions of the new condition.

Credible BFs are printed in bold.

The comparison of two words of different episodes (singletons condition) to the new condition only yielded an advantage for the former, when the words were presented at the very beginning of the list (positions 1 and 2).

Regardless of where the LTM words were introduced within the list, whether matching an episode or as singletons, no proactive or retroactive benefit was found for the new words included in the list, meaning that immediate memory performance for new words across conditions was similar irrespective of whether the LTM words were presented before or after them. This suggests that including LTM words at any position of the lists did not free-up WM capacity –meaning it did not result in any advantage or disadvantage for the rest of the words included in the lists.

Discussion

The goal of Experiment 1 was to investigate the flexibility of WM in utilizing LTM representations during serial recall. We aimed to determine whether WM can draw on entire LTM episodes or benefit from the independent activation of each element in the LTM representation when encountered individually. Additionally, we examined whether serial position influences any advantage of LTM words, either through redintegration or reducing WM load.

Our results revealed two main findings: First, we observed superior immediate memory performance for LTM information over new information within and between conditions, consistently only for words belonging to LTM episodes (i.e., pre-learnt word-pairs). Second, incorporating LTM words did not free-up WM capacity, as performance did not improve for novel words of the same lists, regardless of the serial positions of the LTM words.

The advantage of LTM words over words of the new condition was evident when two words from the same LTM episode were sequentially presented compared to introducing two words from different episodes. This speaks against an item activation account and is against our prediction that activating items whether from the same or different episodes would lead to higher performance via redintegration. Instead, it suggests that episodic LTM can benefit WM, when participants can retrieve intact episodic information for serial recall.² Although participants seemed to benefit from matching episodes at any position, there was no benefit for subsequent new information, even when presented at the beginning of the list. This contrasts with previous research showing that semantic chunks introduced at the list‘s onset can free WM capacity (Portrat et al., 2016; Thalmann et al., 2019), but is consistent with previous findings that WM capacity is not freed-up during encoding when two-item chunks are included (Norris et al., 2020).

Overall, the evidence from Experiment 1 is inconsistent with both, the idea that pre-learnt information is chunked or offloaded during encoding, as we did not observe any benefit on new information within mixed lists including LTM information; and the idea that it benefits via redintegration at recall, as our results show that WM is only benefitting from the retrieval of entire episodes but not from item activation.

One possibility to resolve these inconsistencies is to assume that episodic LTM contributions arise from redintegration at recall only for words stored in LTM as full episodes rather than as individual words, thereby facilitating the redintegration effect for intact pairs but not singletons. A trial, which contains an intact pre-learnt word pair could make it more likely that at least some information about this pair is still available at test, thereby increasing the chance for its successful redintegration. Our findings would be consistent with this account, where the advantage is specific to pre-learnt episodes, but there is a lack of a benefit on novel words within the lists (Norris et al., 2020).

Another possibility could be that the current task made it more difficult to properly engage in freeing-up capacity strategies (i.e. chunking or offloading). The current paradigm requires that participants remember each word bound to their positional context. This may limit the usefulness of LTM item-item associations, as recalling each word’s position remains necessary to successfully completing the task. Thus, WM cannot exclusively rely on differently structured LTM traces, limiting the utility of freeing-up capacity strategies at encoding.

A critical difference between this and previous studies in which episodic knowledge improved WM performance is the level of facilitation in recognizing LTM information. Previous studies used tasks that inherently facilitated the recognition of pre-learnt episodes during encoding by utilizing the same item-item structure (Bartsch & Shepherdson, 2022, 2023; Chen & Cowan, 2005, 2009) or clearly separating pre-learnt information from new information (Thalmann et al., 2019). In our study, words that matched prior knowledge were less salient. By presenting them centrally like new words, we potentially reduced the effectiveness of chunking or offloading strategies that are ought to occur at encoding. Therefore, an alternative explanation for the lack of freeing-up WM capacity benefits in our results is that the different structures of LTM and WM representations made it harder to immediately recognize matching pre-learnt episodes. As a result, participants may have had to initially encode both words independently, limiting their ability to engage in encoding strategies that could otherwise reduce WM load.

Indeed, recent evidence highlights the role of awareness at encoding in WM for retrieving episodic memory. Specifically, a recent study on Hebb repetition learning (Musfeld et al., 2023), demonstrated that participants only benefited from repetitions once they had recognized a repeating episode during re-encoding. Thus, recognizing previously encountered information during WM encoding is critical for learning and retrieving LTM information.

In summary, the different representational structures used in WM and LTM might have prevented participants from recognizing and subsequently chunking or offloading redundant information during encoding. Instead, they likely encoded the information as item-context bindings, preventing the freeing of WM capacity and resulting in no advantage for new information. We test this possibility in Experiment 2.

Experiment 2

In Experiment 2 we tested the hypothesis of whether increasing the saliency of LTM information in the WM task would increase recognizability and the beneficial contributions to WM performance. This aimed to not only improving immediate serial recall performance for the LTM words compared to new words (as in Experiment 1) but also free-up WM capacity at encoding. To address this, we highlighted the respective LTM words at encoding in red and informed that the colored words could match words that they had previously learnt. To control for effects of saliency, we presented two successive new words in the new condition in red as well, counterbalancing the serial positions across the trials.