Skip to main content
Have a personal or library account? Click to login
An Open Diachronic Dataset of Requests in Italian: Annotation, Challenges and a Case Study Cover

An Open Diachronic Dataset of Requests in Italian: Annotation, Challenges and a Case Study

Open Access
|May 2026

Full Article

(1) Introduction: context and motivation

Making requests lies at the very core of human interaction, revealing the interdependent and cooperative nature of everyday social experience. Rossi (2015, p.1) defines the need to appeal to others as “a primordial and pervasive component of social interaction”, highlighting a key feature of requests in interpersonal relations: reliance between individuals, understood as a structural dimension of our social existence. The ability to coordinate and guide others’ behavior constitutes the interactive fabric that makes social life possible, as well as efficient and organised, and seems to have a universal character (Drew & Couper-Kuhlen, 2014).

In addition to being an inherently social action, making requests is also a linguistic act, typically – though not always – performed through specific linguistic means. Languages provide a wide range of constructions and strategies to shape requests according to different socio-interactional variables, which has led Drew and Couper-Kuhlen (2014, p. 2) to refer to “social forms of requesting” as interacting with a range of linguistic forms, highlighting the complexities that can emerge from such combinations.

Requests are illocutionary acts in which a speaker communicates to a hearer a desire for the hearer to perform an action that benefits the speaker (see, e.g., Trosborg, 1995, p. 187). They are socially delicate actions, involving a complex interplay of demands and obligations: thus, speakers often feel compelled to render their linguistic expressions in a polite manner, softening their imposition and thereby reducing potential conflict and preserving relational balance. In more technical terms, according to Brown and Levinson’s seminal work (1987), requests are face-threatening acts that inherently challenge the addressee’s negative face – that is, their wish for autonomy and freedom from external impositions. In this light, speakers may opt for indirect (off-record) requests, drawing on a repertoire of linguistic and contextual resources – such as politeness markers, mitigation devices, minimizers, and supportive moves like justifications – to limit the impact of the imposition.

The aspects briefly outlined above – namely, the fundamental role of requests within our systems of cooperation and social solidarity, and the rich repertoire of linguistic strategies that languages have developed to perform this key action – make this speech act particularly compelling to explore from a variety of perspectives. As a result, requests have become one of the most extensively studied speech acts, especially from a cross-cultural point of view, but also through socio-interactional, acquisitional, and, to a lesser extent, diachronic approaches. Within this latter strand of research, studies focus primarily on English. Important studies examining the “old synchrony” of the English language are for example Kohnen (2000) on Old English and Culpeper and Archer (2008) on Early Modern English. A notable example of a diachronic study is Jucker (2020), who investigates the evolution of politeness norms in the history of English and demonstrates how these norms directly shape the linguistic realization of requests, illustrating how strategies of mitigation, indirectness, and deference vary across historical periods.

Studies on Italian, by contrast, are still very limited and thematically and/or chronologically circumscribed. Fedriani (2020) offers a first pilot study focusing exclusively on the mitigation of requests and based on a very small corpus of texts sampled at roughly two-century intervals. The doctoral dissertation by Parodi (2025), on the other hand, examines requests in Italian in the diachronic span from the seventeenth to the nineteenth century, primarily adopting a contrastive perspective, with Italian requests examined in systematic comparison with Spanish and Genoese.

Based on these premises, further research is needed to enrich our understanding of pragmalinguistic changes in the speech act of requesting across the long diachrony of Italian. To foster and support research aimed at filling this gap, we compiled a pragmatically annotated dataset of requests drawn from texts ranging from Old Italian to Contemporary Italian. We first present the dataset (Section 2), the annotation scheme and the annotation procedure (Section 3), and then discuss the main annotation challenges (Section 4). We also present a case study based on the dataset (Section 5) and outline the dataset’s potential applications (Section 6).

(2) Dataset description

The Diachronic Dataset of Requests in Italian contains 1,667 requests extracted from dialogues in Italian literary texts from the thirteenth to the twentieth century. For each request, the author, the title of the source work, and its century of publication are provided. Each request is pragmatically annotated into its constituent elements, following an annotation scheme based on Blum-Kulka et al. (1989) and described in Section 3.1.

Repository location

https://doi.org/10.7910/DVN/C2LVJF

Repository name

Harvard Dataverse

Object name

A Diachronic Dataset of Requests in Italian

Format name

Excel

Creation dates

2025-12-10 to 2026-03-21

Dataset creators

Chiara Fedriani, Francesca Strik-Lievers (University of Genoa)

Language

Italian; English

License

CC0 1.0

Publication date

2026-03-21

(3) Method

(3.1) Data and annotation scheme

The dataset was built following two steps: 1) Collection of request acts from Italian literary texts spanning from the thirteenth up to the twentieth century; 2) Annotation of all request acts based on the scheme proposed in Blum-Kulka et al. (1989).

The main resource we used to collect the requests is DIADIta (Napoli et al. submitted). DIADIta is a diachronic corpus of Italian annotated for pragmatic categories and is freely accessible through the website www.diadita.it. It currently includes 24 literary texts (plays and narrative works) ranging from the 13th century to the first half of the 20th century, for a total of 594,317 words. The corpus was designed as a reference resource for research in Italian historical pragmatics, particularly with regard to the pragmatics of interaction. All annotations therefore focus on the dialogic portions of the texts and cover a wide range of pragmatically relevant categories, including speech acts. A detailed description of the tagset structure and of the 57 pragmatic tags it includes, as well as of the annotation procedure, can be found in Napoli et al. (submitted) (see also De Felice & Strik-Lievers, 2024). Among the 31 speech acts included in the annotation scheme are requests, a type of directive act whose tag “Directive – Request/Supplication” is defined as follows in DIADIta’s annotation guidelines:

“Tag used to annotate the speech act by which the speaker asks the interlocutor to perform or refrain from performing a certain action or behaviour; the level of imposition is medium in the case of supplication and low in the case of requests, but never as high as in orders or commands. The act primarily serves the interests of the speaker.” (our translation; the annotation guidelines are available here on DIADIta’s website)

Thus, the most relevant criterion adopted by the DIADIta annotators to distinguish orders from requests was not formal in nature (for instance, both orders and requests may occur in the imperative mood), but functional, that is, related to the degree of imposition the speaker intends to exert on the addressee.

We extracted from DIADIta all textual segments annotated with this tag, for a total of 1,475 occurrences (a few instances were later excluded during the annotation process; an example is notate verbum, domine potestas ‘mark the word, o mighty lord’ from the Novelle Porretane, which is correctly annotated as a request in DIADIta; however, since it is in Latin, we excluded it because it falls outside the scope of our dataset).

The most recent texts in DIADIta are those by A. G. Cagna, both published in 1923. To extend the dataset with texts providing broader coverage of contemporary Italian, including works still under copyright, we manually identified request acts in five plays from the second half of the 20th century: Sabato, domenica e lunedì by E. De Filippo; Morte accidentale di un anarchico and Dio li fa e poi li accoppa by D. Fo; and Due donne di provincia and Dialogo di una prostituta con il suo cliente by D. Maraini. The identification was based on the same definition of request used in DIADIta. Table 1 lists all texts included in the dataset and the number of requests in each text.

Table 1

Texts included in the dataset, in chronological order, and number of requests.

AUTHORTITLECENTURYN. OF REQUESTS
AnonymousNovellino (selection)13th11
BoccaccioDecameron (selection)14th60
SacchettiTrecentonovelle (selection)14th14
Masuccio SalernitanoNovellino (selection)15th74
Degli ArientiNovelle Porretane15th59
AriostoCassaria16th46
MachiavelliLa Mandragola16th19
MachiavelliClizia16th27
AretinoLa Talanta16th83
AndreiniLo schiavetto17th139
Della PortaLa tabernaria17th59
ScalaIl finto marito17th147
GoldoniLa bottega del caffè18th74
GoldoniLa locandiera18th95
ChiariLa contadina incivilita dal caso18th71
TorelliI mariti19th63
GiacosaTristi amori19th62
VergaVita dei campi (selection)19th11
SvevoUn marito20th94
DeleddaCanne al vento20th114
PirandelloIl berretto a sonagli20th40
PirandelloEnrico IV20th39
CagnaPresso la culla20th13
CagnaVince chi… torna20th17
De FilippoSabato, domenica e lunedì20th33
FoMorte accidentale di un anarchico20th36
MarainiDue donne di provincia20th11
MarainiDialogo di una prostituta con un suo cliente20th16
FoDio li fa e poi li accoppa20th140
total1,667

All requests were further annotated according to a set of parameters considered relevant in the literature on request acts. Our main reference is Blum-Kulka et al. (1989), in which, as part of a large cross-linguistic and cross-cultural project on requests and apologies, the authors developed a detailed annotation scheme for these speech acts based on questionnaire data. Of course, our data are different in nature, since they involve historical varieties and literary texts. Given, on the one hand, the intrinsic limitations of the pragmatic interpretation of historical texts and, on the other, the fact that, as Mancini (2012, p. 247, our translation) observes, “the filter of writing functions as a selector of variables”, we adopted a simplified version of the annotation scheme originally proposed by Blum-Kulka et al., retaining its main categories.

A primary distinction is that between the head act, that is, “the minimal unit which can realize a request” (Blum-Kulka et al., 1989, p. 275), and possible modifiers used to modulate the illocutionary force of the request. The head act may be expressed directly, when the speaker’s illocutionary intention is explicitly encoded, for instance through the use of specific verbal moods or tenses (such as the imperative) or through particular verb classes (e.g. performative verbs such as to ask). Alternatively, it may be expressed indirectly, when the request is implicit to varying degrees. In this case, speakers may use expressions conventionally employed to formulate requests (e.g. could you…?), or expressions not conventionally associated with requests but interpretable as such by the hearer based on the pragmatic context.

As for modifiers, they are optional and may occur either internally or externally to the head act. Internal modifiers can be expressed through both lexical and syntactic means, and they can either upgrade or downgrade the illocutionary force of the request. For instance, the pragmatic marker please can be used to express politeness, and a similar type of modification of illocutionary force can be achieved by using the subjunctive or the conditional instead of the imperative. External modifiers, which may precede or follow the head act, can also either mitigate or aggravate the request, serving different specific functions such as providing motivation for the request or promising a reward if the request is fulfilled.

Strategies for expressing the head act are mutually exclusive. For example, an act that is direct cannot at the same time be indirect, and if it is expressed through the imperative mood it will not be expressed through a performative verb. By contrast, multiple modifiers may co-occur within the same request. Table 2 presents the full set of annotated categories together with examples from our dataset.

Table 2

Annotation scheme for requests.

Head actDirect strategiesVerbMood/tenseApritemi, ser Cione (Sacchetti, Trecentonovelle, 14th c.)
‘Open the door for me, Ser Cione’
Performativeio vi priego che voi il pigliate (Boccaccio, Decameron, 14th c.)
‘I beg you to take it’
Obligation statementMi dovete perdonare (De Filippo, Sabato, domenica e lunedì, 20th c.)
‘You must forgive me’
Want statementVoglio che tu ti segga (Torelli, I mariti, 19th c.)
‘I want you to sit down’
NounNoun phraseSignori, silenzio (Andreini, Lo schiavetto, 17th c.)
‘Gentlemen, silence’
Indirect strategiesConventionally indirectPosso sdraiarmi? (Maraini, Dialogo di una prostituta con il suo cliente, 20th c.)
‘May I lie down?’
Nonconventionally indirectBarone… fa freddo in questa stanza (Torelli, I mariti, 19th c.)
‘Baron… it’s cold in this room.’
(the Baron answers: Vi faccio un bel fuoco, aspettate…
‘I’ll make you a nice fire, wait…’)
ModifiersInternalUpgradersAscolta ascolta (Aretino, La Talanta, 16th c.)
‘Listen listen’
DowngradersVi supplico dirmelo per cortesia (Goldoni, La bottega del caffè, 18th c.)
‘I beg you to tell me, please’
ExternalUpgradersFermatevi, vi dico (Goldoni, La locandiera, 18th c.)
‘Stop, I’m telling you’
DowngradersAbbia pazienza, vada senza di me (Giacosa, Tristi amori, 19th c.)
‘Please be patient, go without me’
PrecommitmentPadre,m’avete a fare un’altra grazia, di perdonare a Cappio (Della Porta, La tabernaria, 17th c.)
‘Father, you must grant me another favor: to forgive Cappio’
MotivationMi vuoi accompagnare in camera mia, comincia a fare freschetto (De Filippo, Sabato, domenica e lunedì, 20th c.)
‘Would you accompany me to my room? It’s starting to get a bit chilly’
Ritual apologiesScusa, Matilde, mandami qui Giulia (Torelli, I mariti, 19th c.)
‘Sorry, Matilde, send Giulia here’

(3.2) Annotation procedure and inter-annotator agreement

The annotation was carried out by the two authors. In a first phase, both annotators independently annotated the same subset of the data, drawn from texts from different historical periods, for a total of 240 requests (14% of the dataset):

  • – Anonymous, Novellino

  • – Sacchetti, Trecentonovelle

  • – Boccaccio, Decameron

  • – Della Porta, La tabernaria

  • – Fo, Morte accidentale di un anarchico

  • – Maraini, Dialogo di una prostituta con il suo cliente

  • – Maraini, Due donne di provincia

  • – De Filippo, Sabato, domenica e lunedì

For this subset of 240 requests, we calculated the inter-annotator agreement using Krippendorff’s α. The resulting values were 0.885 for the head act and 0.651 for modifiers. According to commonly adopted thresholds for Krippendorff’s α, values above 0.80 indicate reliable agreement, while values around 0.67 may be considered acceptable for tentative conclusions (Krippendorff, 2004). The value obtained for head acts therefore indicates very high agreement, while the value for modifiers can be considered acceptable, given the highly interpretative nature of pragmatic annotation tasks and the fact that multiple modifiers may occur within a single request.

(4) Sources of annotation disagreement

An examination of the cases of disagreement shows that, aside from occasional annotation oversights, divergences largely stem from intrinsic interpretative difficulties in the data. This is hardly surprising. As Weisser (2014, p. 84) notes, “[a]ny type of linguistic annotation is a highly complex and interpretive process, but none more so than pragmatic annotation”. A key reason lies in the lack of a one-to-one correspondence between pragmatic forms and functions: “the form–function mismatch of most pragmatic phenomena means that automatic assignment of tags will often lack precision, and manual implementation of tags (which is time- and resource-intensive) is unavoidable” (Rühlemann & Aijmer, 2014, p. 11). In addition, many pragmatic categories have fuzzy boundaries and cover functional domains that overlap with neighbouring categories. As we will discuss in this section, these properties frequently give rise to cases of annotator disagreement, stemming from ambiguity and from the availability of multiple plausible interpretations. We discuss here some illustrative examples, beginning with the cases of disagreement that emerged in the annotation of head acts, which are less than half of those in the annotation of modifiers (21 vs. 48). This in itself is an interesting point, to which we will return later.

One case of disagreement in the annotation of the head act is given in (1). The first annotator classified it as a direct request in the form of a want statement expressed by the verb intendere in the sense of ‘to wish/want’. The second annotator interpreted intendere as a performative verb (‘to declare’) and therefore classified the request differently.

  • (1)   acciò che tua sorella senza dote non sia, io intendo che egli e non altri abbia questo beneficio che il re promette così grande per te […] (Boccaccio, Decameron, 14th c.)

          ‘So that your sister may not remain without a dowry, I intend that he, and no other, should receive this great benefit that the king promises for you.’

In this case, the interpretative difficulty stems from polysemy. The verb intendere already exhibited the two meanings (‘to wish/want’ and ‘to declare’) in fourteenth-century Italian (see GDLI, s.v.). Closer examination of the broader context, together with in-depth discussion among the annotators, led to the conclusion that, in this instance, intendere conveys a want-based request, a reading supported by the interlocutor’s subsequent compliance in the following stretch of text.

As noted above, the level of disagreement in the annotation of modifiers proved to be higher, and this functional domain presented a wider range of annotation challenges, which were also more complex. A recurrent type of divergence concerned expressions such as the one highlighted in example (2), annotated by both authors as an external modifier, but interpreted by one as expressing motivation and by the other as a downgrader.

  • (2)   Per che io vi priego, per cotanto amore quanto è quello che io vi porto, che voi non neghiate il vostro verso di me […] (Boccaccio, Decameron, 14th c.)

          ‘Therefore I beg you, for the great love I bear you, that you do not deny me yours [love] toward me’

The categories of “motivation” and “downgrader” may indeed partially overlap, because providing a reason or justification for the request can also function as a mitigating strategy that reduces its impositive force. Adopting a prototypical approach, we agreed to annotate such expressions, which are particularly common in Old Italian, as downgraders, since their core function appears to be the modulation, in terms of mitigation, of the request, while the motivational component remains more implicit.

Another case of divergence concerned the classification of the pragmatic marker per carità (‘for heaven’s sake’) in example (3), interpreted by both annotators as an internal modifier but considered a downgrader by one and an upgrader by the other.

  • (3)   Per carità, lasci perdere quel “primo consigliere”. Non ci tengo (Fo, Morte accidentale di un anarchico, 20th c.)

          ‘For heaven’s sake, drop that “chief adviser” business. I don’t care about it.’

The pragmatic marker per carità has historically been used in Italian both in polite and impolite requests. To resolve this disagreement, we relied on the existing literature on the topic. In particular, according to Paternoster (2023), who adopts a combination of methods, both qualitative (focusing on the mention of the formula in contemporary etiquette manuals) and quantitative (examining the use of the formula in a diachronic corpus), impolite uses of per carità emerged in the nineteenth century and are predominant today, often being used to express disagreement – as in (3). For this reason, we ultimately annotated this twentieth-century occurrence as an upgrader.

A final noteworthy case is illustrated in example (4), concerning come sia tuo piacero:

  • (4)   priegoti che mi doni oro o argento o robbe, come sia tuo piacero

          ‘I pray that you grant me gold, silver, or other items, as it pleases you’ (Anonymous, Novellino, 13th century)

One annotator did not mark this expression, interpreting it purely propositionally as attached to argento o robbe – that is, ‘silver or other items, as you wish’. The other annotator, however, interpreted come sia tuo piacero as an instantiation of one of the pragmatic rules for polite expression in Lakoff’s politeness model, namely “give options”, which entails phrasing utterances so as to allow the addressee flexibility in responding to or interpreting the request (Lakoff, 1975). Following discussion, it was decided to annotate it according to this line of reasoning, classifying it as an external downgrading modifier.

Space constraints prevent us from discussing further cases of interpretative disagreement. These examples nevertheless highlight the challenges involved in classifying pragmatic modifiers, whose context-dependent and multifunctional nature makes their identification particularly susceptible to individual analytical judgments.

An in-depth discussion of these cases allowed us to clarify problematic instances and then proceed independently with the annotation of the remaining requests, consulting each other whenever further ambiguities arose.

(5) Results: diachronic trends in direct vs indirect requests

In this section, we provide an illustrative example of research based on the dataset just described. In particular, as a testing ground, we have decided to focus on one of the most widely studied aspects of requests across different languages and historical periods, namely the degree of (in)directness of requests. Blum-Kulka et al. (1989) show that, in the five languages examined in their study (English, French, German, Hebrew, and Spanish), conventionally indirect strategies are by far the most frequent (between 58% and 82%, depending on the language), followed by direct strategies (between 10% and 40%) and non-conventionally indirect strategies (between 2% and 8%). Culpeper and Archer (2008), analysing trial records and drama texts from Early Modern English (1640–1760), observed, by contrast, a marked predominance of direct strategies (about 73%; note, however, that their category requests is used in a broader sense, also encompassing commands; consequently, their findings are not directly comparable to ours). In any event, this discrepancy led us to explore what happens in the history of Italian with regard to the parameter of (in)directness. Let us now thus see what the data from our dataset show from this perspective.

We organised the data into six consecutive periods, partially following the periodization adopted in the MIDIA diachronic corpus of Italian (D’Achille & Grossmann, 2017): Period 1 (13th c.–1375), Period 2 (1376–1532), Period 3 (1533–1691), Period 4 (1692–1840), Period 5 (1841–1947), Period 6 (1948–1978). Figure 1 shows, for each period, the proportion of direct, conventionally indirect, and non-conventionally indirect requests relative to the total number of requests in that period.

Figure 1

Distribution of direct, conventionally indirect, and non-conventionally indirect strategies in the requests of the dataset.

It can be observed that up to the mid-20th century requests are overwhelmingly direct. This result matches the trend also observed by Culpeper and Archer in their analysis of Early Modern English. In the second half of the 20th century the situation changed: conventionally indirect requests increased substantially, reaching 30% of the requests in Period 6. Non-conventionally indirect requests, by contrast, remain consistently marginal, as is also the case both in Blum-Kulka et al. (1989) and in Culpeper and Archer (2008).

It therefore appears that, whereas in historical data from Italian (and English) requests were predominantly formulated directly, in contemporary Italian (as in other present-day languages) the gap between direct and indirect strategies has narrowed. The trend that emerges here, although limited to the dataset used, seems to suggest a developmental tendency that may be cross-linguistically valid, at least for European languages that have shared similar historical trajectories and cultural developments. For a more in-depth, culturally and historically informed account of the reasons underlying this evolutionary tendency, we refer the reader to Fedriani and Strik-Lievers (in preparation).

(6) Applications and potential reuse scenarios

Building on the illustrative case study on (in)directness in Italian requests presented in Section 5, we now turn to other potential reuse scenarios of the dataset, which, in our view, support multiple lines of research across different fields.

First, it can be used to investigate a wide range of pragmalinguistic phenomena from a historical pragmatic perspective, including the use of performative verbs, (im)politeness strategies, mitigation and intensification, as well as diachronic changes in support moves – for instance, the increasing conventionalization of ritual apologies or the gradual pragmaticalization of Italian politeness markers over the centuries. A related area of application concerns the study of dialogic interaction patterns, such as the use and evolution of terms of address and endearment, as well as variation and change in attention-getting forms and their communicative functions.

In stylistic and literary studies, the dataset allows the identification of recurring stylistic devices employed by specific authors. For example, earlier research on the requests included in the dataset highlighted Andreini’s strong preference for the discourse marker di grazia in his work Lo schiavetto, which contributes to a marked increase in the use of discourse markers in seventeenth-century requests included in the dataset (Fedriani & Strik-Lievers, 2025). At the same time, the dataset provides a valuable resource for examining authorial preferences in dialogue construction and characterization.

In computational linguistics and NLP, it can support the training of language models on pragmatically annotated data and the development of automated systems for request detection and classification. More broadly, this aligns with recent work exploring the use of LLMs for pragmatic annotation tasks, such as Yu et al. (2024) on apologies, Brocca et al. (2026) on cancellations after accepting an invitation, and Bianco et al. (2026) on humour.

Finally, the dataset also has educational and digital humanities applications, serving as a resource for courses in pragmatics, stylistics, and historical linguistics, as well as for pragmalinguistic annotation exercises – an approach the authors have implemented in research seminars with PhD candidates and in MA-level courses.

Acknowledgements

We are grateful to the Editors of the Special Issue and the audience at the conference Quantitative Diachronic Linguistics and Cultural Analytics: Data-Driven Insights into Language and Cultural Change (King’s College London, 15–16 January 2026) for their valuable suggestions. We also thank the three anonymous reviewers for their careful reading and insightful comments. Our analysis has further benefited from the thoughtful feedback of Davide Garassino, to whom we extend our sincere gratitude.

Author Contributions

Chiara Fedriani: conceptualization, data curation, investigation, methodology, writing of Sections 1, 5, and 6.

Francesca Strik-Lievers: conceptualization, data curation, investigation, methodology, writing of Sections 3 and 4.

DOI: https://doi.org/10.5334/johd.541 | Journal eISSN: 2059-481X
Language: English
Page range: 68 - 68
Submitted on: Mar 20, 2026
Accepted on: Apr 23, 2026
Published on: May 25, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Chiara Fedriani, Francesca Strik-Lievers, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.