| Variable | Description | Example relative to the sentence just send Christmas cards … to people you don’t see from year to year |
|---|---|---|
| Verb | The verb lemma, one of “give”, “lend”, “show”, “send”, “offer”, and “sell”. | send |
| VerbSemTag | The semantic tag of the verb, obtained from the corpus semantic annotation, based on UCREL [5] semantic analysis system USAS; tags are available at http://ucrel.lancs.ac.uk/usas/semtags.txt. | M2 (‘Putting, taking, pulling, pushing, transporting &c.’) |
| Pattern | The observed dative construction, one of “VNPP” or “VNN” | VNPP |
| Recipient | The recipient’s noun phrase | people you don’t see |
| RecLen | The number of characters in the recipient | 21 |
| RecHead | The recipient’s syntactic head | people |
| RecPrn | Boolean defined programmatically based on the semantic tag of the recipient. If the semantic tag is ‘Z8’, the value is TRUE; otherwise, the value if FALSE. | NA |
| RecSemTag | String with the UCREL [5] semantic tag of the recipient’s syntactic head | S2 (‘people’) |
| AnimateRec | Boolean indicating whether the recipient’s head is animate (TRUE) or inanimate (FALSE). This was manually annotated | FALSE |
| Theme | String with the theme’s noun phrase | Christmas cards |
| ThemeLen | The number of characters in the theme | 15 |
| ThemeHead | String with the theme’s syntactic head | cards |
| ThemePrn | Boolean defined programmatically based on the semantic tag of the theme. If the semantic tag is ‘Z8’, the value is TRUE; otherwise, the value if FALSE. | FALSE |
| ThemeSemTag | String with the UCREL semantic tag of the theme’s syntactic head | Q1 (‘LINGUISTIC ACTIONS, STATES AND PROCESSES; COMMUNICATION’) |
| ThemeField | First letter of the semantic tag of the theme’s syntactic head. | Q |
| DefTheme | Boolean indicating if the theme is expressed as a definite phrase (TRUE) or indefinite (FALSE) | FALSE |
| AnimateTheme | Boolean indicating whether the theme’s head is animate (TRUE) or inanimate (FALSE) | FALSE |
| Variable | Description | Example |
|---|---|---|
| NumSpeakers | Number of speakers in the conversation | Texts with 2 speakers |
| Location | Location where the conversation took place | Speakers’ home |
| Relation | Relationship between the speakers in the conversation | Close family, partners, very close friends |
| Subject | Subject of conversation | Mother and daughter talking about theatre |
| Topics | Topics covered in the conversation | Theatre, Disney films, websites, post, Christmas, jobs| |
| ExactAge | Exact age of the main speaker in the conversation | 44 |
| AgeRange | The age range of the main speaker in the conversation | 40_49 |
| AgeRangeMid | Mid-point of the age range of the main speaker in the conversation. This variable is automatically calculated | 45 |
| AgeImputed | Equals the exact age of the main speaker in the conversation if it is recorded; it is the mid-point of the age range of the main speaker in the conversation, if the age range is recorded but not the exact range; otherwise, NA. This variable is automatically calculated | 44 |
| Gender | Gender of the main speaker in the conversation (M or F) | F |
| Nationality | Nationality of the main speaker in the conversation | British |
| BirthCountry | Country of birth of the main speaker in the conversation | England |
| L1 | First language of the main speaker in the conversation | English |
| LingOrigin | Country of linguistic origin of the main speaker in the conversation | England |
| Accent | Accent of the main speaker in the conversation | South East England |
| City | City where the conversation took place | High Wycombe |
| Country | Country where the conversation took place | England |
| Level1Dialect | First level of granularity in the categorization of the dialect of the main speaker in the conversation | uk |
| Level2Dialect | Second level of granularity in the categorization of the dialect of the main speaker in the conversation | english |
| Level3Dialect | Third level of granularity in the categorization of the dialect of the main speaker in the conversation | south |
| Level4Dialect | Fourth level of granularity in the categorization of the dialect of the main speaker in the conversation | southeast |
| SpeakerHighestQual | Highest qualification of the main speaker in the conversation | Graduate |
| Occupation | Occupation of the main speaker in the conversation | Team leader |
| SpeakerSocGrade | Social grade of the main speaker in the conversation, according to the classification developed by the National Readership Survey (https://web.archive.org/web/20110303033539/http://www.nrs.co.uk/lifestyle.htm) | E |
| ForeignLangs | Foreign languages spoken by the main speaker in the conversation | French–level unspecified; Spanish–level unspecified |
| NumUtterances | Number of utterances of the conversation’s main speaker in the whole corpus | 99 |
| NumWords | Number of words uttered by the conversation’s main speaker in the whole corpus | 1622 |
