
Figure 1
A sample page from Allacci’s Drammaturgia.
Table 1
Extraction patterns.
| FIELD | DESCRIPTION | REGEX LOGIC | VALUE TYPE |
|---|---|---|---|
| Entry | Full text of the entry | None | string |
| Title | Main title | All text until the first full stop. | string |
| Subtitle | Subtitle or alternative title | Text between a set of expression indicating a potential subtitle (o vero, o sia) and the first full stop | string |
| Author | Author, writer, librettist | A dash, di, Poesia di + following two words (≈name/surname) | string |
| Genre | Dramatic genre (as indicated in the entry) | Text between the first full stop and the second full stop or parenthesis | string |
| City | Place of publication (city or town) | in + following word | string |
| Location | Physical location of first recorded performance (usually, a theatre) | First two/three words after Teatro di | string |
| Publisher | Publisher, printer, or typographer | pointer7+ following two words (≈name/surname) | string |
| Year | Year of publication or performance | in + yyyy, else the first yyyy found | integer |
| Format | Typographical format (quarto, octavo, etc.) | in + one/two-digit number | integer |
| Mode | Poetry or prose | Find prosa for prose; versi/ottava rima for verse | string |
| Translation | Only direct translations are considered, not adaptations | If the entry contains translation-related language (tradot-, traduz-), mark as True | boolean |
| Libretto | Indication of the ‘musical’ nature of the work | If the entry contains music-related language (per/in M/musica-), mark as True | boolean |
| Composer | For libretti: author of the score | Musica di/del + following two words (≈name/surname) | string |

Figure 2
A sample entry; extracted fields are marked.
