
Figure 1
Similarity Matrix. The leftmost column and the row headers jointly provide a notational scheme for labelling and scoring the intertextual pairs in the dataset; common intertextual entities or referential relations are sorted into the corresponding conceptual space of similarity, including specific phenomena such as “heteroglossia” (Bakhtin, 1981) in the box for “Phrase-Parallel”.
Table 1
Model details.
| MODEL | MODEL DETAILS |
|---|---|
| Word2Vec | (out of the box, no adjustment) |
| Base SBERT* | all-MiniLM-L6-v2 |
| MPNet (masked model)* | all-mpnet-base-v2 |
| Multilingual MPNet* | paraphrase-multilingual-mpnet-base-v2 |
| Question-Answer & Retrieval* | multi-qa-mpnet-base-dot-v1 |
| Distilled Question-Answer & Retrieval* | multi-qa-distilbert-cos-v1 |
| E5* | e5-base-v2 |
| Note | *SBERT family |
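The models in Table 1 are typically scored on STS by taking the cosine similarity of their sentence embeddings. As a minimal, model-free sketch of that baseline score (the toy 4-dimensional vectors below are illustrative only; the listed models emit 384- or 768-dimensional embeddings):

```python
import math

def cosine_similarity(u, v):
    # Cosine of the angle between two embedding vectors: the
    # standard STS baseline score used with the models in Table 1.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Hypothetical "embeddings" for two passages (not output of any real model).
passage_a = [0.2, 0.7, 0.1, 0.4]
passage_b = [0.3, 0.6, 0.0, 0.5]
print(round(cosine_similarity(passage_a, passage_b), 3))  # → 0.971
```

With an SBERT-family model, the same score is obtained by encoding each passage and applying this function (or the library's built-in cosine utility) to the resulting vectors.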

Figure 2
A portion of the dataset illustrating how word-level mirroring is weighted down by a paragraph-level sense of opposition.

Figure 3
A demonstration of one way to label intertextual elements and calculate a similarity score, compared with basic NLP approaches to semantic textual similarity (STS).

Figure 4
A conceptual representation of the proposed intertextual similarity (aSimMatrix) score.

Figure 5
A sample of pairwise n-gram and label distributions.

Figure 6
An example of semantic/conceptual difference.
