“aSimMatrix” Dimensions: A Scalable Framework for Benchmarking Intertextual Similarity

By: Shellie Audsley  
Open Access | Feb 2026

Figures & Tables

Figure 1

Similarity Matrix. The leftmost column and the row headers jointly provide a notational scheme for labelling and scoring the intertextual pairs in the dataset. Common intertextual entities and referential relations are sorted into the corresponding conceptual space of similarity, including specific phenomena such as “heteroglossia” (Bakhtin, 1981) in the box for “Phrase-Parallel”.

Table 1

Model details.

| MODEL | MODEL DETAILS |
|---|---|
| Word2Vec | (out of the box, no adjustment) |
| Base SBERT* | all-MiniLM-L6-v2 |
| MPNet (masked model)* | all-mpnet-base-v2 |
| Multilingual MPNet* | paraphrase-multilingual-mpnet-base-v2 |
| Question-Answer & Retrieval* | multi-qa-mpnet-base-dot-v1 |
| Distilled Question-Answer & Retrieval* | multi-qa-distilbert-cos-v1 |
| E5* | e5-base-v2 |

Note: * SBERT family.
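As a rough illustration of how checkpoints like those in Table 1 could be compared on a single text pair, the sketch below scores one pair with any number of interchangeable encoders using cosine similarity. It is not the author's pipeline. The names `make_bow_encoder`, `load_sbert_encoder`, and `score_pair` are hypothetical, and the bag-of-words encoder is only a stand-in so the example runs without downloading a model. The optional wrapper assumes the `sentence-transformers` package is installed.

```python
import math
from collections import Counter
from typing import Callable, Dict, Sequence

Vector = Sequence[float]
Encoder = Callable[[str], Vector]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def make_bow_encoder(vocabulary: Sequence[str]) -> Encoder:
    """Hypothetical stand-in encoder: word counts over a fixed vocabulary."""
    def encode(text: str) -> Vector:
        counts = Counter(text.lower().split())
        return [float(counts[word]) for word in vocabulary]
    return encode


def load_sbert_encoder(model_name: str) -> Encoder:
    """Wrap a sentence-transformers checkpoint as an Encoder.

    Assumption: the optional `sentence-transformers` package is installed;
    the checkpoint is downloaded on first use.
    """
    from sentence_transformers import SentenceTransformer
    model = SentenceTransformer(model_name)
    return lambda text: model.encode(text).tolist()


def score_pair(text_a: str, text_b: str, encoders: Dict[str, Encoder]) -> Dict[str, float]:
    """Return one cosine score per encoder for a single text pair."""
    return {name: cosine(enc(text_a), enc(text_b)) for name, enc in encoders.items()}


if __name__ == "__main__":
    vocab = ["the", "sea", "is", "calm", "wild", "tonight"]
    encoders = {"bow-baseline": make_bow_encoder(vocab)}
    # Uncomment to add a checkpoint from Table 1 (requires a model download):
    # encoders["all-MiniLM-L6-v2"] = load_sbert_encoder("all-MiniLM-L6-v2")
    print(score_pair("The sea is calm tonight", "The sea is wild tonight", encoders))
```

Keeping the encoder as a plain callable means a word-count baseline and a transformer checkpoint can be scored side by side on the same pairs without changing the comparison code.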
Figure 2

A portion of the dataset illustrating how word-mirroring is down-weighted by a paragraph-level sense of opposition.

Figure 3

A demonstration of one way to label intertextual elements and calculate a similarity score, compared with basic NLP approaches to STS.

Figure 4

A conceptual representation of the proposed intertextual similarity (aSimMatrix) score.

Figure 5

A sample of pairwise n-gram and label distributions.

Figure 6

An example of semantic/conceptual difference.

DOI: https://doi.org/10.5334/johd.486 | Journal eISSN: 2059-481X
Language: English
Submitted on: Nov 19, 2025 | Accepted on: Jan 23, 2026 | Published on: Feb 18, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Shellie Audsley, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.