Please use this identifier to cite or link to this item:
doi:10.22028/D291-47930 | Title: | Integrating language model embeddings into the ACT-R cognitive modeling framework |
| Author(s): | Meghdadi, Maryam Duff, John Demberg, Vera |
| Language: | English |
| Title: | Frontiers in Language Sciences |
| Volume: | 5 |
| Publisher/Platform: | Frontiers |
| Year of Publication: | 2026 |
| Free key words: | ACT-R associative priming cognitive modeling distributional semantics language models psycholinguistics |
| DDC notations: | 004 Computer science, internet 400 Language, linguistics |
| Publikation type: | Journal Article |
| Abstract: | In 2025, psycholinguistic research has the benefit of large, high-quality datasets of human behavior, and massively-scalable metrics for variables of interest like frequency and association. This means we have more data than ever before to shed light on classic language processing phenomena like associative priming. But in order to build and test rigorous theories against this data, we also need computational modeling tools that can simulate cognitive mechanisms and generate quantitative predictions at the same scale. In this paper, we assemble one such case, adapting the ACT-R cognitive modeling framework to make use of association metrics derived from language model embeddings, in service of a scalable model of associative priming in the Lexical Decision Task. ACT-R implements a model of memory retrieval that can use itemwise predictors like frequency and association to predict task response times (RTs), via interpretable and meaningfully-parameterized components like spreading activation. But currently, ACT-R’s spreading activation calculations rely on manually-coded similarity scores, which are labor-intensive andpronetoinaccuracies,particularly for large vocabularies. In this study, we replace these hand-coded associations with cosine similarity scores derived from Word2Vec and BERT embeddings, thereby improving both scalability and predictive accuracy while retaining ACT R’s interpretability. We compare various versions of our model against observed human RTs from the Semantic Priming Project dataset, observing impressive item-wise prediction accuracy, and achieving the strongest alignment with a model where spreading activation is penalized via a scalable approximation of the classic “fan effect.” These findings provide a proof of concept for integrating embedding-based representations into algorithmic-level models of language processing. More than an insight into models of priming, we see this as a first step toward scalable and specific models of more complex phenomena. |
| DOI of the first publication: | 10.3389/flang.2026.1721326 |
| URL of the first publication: | https://doi.org/10.3389/flang.2026.1721326 |
| Link to this record: | urn:nbn:de:bsz:291--ds-479301 hdl:20.500.11880/41920 http://dx.doi.org/10.22028/D291-47930 |
| ISSN: | 2813-4605 |
| Date of registration: | 28-May-2026 |
| Description of the related object: | Supplementary material |
| Related object: | https://public-pages-files-2025.frontiersin.org/articles/1721326/file/Data_Sheet_1.pdf/1721326_data-sheet_1/1 |
| Faculty: | MI - Fakultät für Mathematik und Informatik P - Philosophische Fakultät |
| Department: | MI - Informatik P - Sprachwissenschaft und Sprachtechnologie |
| Professorship: | MI - Prof. Dr. Vera Demberg P - Keiner Professur zugeordnet |
| Collections: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Files for this record:
| File | Description | Size | Format | |
|---|---|---|---|---|
| flang-5-1721326.pdf | 640,92 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License

