Please use this identifier to cite or link to this item:
doi:10.22028/D291-48144 | Title: | Automated PROMISE V2 Scoring from PSMA PET/CT Reports Using Large Language Models: A Comparative Evaluation of Prompt Design and Model Performance |
| Author(s): | Speicher, Tilman Demirkol, Isa Ethem Blickle, Arne Bastian, Moritz B. Maus, Stephan Schaefer-Schuler, Andrea Bartholomä, Mark Burgard, Caroline Ezziddin, Samer Rosar, Florian |
| Language: | English |
| Title: | Current Oncology |
| Volume: | 33 |
| Issue: | 6 |
| Publisher/Platform: | MDPI |
| Year of Publication: | 2026 |
| Free key words: | PROMISE LLM large language model prostate cancer PSMA PET/CT |
| DDC notations: | 610 Medicine and health |
| Publikation type: | Journal Article |
| Abstract: | Large language models (LLMs) are increasingly explored for clinical use. However, the extent to which such models can reliably support physicians in reporting, staging, and the assessment of classification remains an active area of research. This study aimed to evaluate and compare multiple LLMs for automated PROMISE V2 classification for prostate cancer. A total of 126 unambiguous German-language PSMA PET/CT text reports were retrospectively analyzed, with reference standards established by expert consensus based on image interpretation and the original report text. Five LLMs (GPT-5.4, DeepSeek-V3.2, Claude Sonnet 4.6, Gemini 3 Flash and Grok 4) were assessed using two English-language prompting strategies of varying complexity. Agreement with the reference standard served as the primary endpoint. Performance varied in the short-prompt setting (36.5–79.4%) but improved consistently with the long prompt (74.6–86.5%), with Gemini 3 Flash achieving the highest agreement. Across PROMISE V2 subcategories, agreement rates were high (miT: 81.0–92.1%, miN: 92.9–96.0%, miM: 92.9–95.2%), despite inter-model differences. In conclusion, contemporary LLMs demonstrate promising performance in deriving PROMISE V2 scores from unambiguous original report texts, particularly when guided by detailed prompts. |
| DOI of the first publication: | 10.3390/curroncol33060349 |
| URL of the first publication: | https://doi.org/10.3390/curroncol33060349 |
| Link to this record: | urn:nbn:de:bsz:291--ds-481446 hdl:20.500.11880/42106 http://dx.doi.org/10.22028/D291-48144 |
| ISSN: | 1718-7729 |
| Date of registration: | 29-Jun-2026 |
| Description of the related object: | Supplementary Materials |
| Related object: | https://www.mdpi.com/article/10.3390/curroncol33060349/s1 |
| Faculty: | M - Medizinische Fakultät |
| Department: | M - Radiologie |
| Professorship: | M - Prof. Dr. Samer Ezziddin |
| Collections: | SciDok - Der Wissenschaftsserver der Universität des Saarlandes |
Files for this record:
| File | Description | Size | Format | |
|---|---|---|---|---|
| curroncol-33-00349-v3.pdf | 882,18 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License

