Direkt zum Inhalt


Aktuelles: Demo Paper auf der ECIR 2026

von Gregor Donabauer, Samy Ateia, Udo Kruschwitz und Kollegen aus der Urologie

30. März 2026, von Melanie A. Kilian

  • Informatik und Data Science
  • Forschung
  • Publikation

LLMs are powerful. But are their results reproducible enough for clinical decision support? 🤖🩺


Our ECIR 2026 demo paper, “MedNuggetizer: Confidence-Based Information Nugget Extraction from Medical Documents”, addresses this question.


🧗 The challenge

Large language models open up powerful opportunities for automated evidence synthesis in medicine but they also introduce a critical issue: reproducibility.

The same prompt can produce different outputs across runs due to stochastic sampling and model variance. In clinical decisions, this instability is risky.


💡 Our solution: MedNuggetizer

MedNuggetizer helps clinicians explore reliable medical evidence from multiple long documents.
It is a query-driven tool that repeatedly extracts and clusters “information nuggets” from medical documents, estimating confidence through sampling and aggregation.

Instead of relying on a single LLM output, the system:

  • Performs repeated nugget extraction
  • Clusters results across and within documents
  • Highlights confidence to support transparent, reproducible evidence exploration


🩺 Real-world evaluation in urology

Together with our clinical partners at
🏥 Caritas-Krankenhaus St. Josef and
🏥 Barmherzige Brüder Klinikum Straubing,

we evaluated MedNuggetizer on the debated topic of antibiotic prophylaxis before prostate biopsy.

Using four major guidelines (EAU & AWMF, 2024–2025) and ten recent PubMed-indexed studies (systematic reviews and randomized controlled trials), two urologists manually assessed:

  • 155 clusters (coherence/consistency)
  • 406 information nuggets (query relevance)


Results show:
✅ High relevance of extracted nuggets
✅ Meaningful clustering into multiple information layers (context, current evidence, recommendations, limitations)
⚠️ Areas for improvement in the nuggets (undefined abbreviations, contextualization, partial cluster overlap)


👥 Team effort 🧑‍💻🤝🩺

Technical lead: Gregor Donabauer, Samy Ateia, and Udo Kruschwitz (Information Science Regensburg)

Clinical application: our colleagues in urology in Caritas-Krankenhaus St. Josef and Barmherzige Brüder Klinikum Straubing: Maximilian Burger, Matthias May, Christian Gilfrich, Maximilian Haas, Julio Rubén Rodas Garzaro, and Christoph Eckl

 

To support reproducibility, you can find the app’s source code and evaluation data on GitHub.


📄 Pre-print: https://arxiv.org/abs/2512.15384 (externer Link, öffnet neues Fenster)

💻 GitHub repository: https://github.com/SamyAteia/mednuggetizer-ecir2026 (externer Link, öffnet neues Fenster)


We are excited to demo MedNuggetizer at ECIR 2026 in Delft! Come chat with us about how confidence-based nugget extraction can strengthen trustworthy evidence synthesis in medicine. 😊🇳🇱

 

#ProfessionalSearch #MedicalSearch
#Urology #ProstateBiopsy
#InformationExtraction #InformationNuggets
#LargeLanguageModels #LLMs #ClinicalAI
#AIResearch #PhDResearch
#MedicalInformatics #HealthTech
#Reproducibility #OpenScience
#InformationRetrieval #IR
#NaturalLanguageProcessing #NLP
#EuropeanConferenceOnInformationRetrieval #ECIR #ECIR26 #ECIR2026
#ResearchSuccess #ResearchPaperAccepted
#InformationScienceRegensburg #StayInformed
 

nach oben