Chantal Pellegrini
About Me
I am a PhD candidate in Computer Science at the Technical University of Munich with Prof. Nassir Navab and previously interned at Apple, focusing on deep learning based verification of Siri Visual Responses. My research focuses on domain-specific vision–language models for medical environments, from fine-grained image understanding, collaborative radiology assistance, and operating room understanding to multimodal patient modeling and forecasting from Electronic Health Records.
- CV: Download (PDF)
- Location: Munich, Germany
- Email: chantal.pellegrini(at)tum.de
Experience
Apple — AI/ML Intern
06/2025 – 09/2025
New York City, USA
- Development of AI-based error detection for Siri Visual responses
- Investigated both zero-shot use of foundation models as well as fine-tuning based approaches
- Deployed final tool as web server and integrated in CI/CD pipeline for automated checks
Ph.D. Candidate
11/2022 – tentatively 11/2026
Technical University of Munich, Munich, Germany
- Multimodal Medical Foundation Models: Worked on large vision-language models combining visual information, structured representations, and language models for interactive conversation and reporting in radiology and operating room (OR) understanding (Best Paper Runner-Up at MICCAI 2024).
- Reinforcement Learning for LLMs: Reinforcement learning for calibrated confidence expression in large language models in factual question answering and for sequential clinical decision making, learning in LLM-simulated clinical environments.
- Longitudinal Sequence Modeling with LLMs: Learning from multi-modal patient information for modeling and forecasting patient pathways during hospital stays.
- Medical Image Understanding: Developed methods for fine-grained and interpretable medical image understanding, with a focus on explainable zero-shot X-ray diagnosis and structured radiology reporting via hierarchical visual question answering.
- Teaching and supervision: Teaching courses in deep learning for medical imaging and supervising bachelor’s and master’s theses as well as practical courses.
Student Researcher
02/2020 – 08/2020
Itestra GmbH, Munich, Germany
- Analysis of the usefulness of source code comments using classical ML and deep learning-based NLP
- Development of a Visual Studio Code plugin to show analysed comment usefulness while coding
Software Developer
02/2019 – 01/2020
Qualicen GmbH, Munich, Germany
- Backend development for automated requirements analysis
- Development of an automated testing framework for the company’s software
Featured Publications
Rewarding Doubt: A Reinforcement Learning Approach to Confidence Calibration of Large Language Models
Accepted at ICLR 2026
From EHRs to Patient Pathways: Scalable Modeling of Longitudinal Health Trajectories with LLMs
under review
RaDialog: Large Vision-Language Models for X-Ray Reporting and Dialog-Driven Assistance
Medical Imaging with Deep Learning (MIDL), 2025
Oracle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling
MICCAI 2024 (Best Paper Runner-Up) · arXiv:2404.07031
Xplainer: From X-Ray Observations to Explainable Zero-Shot Diagnosis
MICCAI 2023 · arXiv:2303.13391
Rad-ReStruct: A Novel VQA Benchmark and Method for Structured Radiology Reporting
MICCAI 2023 · arXiv:2307.05766
Unsupervised Pre-training of Graph Transformers on Patient Population Graphs
Medical Image Analysis 89, 2023 · arXiv:2207.10603
Additional Publications
- D. Bani-Harouni, C. Pellegrini, E. Özsoy, M. Keicher, N. Navab. Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning. Accepted at ICLR 2026. Paper
- E. Özsoy, F. Holm, C. Pellegrini, T. Czempiel, M. Saleh, N. Navab, B. Busam. Location-free Scene Graph Generation. CVPR Workshops, 2025. Paper
- E. Özsoy, C. Pellegrini, T. Czempiel, F. Tristram, K. Yuan, D. Bani-Harouni, et al. MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments. CVPR, 2025. Paper
- E. Özsoy, C. Pellegrini, D. Bani-Harouni, K. Yuan, M. Keicher, N. Navab. ORQA: A Benchmark and Foundation Model for Holistic Operating Room Modeling. under review, 2025. Paper
- E. Özsoy, T. Czempiel, F. Holm, C. Pellegrini, N. Navab. Labrad-or: Lightweight Memory Scene Graphs for Accurate Bimodal Reasoning in Dynamic Operating Rooms. MICCAI, 2023. Paper
- I. Ezhov, T. Mot, S. Shit, J. Lipkova, J.C. Paetzold, F. Kofler, C. Pellegrini, et al. Geometry-aware Neural Solver for Fast Bayesian Calibration of Brain Tumor Models. IEEE Transactions on Medical Imaging 41(5), 2021. Paper
- C. Pellegrini, E. Özsoy, M. Wintergerst, G. Groh. Exploiting Food Embeddings for Ingredient Substitution. HEALTHINF, 2021. Paper
Education
Ph.D. Candidate in Computer Science
11/2022 – tentatively 11/2026
Technical University of Munich
- Focus: Vision-Language Models for Medical Image Understanding
- Advisor: Prof. Nassir Navab
M.Sc. in Computer Science
04/2020 – 07/2022
Technical University of Munich — Grade: 1.2 (top 10 %)
Exchange Semester
08/2018 – 12/2018
National University of Singapore
B.Sc. in Computer Science
10/2016 – 03/2020
Technical University of Munich — Grade: 1.5 (top 10 %)
studium MINT (Certificate in STEM subjects)
04/2016 – 09/2016
Technical University of Munich
German Abitur
2007 – 2015
Max-Planck High School Munich