Validating AI-Based Detection of I-PASS Elements in Verbal Handoffs
A Proof-of-Concept Study
Joshua Pankin, MD 1 Vishal Pallerla 2 Samantha Pendleton, DO 1 Hannah Miller, MD 1 Don Woodlock 2 Jonathan Teich, MD 2 Qi Li, MD 2 Christopher Landrigan, MD, MPH 3 Molly Senn-McNally, MD 1 Amy Starmer, MD, MPH 1
- 94.8%
- 77
- 385
- 100%
We built and validated a Whisper + GPT pipeline that automatically detects each of the five I-PASS handoff elements in verbal pediatric handoffs. Across 77 simulated cases (385 element-level assessments), the system reached 94.8% overall accuracy against a 10-reviewer consensus standard. 80% of errors were false negatives, the safer failure mode for educational feedback, since the tool defers rather than falsely confirms adherence. Action-List detection was perfect (100%); receiver-synthesis was the lowest performer (87%), driven by ASR errors and overlapping speech.